Blog #17

 5 Dec 2017

During my MSc a few years ago, while specialising in machine learning, sentiment analysis and Bayes' theorem, I encountered a technique for improving a computer's understanding of human language: part-of-speech (POS) tagging.

What is POS Tagging?

POS tagging is the process of assigning a grammatical category, or 'tag' (in the form of an abbreviated code), to each word (token) in a given sentence.
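To make the idea concrete, here is a minimal sketch of tagging by dictionary lookup. The lexicon, the function name `tag_sentence` and the default tag are all made up for illustration; the tag codes themselves (DT, NN, VBD, IN) come from the Penn Treebank tag set. Real taggers are trained on annotated text and use context to resolve ambiguous words, so treat this as a toy rather than a working tagger.

```python
# Toy lexicon (hypothetical): each known word maps to one Penn Treebank tag.
TOY_LEXICON = {
    "the": "DT",   # determiner
    "cat": "NN",   # noun, singular
    "sat": "VBD",  # verb, past tense
    "on": "IN",    # preposition
    "mat": "NN",   # noun, singular
}


def tag_sentence(sentence, lexicon=TOY_LEXICON, default_tag="NN"):
    """Split on whitespace and pair each token with a tag from the lexicon.

    Words missing from the lexicon get default_tag, which is a crude
    assumption (unknown words are guessed to be nouns).
    """
    tokens = sentence.split()
    return [(token, lexicon.get(token.lower(), default_tag)) for token in tokens]


if __name__ == "__main__":
    for token, tag in tag_sentence("The cat sat on the mat"):
        print(token, tag)
```

Each token comes back paired with its code, which is the shape of output most POS taggers produce.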

In English, for example, common POS categories are:

  • nouns
  • verbs
  • adjectives
  • adverbs
  • pronouns
  • prepositions
  • conjunctions
  • interjections

Finer-grained categories can be derived from different forms of the above; for example, a verb can be in its base form or in the past tense.
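As a rough illustration of how fine-grained tags relate to the broad categories, the sketch below maps a handful of Penn Treebank verb and noun codes back to their parent category. The dictionary covers only a subset of the tag set, and the names `FINE_TO_COARSE` and `coarse_category` are invented for this example.

```python
# Subset of Penn Treebank tags grouped under their broad category.
FINE_TO_COARSE = {
    "VB": "verb",    # base form, e.g. "walk"
    "VBD": "verb",   # past tense, e.g. "walked"
    "VBG": "verb",   # gerund or present participle, e.g. "walking"
    "VBN": "verb",   # past participle, e.g. "walked" in "has walked"
    "VBZ": "verb",   # third person singular present, e.g. "walks"
    "NN": "noun",    # singular noun, e.g. "cat"
    "NNS": "noun",   # plural noun, e.g. "cats"
}


def coarse_category(tag):
    """Return the broad category for a fine-grained tag, or 'other' if unmapped."""
    return FINE_TO_COARSE.get(tag, "other")


if __name__ == "__main__":
    # Base form and past tense are different tags but the same broad category.
    print(coarse_category("VB"), coarse_category("VBD"))
```

Collapsing tags this way can be handy when an analysis only cares whether a word is a verb, not which form it takes.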

Penn Treebank POS Tags

For the purposes of this blog post I have focussed on the Penn Treebank POS tag set. Two of its tags are shown in the table below:

Tag     Description
CC      Coordinating conjunction
PRP$    Possessive pronoun

If you'd like to find out how to use POS tagging, download our Sentiment Analysis Whitepaper.

Also included in the Whitepaper:

  • Bayes' Theorem & Its Connection To Sentiment Analysis
  • The Difficulties Of Sentiment Analysis & The Solutions
  • How To Train Your Classifier When Performing Sentiment Analysis

