Blog #17

 5 Dec 2017

During my MSc a few years ago, whilst specialising in machine learning, sentiment analysis and Bayes' theorem, I encountered a technique that I could use to improve a computer's understanding of human language: part-of-speech (POS) tagging.

What is POS Tagging?

POS tagging is the process of assigning a ‘tag/category’ (in the form of an abbreviated code) to each word (token) in a given sentence.
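To make the idea concrete, here is a minimal sketch of tagging in Python. It is not how a production tagger works: the lexicon (TOY_LEXICON), the function name tag_sentence and the fallback to a noun tag for unknown words are all assumptions made up for this illustration. The codes themselves (DT, NN, VBD, IN) are Penn Treebank tags.

```python
# Hypothetical, hand-written lexicon used only for this illustration.
TOY_LEXICON = {
    "the": "DT",   # determiner
    "cat": "NN",   # noun, singular
    "sat": "VBD",  # verb, past tense
    "on": "IN",    # preposition
    "mat": "NN",   # noun, singular
}


def tag_sentence(sentence, lexicon=TOY_LEXICON, default_tag="NN"):
    """Return (token, tag) pairs for a whitespace-tokenised sentence.

    Words missing from the lexicon fall back to default_tag, which is a
    simplifying assumption rather than a real disambiguation strategy.
    """
    tokens = sentence.lower().split()
    return [(token, lexicon.get(token, default_tag)) for token in tokens]


tagged = tag_sentence("The cat sat on the mat")
print(tagged)
```

A real tagger would use the surrounding words to choose between competing tags for an ambiguous word; the lookup above only shows what the input and output look like.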

In the English language, for example, common POS categories are:

  • nouns
  • verbs
  • adjectives
  • adverbs
  • pronouns
  • prepositions
  • conjunctions
  • interjections

Other categories can be derived from different forms of the above; for example, a verb can be in its base form or in the past tense.
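The relationship between the broad categories and their derived forms can be sketched as a simple lookup. The example below uses a small subset of Penn Treebank codes (the tag set this post goes on to use); the dictionary name COARSE_CATEGORY, the helper coarse_category and the "other" fallback are assumptions for illustration only.

```python
# A small subset of Penn Treebank tags, each mapped to the broad
# category it refines. This is not the full tag set.
COARSE_CATEGORY = {
    "VB": "verb",        # verb, base form
    "VBD": "verb",       # verb, past tense
    "VBG": "verb",       # verb, gerund or present participle
    "NN": "noun",        # noun, singular or mass
    "NNS": "noun",       # noun, plural
    "JJ": "adjective",   # adjective
    "JJR": "adjective",  # adjective, comparative
    "PRP": "pronoun",    # personal pronoun
    "PRP$": "pronoun",   # possessive pronoun
    "CC": "conjunction", # coordinating conjunction
}


def coarse_category(tag):
    """Map a fine-grained tag to its broad category, or 'other' if unlisted."""
    return COARSE_CATEGORY.get(tag, "other")


tagged = [("walk", "VB"), ("walked", "VBD"), ("dogs", "NNS")]
for word, tag in tagged:
    print(word, tag, coarse_category(tag))
```

Here "walk" and "walked" carry different tags (VB and VBD) but both resolve to the same broad category, verb.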

Penn Treebank POS Tags

For the purposes of this blog post I have focussed on the Penn Treebank POS tag set. You can see the entire list of these POS tags in the table below:

Tag    Description                 Tag    Description
CC     Coordinating conjunction    PRP$   Possessive pronoun

 

If you'd like to find out how to use POS tagging, download our Sentiment Analysis Whitepaper.


Also included in the Whitepaper:

  • Bayes' Theorem & Its Connection To Sentiment Analysis
  • The Difficulties Of Sentiment Analysis & The Solutions
  • How To Train Your Classifier When Performing Sentiment Analysis
