Natural language processing

Pre-processing data: tokenization, stemming, and removal of stop words

Pre-processing data: tokenization, stemming, and removal of stop words (compressed code)

POS (Parts of Speech) tagging – labelling words as nouns, verbs, adjectives, etc.

Using free text for classification – ‘Bag of Words’

Topic modelling (dividing documents into topic groups) with Gensim

Converting text to numbers

TensorFlow text-based classification – from raw text to prediction