Pre-processing data: tokenization, stemming, and removal of stop words
Pre-processing data: tokenization, stemming, and removal of stop words (compressed code)
POS (Parts of Speech) tagging – labelling words as nouns, verbs, adjectives, etc.
Using free text for classification – ‘Bag of Words’
Topic modelling (dividing documents into topic groups) with Gensim
TensorFlow text-based classification – from raw text to prediction