An NLP project that compares different approaches to document representation and classification. The techniques used include Topic-modeling, Tf-Idf, doc2vec, SVM, and CNN