Naive Bayes Spam Classifier
The objective of this project is to compare the performance of two popular Naive Bayes spam classifiers.
The Ling-Spam corpus is used for training the models.
All the mails in the bare subdirectory were preprocessed using the process.py script and stored in another directory.
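The batch step described above (read every mail under one directory, transform it, write the result to a separate directory) can be sketched as follows. This is a minimal illustration, not the contents of process.py: the function names `preprocess_tree` and `normalize`, the placeholder normalization (lowercasing and whitespace collapsing), and the latin-1 encoding are all assumptions made for the example.

```python
import os


def normalize(text):
    # Placeholder transform (an assumption, not the project's actual steps):
    # lowercase the text and collapse runs of whitespace into single spaces.
    return " ".join(text.lower().split())


def preprocess_tree(src_dir, dst_dir, transform=normalize):
    """Apply `transform` to every file under src_dir and write the result
    to the same relative path under dst_dir. Returns the number of files."""
    count = 0
    for root, _dirs, files in os.walk(src_dir):
        # Mirror the source layout inside the destination directory.
        rel = os.path.relpath(root, src_dir)
        out_root = os.path.normpath(os.path.join(dst_dir, rel))
        os.makedirs(out_root, exist_ok=True)
        for name in files:
            # latin-1 is assumed here because it never fails to decode bytes.
            with open(os.path.join(root, name), encoding="latin-1") as fh:
                text = fh.read()
            with open(os.path.join(out_root, name), "w", encoding="latin-1") as fh:
                fh.write(transform(text))
            count += 1
    return count
```

Writing to a separate directory keeps the original corpus untouched, so the preprocessing can be rerun with a different `transform` without downloading the data again.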
The following email preprocessing and normalization steps were carried out in the given order:
Word stemming used the Porter stemming algorithm, in a Python port of the ANSI C version written by the algorithm's author.
Copyright (c) 2017 Garvit Aggarwal