Case Study 11
Spam Classifier
TF-IDF + Multinomial Naive Bayes text classifier
2025
PythonScikit-learnNLTKNumPyMatplotlib
Built a spam classifier with an NLP preprocessing pipeline (tokenization, stop-word removal) feeding TF-IDF vectorization into a Multinomial Naive Bayes model.
What I did
3- 01
Built a spam classifier with an NLP preprocessing pipeline (tokenization, stop-word removal) feeding TF-IDF vectorization into a Multinomial Naive Bayes model.
- 02
Evaluated rigorously with precision, recall, F1, and a confusion matrix to control false positives on imbalanced spam-vs-ham data.
- 03
Surfaced the most informative tokens driving each prediction to make the model's behavior interpretable.
Tech stack
PythonScikit-learnNLTKNumPyMatplotlib
More projects