Case Study 11

Spam Classifier

TF-IDF + Multinomial Naive Bayes text classifier

2026

PythonScikit-learnNLTKNumPyMatplotlib

Key impact

Built a spam classifier with an NLP preprocessing pipeline (tokenization, stop-word removal) feeding TF-IDF vectorization into a Multinomial Naive Bayes model.

Representative mockup

What I did

01
Built a spam classifier with an NLP preprocessing pipeline (tokenization, stop-word removal) feeding TF-IDF vectorization into a Multinomial Naive Bayes model.
02
Evaluated rigorously with precision, recall, F1, and a confusion matrix to control false positives on imbalanced spam-vs-ham data.
03
Surfaced the most informative tokens driving each prediction to make the model's behavior interpretable.

Tech stack

PythonScikit-learnNLTKNumPyMatplotlib

More projects

← All projects