Language Detection Using Naive Bayes

A text-classification pipeline that uses TF-IDF features and a Multinomial Naive Bayes classifier to predict the language of an input sentence.

Python 3, pandas, NumPy, scikit-learn, seaborn, matplotlib

Project Overview

The notebook loads the “Language Detection.csv” dataset into pandas (10,337 entries with “Text” and “Language” columns), then splits it into training and test sets. It vectorizes the text with TfidfVectorizer, fits a MultinomialNB model on the resulting features, and evaluates performance on the test set via accuracy score, confusion matrix, and classification report.
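
The steps above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the tiny in-memory DataFrame is a hypothetical stand-in for the CSV so the example runs on its own, and the split ratio, random seed, and default vectorizer settings are assumptions.

```python
import pandas as pd
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import MultinomialNB

# Hypothetical stand-in data; the notebook would instead call
# pd.read_csv("Language Detection.csv"), which has the same two columns.
df = pd.DataFrame({
    "Text": [
        "the weather is nice today", "where is the train station",
        "i would like a cup of tea", "this book is very interesting",
        "le temps est très beau aujourd'hui", "où se trouve la gare",
        "je voudrais une tasse de thé", "ce livre est très intéressant",
        "el tiempo es muy bueno hoy", "dónde está la estación de tren",
        "me gustaría una taza de té", "este libro es muy interesante",
    ],
    "Language": ["English"] * 4 + ["French"] * 4 + ["Spanish"] * 4,
})

# Hold out part of the data for evaluation (test_size and seed are assumed).
X_train, X_test, y_train, y_test = train_test_split(
    df["Text"], df["Language"],
    test_size=0.25, random_state=0, stratify=df["Language"],
)

# Fit TF-IDF on the training text only, then reuse the same vocabulary
# to transform the test text.
vectorizer = TfidfVectorizer()
X_train_vec = vectorizer.fit_transform(X_train)
X_test_vec = vectorizer.transform(X_test)

# Train the Multinomial Naive Bayes classifier and predict on the test set.
clf = MultinomialNB()
clf.fit(X_train_vec, y_train)
y_pred = clf.predict(X_test_vec)

# The three evaluation outputs named in the overview.
accuracy = accuracy_score(y_test, y_pred)
cm = confusion_matrix(y_test, y_pred, labels=clf.classes_)
report = classification_report(y_test, y_pred, zero_division=0)

print(f"Accuracy: {accuracy:.3f}")
print(cm)
print(report)
```

With only a dozen sentences the scores here say little; the point is the order of operations. Fitting the vectorizer on the training split alone keeps test-set vocabulary from leaking into the features.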

Project Files

HTML File

Project Details

  • Completion Date: May 2024
  • Category: Machine Learning
  • Project Type: HTML File
