๐ Website Classification App
This is a Machine Learning Website Classification application. It classifies websites into categories based on their content. The project includes data preprocessing, model training, testing, and deployment using Streamlit for a web interface.
โจ Features
โ End-to-end ML pipeline: EDA โ Preprocessing โ Training โ Testing โ Deployment
๐ Interactive web app with Streamlit
โก Real-time website category prediction
๐ฆ Supports .pkl model and label encoder files
๐ง How the Model Works- The model uses TfidfVectorizer to convert website content into numerical features, then a Support Vector Machine (SVM) classifier predicts the website category. User input is preprocessed before prediction.
๐ How to Run
Clone the repository:
git clone https://github.com/Sananda-Dutta/website-category-classifier cd website-classification
Install dependencies:
pip install -r requirements.txt
Run the Streamlit app:
streamlit run app.py
Enter the website input in the app and see the predicted category.
๐ Project Files
File Description
app.py Streamlit application
website_classifier.pkl Trained ML model
label_encoder.pkl Label encoder for categories
requirements.txt Python dependencies
๐ Technologies Used
Python
scikit-learn
pandas, numpy
Streamlit
โ๏ธ Author
Sananda Dutta