Placement-Prediction-Data-Science-Model

This project predicts student placement outcomes from academic and profile data using a Random Forest classifier. The script covers the full workflow: preprocessing, label encoding, training, and model persistence with joblib.

Files and Structure

placement.csv # Original dataset
input.csv # Automatically generated test data (from StratifiedShuffleSplit)
output.csv # Output file with predictions
model.pkl # Trained Random Forest model
preprocessing_pipeline.pkl # Preprocessing pipeline (numerical + categorical)
label_encoder.pkl # Encoder to convert labels to/from numeric

How the Model Works

Loads placement.csv as the dataset (the CSV must be downloaded from Kaggle).
Performs a StratifiedShuffleSplit to ensure class distribution is preserved.
Splits the data into training and test sets.
Stores the test set as input.csv (used later for inference).
Encodes the target labels using LabelEncoder.
Separates numerical and categorical columns.
Constructs a preprocessing pipeline using:
- SimpleImputer and StandardScaler for numerical data.
- OneHotEncoder for categorical data.
Trains a RandomForestClassifier on the transformed training data.
Saves the trained model, preprocessing pipeline, and label encoder using joblib.
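
The training steps above can be sketched roughly as follows. This is a minimal illustration, not the code in model.py: the column names (`cgpa`, `internships`, `branch`, `status`), the synthetic data standing in for placement.csv, the median imputation strategy, and the temporary output directory are all assumptions made so the example runs on its own.

```python
import os
import tempfile

import joblib
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import RandomForestClassifier
from sklearn.impute import SimpleImputer
from sklearn.model_selection import StratifiedShuffleSplit
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import LabelEncoder, OneHotEncoder, StandardScaler

# Hypothetical stand-in for placement.csv; the real columns may differ.
rng = np.random.default_rng(0)
n = 60
df = pd.DataFrame({
    "cgpa": rng.uniform(5.0, 10.0, n).round(2),
    "internships": rng.integers(0, 4, n).astype(float),
    "branch": rng.choice(["CSE", "ECE", "ME"], n),
    "status": np.where(np.arange(n) % 3 == 0, "Not Placed", "Placed"),
})
df.loc[0, "cgpa"] = np.nan  # one missing value for SimpleImputer to handle

TARGET = "status"  # assumed name of the target column

# Single stratified split that preserves the Placed / Not Placed ratio.
splitter = StratifiedShuffleSplit(n_splits=1, test_size=0.2, random_state=42)
train_idx, test_idx = next(splitter.split(df, df[TARGET]))
train_df, test_df = df.iloc[train_idx], df.iloc[test_idx]

# Encode the text labels as integers.
label_encoder = LabelEncoder()
y_train = label_encoder.fit_transform(train_df[TARGET])
X_train = train_df.drop(columns=[TARGET])

# Separate numerical and categorical feature columns.
num_cols = X_train.select_dtypes(include="number").columns.tolist()
cat_cols = X_train.select_dtypes(exclude="number").columns.tolist()

# Impute and scale numerical columns; one-hot encode categorical ones.
num_pipeline = Pipeline([
    ("imputer", SimpleImputer(strategy="median")),
    ("scaler", StandardScaler()),
])
preprocessing = ColumnTransformer([
    ("num", num_pipeline, num_cols),
    ("cat", OneHotEncoder(handle_unknown="ignore"), cat_cols),
])

# Fit the pipeline on the training data only, then train the forest.
X_train_prepared = preprocessing.fit_transform(X_train)
model = RandomForestClassifier(n_estimators=100, random_state=42)
model.fit(X_train_prepared, y_train)

# Persist the artifacts (written to a temp directory in this sketch).
out_dir = tempfile.mkdtemp()
test_df.to_csv(os.path.join(out_dir, "input.csv"), index=False)
joblib.dump(model, os.path.join(out_dir, "model.pkl"))
joblib.dump(preprocessing, os.path.join(out_dir, "preprocessing_pipeline.pkl"))
joblib.dump(label_encoder, os.path.join(out_dir, "label_encoder.pkl"))
```

Fitting the preprocessing pipeline on the training split only keeps test-set statistics out of the imputer and scaler. `handle_unknown="ignore"` is a defensive choice here so that a category seen only at inference time does not raise an error.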

On rerunning the script:

If model files exist, the script loads the model and pipeline.
Reads input.csv, applies the pipeline, and predicts outcomes.
Saves the result to output.csv with an additional column Placement_Prediction.
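
The inference path might look something like the sketch below. The function name `predict_from_files`, the toy training data, and the temporary working directory are hypothetical and exist only so the example is self-contained. Decoding the predictions back to text labels with the saved encoder is also an assumption about how Placement_Prediction is filled in.

```python
import os
import tempfile

import joblib
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import RandomForestClassifier
from sklearn.preprocessing import LabelEncoder, OneHotEncoder, StandardScaler


def predict_from_files(work_dir):
    """Load the saved artifacts, score input.csv, and write output.csv."""
    model = joblib.load(os.path.join(work_dir, "model.pkl"))
    pipeline = joblib.load(os.path.join(work_dir, "preprocessing_pipeline.pkl"))
    encoder = joblib.load(os.path.join(work_dir, "label_encoder.pkl"))

    data = pd.read_csv(os.path.join(work_dir, "input.csv"))
    predictions = model.predict(pipeline.transform(data))
    # Assumption: numeric classes are mapped back to the original text labels.
    data["Placement_Prediction"] = encoder.inverse_transform(predictions)
    data.to_csv(os.path.join(work_dir, "output.csv"), index=False)
    return data


# Tiny hypothetical artifacts so the sketch can run without placement.csv.
work_dir = tempfile.mkdtemp()
train = pd.DataFrame({
    "cgpa": [9.1, 8.4, 6.0, 5.5, 8.8, 6.2],
    "branch": ["CSE", "ECE", "ME", "ME", "CSE", "ECE"],
    "status": ["Placed", "Placed", "Not Placed",
               "Not Placed", "Placed", "Not Placed"],
})
features = train.drop(columns=["status"])

encoder = LabelEncoder().fit(train["status"])
pipeline = ColumnTransformer([
    ("num", StandardScaler(), ["cgpa"]),
    ("cat", OneHotEncoder(handle_unknown="ignore"), ["branch"]),
])
model = RandomForestClassifier(n_estimators=20, random_state=0)
model.fit(pipeline.fit_transform(features), encoder.transform(train["status"]))

joblib.dump(model, os.path.join(work_dir, "model.pkl"))
joblib.dump(pipeline, os.path.join(work_dir, "preprocessing_pipeline.pkl"))
joblib.dump(encoder, os.path.join(work_dir, "label_encoder.pkl"))
features.to_csv(os.path.join(work_dir, "input.csv"), index=False)

# Inference: no retraining, only load, transform, and predict.
result = predict_from_files(work_dir)
print(result)
```

Only `transform` is called at inference time, never `fit_transform`, so the imputation, scaling, and encoding match what the model saw during training.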

Requirements

pandas
numpy
scikit-learn
joblib

Highlights

Stratified sampling keeps the class proportions the same in the training and test sets.
Full pipeline for preprocessing both numerical and categorical features.
Label encoding gives a reversible mapping between text labels and numeric classes.
Model persistence with joblib allows inference without retraining.
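
As a quick illustration of the stratified sampling point, the sketch below checks the share of one class before and after a StratifiedShuffleSplit. The 70/30 label counts are made up for the example and are not taken from placement.csv.

```python
import numpy as np
from sklearn.model_selection import StratifiedShuffleSplit

# Synthetic labels: 70 "Placed" and 30 "Not Placed" (made-up numbers).
y = np.array(["Placed"] * 70 + ["Not Placed"] * 30)
X = np.zeros((len(y), 1))  # feature values do not affect the split itself

splitter = StratifiedShuffleSplit(n_splits=1, test_size=0.2, random_state=42)
train_idx, test_idx = next(splitter.split(X, y))


def placed_share(labels):
    """Fraction of samples labelled 'Placed'."""
    return float(np.mean(labels == "Placed"))


overall = placed_share(y)
train_share = placed_share(y[train_idx])
test_share = placed_share(y[test_idx])
print(f"overall={overall:.2f} train={train_share:.2f} test={test_share:.2f}")
```

Stratification preserves the existing class ratio in each split; it does not rebalance an imbalanced dataset.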

Kritank07/Placement-Prediction-Data-Science-Model
