This repository contains the code and data for the Salary Prediction Project. The project aims to predict individuals' salaries based on their age, years of experience, and education level.
-
Upload the Data:
- Upload the
data.csvfile to your Google Drive. You can find the dataset here : https://www.kaggle.com/datasets/rkiattisak/salaly-prediction-for-beginer - Make sure the data file is renamed : "data.csv" and placed in "content" named directory of your Google Drive.
- Upload the
-
Set up the Environment:
- Open the notebook in Google Colab or Jupyter Notebook.
- Mount your Google Drive in the notebook by executing the following code snippet: python from google.colab import drive drive.mount('/content/drive')
-
Access the Data, Data Preprocessing, Feature Engineering, Model Training and Evaluation :
- Run each section of the notebook to reproduce and visualize the results.
- Python 3.x
- Required libraries: scikit-learn, pandas, numpy, matplotlib, seaborn, statsmodels.
- Clone the repository: "git clone https://github.com/anhadkwatra55/CIND820.git"