GitHub - amaheshwari01/Machine-learning.github.io: AIML projects

Sales Prediction Capstone Project

To view all the code and files click here

Problem Statement

Supermarkets have trouble predicting which products will sell and which will not, so they waste a lot of money trying and testing products.

Data Import and Wrangling

This project required a lot of data wrangling and cleaning. The same value was often written in several different ways; for example, low fat was also written as LF, which we had to replace (compare the fat section in the graphs before and after data wrangling). There were also many missing values, which we had to impute.
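The two cleaning steps described above can be sketched as follows. This is a minimal Python illustration, not the project's R code: the alias table (including the `reg` spelling), the sample values, and the choice of mean imputation are all assumptions made for the example.

```python
from statistics import mean

# Hypothetical spelling variants; the actual ones in Dataset.csv may differ.
FAT_ALIASES = {
    "lf": "Low Fat",
    "low fat": "Low Fat",
    "reg": "Regular",
    "regular": "Regular",
}

def normalize_fat(label):
    """Map inconsistent spellings of a fat-content label onto one canonical value."""
    return FAT_ALIASES.get(label.strip().lower(), label)

def impute_mean(values):
    """Replace missing entries (None) with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    fill = mean(observed)
    return [fill if v is None else v for v in values]

# Made-up sample data for illustration only.
labels = ["Low Fat", "LF", "low fat", "Regular", "reg"]
weights = [9.3, None, 17.5, None, 8.9]

clean_labels = [normalize_fat(x) for x in labels]
clean_weights = impute_mean(weights)
```

Mean imputation is only one option; a median or a per-item average might suit skewed columns better.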

Methodology

I used a regression model to predict the sales for each item. I also sorted the data and removed columns that had no effect on the model, created new, simpler features from the existing data, and used one hot encoding and label encoding to make my model more efficient.
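To show the difference between the two encodings mentioned above, here is a small Python sketch. The helper names and the sample column are hypothetical and are not taken from the project's R script.

```python
def label_encode(values):
    """Assign each distinct category an integer code, in sorted order."""
    codes = {cat: i for i, cat in enumerate(sorted(set(values)))}
    return [codes[v] for v in values], codes

def one_hot_encode(values):
    """Return one 0/1 indicator column per distinct category."""
    categories = sorted(set(values))
    return {cat: [1 if v == cat else 0 for v in values] for cat in categories}

# Made-up sample column for illustration only.
fat = ["Low Fat", "Regular", "Low Fat"]

encoded, mapping = label_encode(fat)
indicators = one_hot_encode(fat)
```

Label encoding gives a single integer column, which implies an ordering between categories. One hot encoding avoids that implied order at the cost of one extra column per category.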

Algorithms Used

To create the model I used linear regression (LM).
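As a rough idea of what fitting a linear model involves, the Python sketch below fits a least-squares line with a single predictor. It is a simplified illustration with made-up numbers; the project's model was built in R and presumably uses several features.

```python
from statistics import mean

def fit_line(x, y):
    """Ordinary least squares for one predictor: returns (intercept, slope)."""
    x_bar, y_bar = mean(x), mean(y)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope

def predict(intercept, slope, x_new):
    """Predicted value for a new observation."""
    return intercept + slope * x_new

# Made-up data that lies exactly on the line y = 1 + 2x.
prices = [1.0, 2.0, 3.0, 4.0]
sales = [3.0, 5.0, 7.0, 9.0]

intercept, slope = fit_line(prices, sales)
forecast = predict(intercept, slope, 5.0)
```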

Challenges

- Data wrangling
  - Had many NAs and missing values
  - Had data which didn't make sense
  - Removing data which didn't make sense
- One hot encoding and label encoding
- Deciding which features to drop or keep

Significance

This project helps supermarkets decide which products will sell and which won't, which saves them a lot of money. It could also help smaller stores: they can't afford to buy random things and see if they sell, but they can use this model to see which items to sell.

Conclusion

In conclusion, I was able to get an R2 of 0.564684 using the LM model, which is pretty good for this situation. Throughout this project I learned about the different types of encoding and how they impact the model. I also learned the importance of data wrangling and how it takes up most of the time when building a model.
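For context, R2 measures the share of the variance in sales that the model explains, so a value of about 0.56 means roughly 56% is explained. The Python sketch below shows the standard calculation on made-up numbers; it does not reproduce the project's data or its reported score.

```python
def r_squared(actual, predicted):
    """Coefficient of determination: 1 - (residual sum of squares / total sum of squares)."""
    mean_actual = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1 - ss_res / ss_tot

# Made-up values for illustration only.
actual = [2.0, 4.0, 6.0, 8.0]
predicted = [3.0, 4.0, 6.0, 7.0]

score = r_squared(actual, predicted)
```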

Graph before data wrangling

![hello](graph before data wrangling.png)

Graph after data wrangling

![hello](new graph after data wrangling.png)

