Welcome to my GitHub portfolio! I am a passionate and dedicated data enthusiast with a strong background in Data Science. My journey in the world of technology and data-driven solutions has led me to work on various projects that showcase my skills and expertise.
Hi, I'm Jin. I graduated with Distinction from the Master of Data Science programme at Durham University. I have worked as both a data analyst and a data scientist, and I have recently moved into data engineering as well. My passion lies in using data to uncover actionable insights!
Within this portfolio, you'll discover a collection of comprehensive end-to-end data science projects that showcase my dedication to mastering this dynamic field. I'm committed to a lifelong journey of expanding my skills and knowledge in Data Science, always staying at the forefront of emerging technologies and trends.
- Project 1 - Insurance Claim Prediction
  - A random forest model that predicts whether a policyholder will file a claim in the next 6 months.
  - Frameworks/Tech Stack:
    - Python
    - Scikit-Learn
    - Pandas
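
As a rough illustration of this kind of model (not the project's actual code), a minimal scikit-learn sketch on synthetic data might look like the following. The feature names (`age`, `vehicle_age`, `prior_claims`), the label-generating rule, and all hyperparameters are assumptions made up for the example.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in for policyholder data; the columns are hypothetical.
rng = np.random.default_rng(42)
n = 1000
age = rng.integers(18, 80, size=n)
vehicle_age = rng.integers(0, 20, size=n)
prior_claims = rng.poisson(0.3, size=n)
X = np.column_stack([age, vehicle_age, prior_claims])

# Made-up rule: claims become more likely with prior claims and older vehicles.
logit = -2.0 + 0.8 * prior_claims + 0.05 * vehicle_age
y = (rng.random(n) < 1.0 / (1.0 + np.exp(-logit))).astype(int)

# Stratify so the minority "claim" class appears in both splits.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=42
)

# class_weight="balanced" is one common way to handle rare positive labels.
model = RandomForestClassifier(
    n_estimators=200, class_weight="balanced", random_state=42
)
model.fit(X_train, y_train)

# Probability that each held-out policyholder files a claim.
claim_proba = model.predict_proba(X_test)[:, 1]
auc = roc_auc_score(y_test, claim_proba)
print(f"ROC AUC on held-out data: {auc:.3f}")
```

ROC AUC is used here because claim data is usually imbalanced, which makes plain accuracy a misleading metric.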
- Project 2 - K-Means Clustering for Customer Segmentation
  - A K-Means clustering model that segments a retailer's customers into 5 distinct groups.
  - Frameworks/Tech Stack:
    - Python
    - Scikit-Learn
    - Pandas
    - Matplotlib
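
For readers unfamiliar with the technique, a minimal sketch of 5-cluster K-Means segmentation with scikit-learn is shown below. It is not the project's code: the two features (annual spend and number of visits), the synthetic data, and the choice to standardize first are all assumptions for illustration.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for customer features (hypothetical columns:
# annual spend and number of visits), drawn around 5 made-up centres.
rng = np.random.default_rng(0)
centers = np.array(
    [[200, 5], [800, 10], [1500, 30], [3000, 15], [5000, 50]], dtype=float
)
X = np.vstack([c + rng.normal(scale=[50, 1.5], size=(40, 2)) for c in centers])

# Standardize so the larger-valued spend column does not dominate distances.
X_scaled = StandardScaler().fit_transform(X)

# Fit K-Means with 5 clusters, matching the 5 customer segments.
kmeans = KMeans(n_clusters=5, n_init=10, random_state=0)
labels = kmeans.fit_predict(X_scaled)

# Number of customers assigned to each segment.
segment_sizes = np.bincount(labels, minlength=5)
print(segment_sizes)
```

On real data the number of clusters would normally be checked with something like the elbow method or silhouette scores rather than fixed in advance.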
- Project 3 - XGBoost Customer Churn Prediction
  - An XGBoost classifier that predicts customer churn.
  - Frameworks/Tech Stack:
    - Python
    - Jupyter Notebook
    - Scikit-Learn
    - Pandas
    - seaborn
- Project 4 - 1D-CNN PyTorch Time Series Classifier
  - A time series classifier that automates the detection and classification of seismic waves in real time.
  - Frameworks/Tech Stack:
    - Python
    - Jupyter Notebook
    - Scikit-Learn
    - Pandas
    - PyTorch
- Project 1 - EDA of Wholesale Data
  - An exploratory data analysis of a wholesale distributor's customers.
  - Frameworks/Tech Stack:
    - R
- Project 2 - Customer Shopping Dashboards
  - A Power BI dashboard to monitor customer shopping status.
  - Frameworks/Tech Stack:
    - Power BI
    - Power Query
- Project 1 - Azure End-to-End Data Engineering Project
  - An end-to-end data engineering project.
  - Frameworks/Tech Stack:
    - Azure Data Factory
    - Azure Synapse Studio
    - SQL
    - Apache Spark
    - Databricks
    - Lakehouse
My core competencies span the following areas:
- Data Analytics
- Machine Learning
- Deep Learning
My skill set includes proficiency in the following major frameworks and libraries:
- Python
- SQL
- Azure
- R
- Scikit-Learn
- PyTorch
- NumPy
- Pandas
- SciPy
- Cloud Computing
Feel free to reach out to me through the following channels:
- Email: [email protected]
- LinkedIn