This repository contains a collection of machine learning projects completed. Each branch represents a different project focusing on various machine learning concepts and implementations.
knn: Implementation of K-Nearest Neighbors algorithm and performance evaluation proceduresdecision-tree-and-model-evaluations: Implementation of Decision Tree algorithm and model evaluation proceduresTweets sentiment analysis: Tweet Sentiment Analysis for Stock Market Performance PredictionPerceptron-model: Perceptron Implementation and Model Comparison Study
- Implementation of KNN classifier
- Data preprocessing techniques
- Model comparison with Naive Bayes
- Performance visualization and analysis

- Implementation of Decision Tree Classifier for heart disease prediction
- Evaluation using various metrics (accuracy, precision, recall, F-measure, etc.)
- Detailed performance analysis and model evaluation

- Analysis of tweet impact on stock market performance
- Feature extraction from text data using TF-IDF and binary representations
- Implementation of various feature selection techniques
- Comparison of different linear regression approaches (closed-form, LASSO, SGD)
- Large-scale data processing and optimization
- Custom implementation of Perceptron algorithm
- Comprehensive model comparison study including:
- Logistic Regression (with different regularizations)
- Decision Trees
- K-Nearest Neighbors
- Neural Networks
- Implementation of grid search and random search for hyperparameter tuning
- ROC curve analysis and performance metrics comparison

Jinghan (Summer) Sun jinghan.sun@emory.edu