Heart Disease Prediction Model This project builds a machine learning model to predict the presence of heart disease in patients using 14 key medical records:
• Features:
- Age
- Sex
- Chest pain type
- Resting blood pressure (trestbps)
- Cholesterol level
- Fasting blood sugar
- Resting electrocardiographic results (restecg)
- Maximum heart rate achieved (thalach)
- Exercise-induced angina
- Old peak (ST depression induced by exercise)
- Peak exercise ST segment
- Number of major vessels (0-3) colored by fluoroscopy
- Thalium stress test result
- Target: Presence or absence of heart disease
• Workflow:
- Data Preparation: The dataset was cleaned, processed, and split for training and testing.
- Model Selection & Tuning: Various models were tested, with XGBClassifier performing best.
- Accuracy: Fine-tuning the model achieved a high accuracy of 92.13%, indicating its robustness.
- Evaluation: Model performance was assessed using key metrics such as precision, recall, and F1 score.
This repository demonstrates the practical application of Python libraries like Pandas, NumPy, Matplotlib, and Scikit-Learn in machine learning. It showcases end-to-end implementation, including data analysis, visualization, modeling, and optimization. Note: XGBClassifier was used for its ability to handle imbalanced datasets and achieve high predictive accuracy of 92.13%.