Decision_Tree_Classifier_Heart_Disease

I implemented an end-to-end machine learning model utilizing decision trees and random forests to predict heart disease due to a variety of environmental and biologic factors. In this project I really delved under the hood to better understand the hyperparameter tuning of each model. One large difficulty in creating this model was that the dataset was extremely imbalanced.

Questions:

Which factors contribute most to an individual being at risk for coronary heart disease (CHD)?
How can an imbalanced dataset be mitigated?

Dataset:

Data Analysis:

Decision Tree Classifier, Random Forest Classifier, Imbalanced Data

-All analysis and visualization done in Python using pandas numpy sklearn seaborn matplotlib

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Heart_Disease_Prediction.pdf		Heart_Disease_Prediction.pdf
ML_Project.py		ML_Project.py
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Decision_Tree_Classifier_Heart_Disease

Questions:

Dataset:

Data Analysis:

About

Releases

Packages

Languages

micahwiesner67/Decision_Tree_Classifier_Heart_Disease

Folders and files

Latest commit

History

Repository files navigation

Decision_Tree_Classifier_Heart_Disease

Questions:

Dataset:

Data Analysis:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages