Machine-learning-Models-Testing-on-Diabetes-Data

This repository focuses on training and comparing machine learning models to predict a binary outcome (0 or 1) for diabetes diagnosis using a highly imbalanced dataset. The models include XGBoost, Gradient Boosting, Random Forest, and Logistic Regression. It implements bootstrap resampling to generate precision-recall curves for model comparison, which is particularly useful for imbalanced data. Additionally, it analyzes classification thresholds (False Negatives, False Positives, True Positives, True Negatives) to identify individuals who are harder to classify and determine optimal thresholds for diabetes detection.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
Resampling.ipynb		Resampling.ipynb
Theshold_Analysis_2.ipynb		Theshold_Analysis_2.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Machine-learning-Models-Testing-on-Diabetes-Data

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Machine-learning-Models-Testing-on-Diabetes-Data

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages