This repo includes a simple ML analysis script for the Kaggle Pima Indians Diabetes dataset.

- Mainly for a course project

Steps:

- Download `diabetes.csv` from Kaggle and place it at `data/diabetes.csv`.
- Install dependencies: `pip install -r requirements.txt`.
- Run the analysis: `python pima_analysis.py --data data/diabetes.csv`.

The script prints basic EDA, treats zeros as missing for selected columns, trains multiple models,
reports metrics, and optionally saves the best model with `--save`.
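
As a rough illustration of the "zeros as missing" step, here is a minimal standard-library sketch. It is not the code in `pima_analysis.py`. The column list `ZERO_AS_MISSING`, the helper names (`load_rows`, `missing_counts`, `impute_median`), and the median fill are all assumptions made for this example; the actual script may select different columns or handle missing values differently. The sample rows are made up and only follow the dataset's column layout.

```python
import csv
import io
from statistics import median

# Assumed selection: columns where a zero is physiologically implausible.
# The script's real list may differ.
ZERO_AS_MISSING = ["Glucose", "BloodPressure", "SkinThickness", "Insulin", "BMI"]


def load_rows(handle):
    """Read CSV rows as dicts of floats; zeros in selected columns become None."""
    rows = []
    for raw in csv.DictReader(handle):
        row = {name: float(value) for name, value in raw.items()}
        for name in ZERO_AS_MISSING:
            if name in row and row[name] == 0:
                row[name] = None
        rows.append(row)
    return rows


def missing_counts(rows):
    """Count values marked as missing in each selected column."""
    return {
        name: sum(1 for r in rows if name in r and r[name] is None)
        for name in ZERO_AS_MISSING
    }


def impute_median(rows):
    """Fill missing values in place with the median of the observed values.

    Median imputation is an assumption of this sketch, not a documented
    behavior of the script.
    """
    for name in ZERO_AS_MISSING:
        observed = [r[name] for r in rows if r.get(name) is not None]
        if not observed:
            continue  # nothing to estimate the median from
        fill = median(observed)
        for r in rows:
            if name in r and r[name] is None:
                r[name] = fill
    return rows


# Made-up rows in the dataset's column layout (not real records).
SAMPLE = """Pregnancies,Glucose,BloodPressure,SkinThickness,Insulin,BMI,DiabetesPedigreeFunction,Age,Outcome
1,100,70,20,0,25.0,0.5,30,0
0,0,80,0,90,30.0,0.3,40,1
2,120,0,30,110,0,0.7,50,1
"""

rows = load_rows(io.StringIO(SAMPLE))
counts = missing_counts(rows)  # how many zeros were treated as missing
impute_median(rows)
print(counts)
```

To try this on the real file, pass an open handle instead of the sample, for example `open("data/diabetes.csv", newline="")`. Zeros in columns such as `Pregnancies` and `Outcome` are left untouched because a zero is a valid value there.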