π Live Website Β Β·Β π Report Bug Β Β·Β β¨ Request Feature
A curated portfolio of 26 end-to-end machine learning projects β spanning healthcare AI, real-time computer vision, NLP chatbots, time series forecasting, and classical ML. Each project applies theory to a practical problem, with several fully deployed as web and GUI applications.
| π Projects | π·οΈ Domains | π Deployed Apps | π₯οΈ GUI Apps | β GitHub Stars |
|---|---|---|---|---|
| 26 | 6 | 5 | 3 | 1.3k+ |
Click to expand / collapse
Legend: Β π’ Beginner Β π‘ Intermediate Β π΄ Advanced Β |Β π Web App Β π₯οΈ GUI App Β π Notebook
6 Projects β click to collapse
| Project | Description | Tools & Algorithms | Level | Type |
|---|---|---|---|---|
| Brain Tumor Detection | Detects tumors in MRI scans using a CNN. Upload a scan and get a real-time prediction. | PyTorch Β· CNN Β· Flask | π΄ | π |
| Diabetes Prediction | Predicts diabetes likelihood from 8 health markers (glucose, BMI, insulin, age) using the Pima Indians dataset. | scikit-learn Β· SVM Β· Flask | π‘ | π |
| Heart Disease Prediction | Predicts cardiac risk from 13 clinical features with ~92% accuracy. | scikit-learn Β· Logistic Reg. Β· Flask | π‘ | π |
| Arrhythmia Classification | Classifies 16 arrhythmia types from 279 ECG features (UCI dataset). | SVM Β· KNN Β· Decision Tree | π‘ | π |
| Medical Chatbot | NLP chatbot mapping user-described symptoms to diagnoses via a curated medical knowledge base. | NLTK Β· TF-IDF Β· Flask | π΄ | π |
| MoA Prediction | Predicts drug biological activity from gene expression and cell viability data (Kaggle competition). | PyTorch Β· TabNet Β· Multi-label | π΄ | π |
9 Projects β click to collapse
| Project | Description | Tools & Algorithms | Level | Type |
|---|---|---|---|---|
| Driver Drowsiness Detection | Monitors driver eye state via Eye Aspect Ratio (EAR) and triggers an audio alert on drowsiness. | OpenCV Β· dlib Β· EAR | π‘ | π₯οΈ |
| Distracted Driver Detection | Classifies 10 distracted behaviors (texting, eating, phone call, etc.) from dashboard camera images. | CNN Β· Keras Β· ImageDataGenerator | π΄ | π |
| Lane Line Detection | Overlays detected road lane lines on images/video using Canny edge detection and Hough transforms. | OpenCV Β· Canny Β· Hough Transform | π’ | π₯οΈ |
| Human Detection & Counting | Detects and counts people in live video or images using HOG + SVM. | OpenCV Β· HOG Β· SVM | π’ | π₯οΈ |
| Gender & Age Detection | Predicts gender and age group from a face image using pre-trained Caffe models. | OpenCV DNN Β· Caffe Models | π‘ | π₯οΈ |
| Image Colorization | Adds realistic color to grayscale photos using the Zhang et al. deep colorization network. | OpenCV DNN Β· Zhang et al. Β· LAB space | π‘ | π |
| Smile Selfie Capture | Auto-captures a photo the instant a smile is detected in the webcam feed. No button needed. | OpenCV Β· Haar Cascades | π’ | π₯οΈ |
| Emoji Creator from Emotions | Detects real-time facial emotions via webcam and overlays the matching emoji on screen. | OpenCV Β· CNN Β· FER dataset | π‘ | π₯οΈ |
| Human Activity Recognition | Classifies activities (walking, sitting, standing) from pose estimation keypoints over time. | LSTM Β· Keras Β· 2D Pose Estimation | π΄ | π |
7 Projects β click to collapse
| Project | Description | Tools & Algorithms | Level | Type |
|---|---|---|---|---|
| Iris Flower Classification | Classic benchmark β classifies iris species from petal/sepal measurements. Ideal for comparing classifiers side-by-side. | KNN Β· SVM Β· Decision Tree Β· Naive Bayes | π’ | π |
| Wine Quality Prediction | Predicts wine quality score (3β8) from 11 physicochemical properties like acidity, sulfates, and alcohol. | Random Forest Β· XGBoost | π‘ | π |
| Loan Repayment Prediction | Predicts whether a LendingClub borrower will repay based on credit history, income, and loan purpose. | Random Forest Β· XGBoost Β· Class Balancing | π‘ | π |
| College Admission Prediction | Estimates graduate admission probability from GRE, TOEFL, GPA, and research experience. | Linear Reg. Β· Ridge Β· Lasso Β· SVR | π’ | π |
| Employee Turnover Prediction | Identifies employees at high risk of leaving using HR data (satisfaction, evaluations, workload, promotions). | Decision Tree Β· Random Forest | π‘ | π |
| Property Maintenance Fines | Predicts fine compliance from Detroit's blight dataset β a real-world class-imbalance problem (Michigan Data Science Team). | Gradient Boosting Β· SMOTE Β· AUC optimization | π΄ | π |
| Research Topic Prediction | Classifies academic papers into topic categories using NLP-based feature extraction on titles/abstracts. | TF-IDF Β· Naive Bayes Β· SVM Β· NLTK | π‘ | π |
2 Projects β click to collapse
| Project | Description | Tools & Algorithms | Level | Type |
|---|---|---|---|---|
| AI Room Booking Chatbot | Hotel room booking chatbot using IBM Watson. Handles slot-filling, availability queries, and booking confirmations through a web interface. | IBM Watson Assistant Β· Watson Discovery | π‘ | π |
| Medical Chatbot | Symptom-to-diagnosis NLP chatbot with multi-turn conversation support. (Also listed under Healthcare.) | NLTK Β· Flask Β· TF-IDF Β· Cosine Similarity | π΄ | π |
2 Projects β click to collapse
| Project | Description | Tools & Algorithms | Level | Type |
|---|---|---|---|---|
| Multi-Store Sales Prediction | Forecasts daily sales for 50 items across 10 stores using three time series approaches and model ensembling. | ARIMA Β· Facebook Prophet Β· LSTM (Keras) | π΄ | π |
| IPL Score Prediction | Predicts first-innings T20 scores from ball-by-ball match data with deep EDA and multiple regression models. | Linear/Ridge Reg. Β· Random Forest Β· ANN | π‘ | π |
1 Project β click to collapse
| Project | Description | Tools & Algorithms | Level | Type |
|---|---|---|---|---|
| The Battle of Neighborhoods | IBM Capstone β clusters city neighborhoods using Foursquare API data to recommend optimal business locations. | K-Means Β· Foursquare API Β· Folium Β· Geopy | π‘ | π |
Every project follows a consistent layout for easy navigation and reuse:
ProjectName/
β
βββ π data/ # Raw and processed datasets
βββ π notebooks/ # Jupyter notebooks (EDA β Training β Evaluation)
βββ π models/ # Saved weights (.pkl / .h5 / .pt)
βββ π static/ # CSS, JS, images (Flask apps)
βββ π templates/ # Jinja2 HTML templates (Flask apps)
βββ π src/
β βββ preprocess.py # Data cleaning & feature engineering
β βββ train.py # Model training pipeline
β βββ predict.py # Inference logic
βββ app.py # Flask entry point (web apps)
βββ requirements.txt # Python dependencies
βββ README.md # Project-specific documentation
Python 3.7+ | pip | Git
# Clone the repository
git clone https://github.com/shsarv/Machine-Learning-Projects.git
cd Machine-Learning-Projects
# Navigate to any project
cd "Heart Disease Prediction [END 2 END]"
# (Recommended) Create a virtual environment
python -m venv venv
source venv/bin/activate # Linux / macOS
venv\Scripts\activate # Windows
# Install dependencies
pip install -r requirements.txt
# For Flask web apps
python app.py
# β Open http://127.0.0.1:5000
# For notebooks
jupyter notebookheroku login
heroku create your-app-name
echo "web: gunicorn app:app" > Procfile
git push heroku main
heroku openWe welcome contributions to this project! If you would like to improve the existing codebase or contribute new features, feel free to submit a pull request. Before submitting, please ensure that you adhere to the following:
- Fork this repo
- Branch:
git checkout -b feature/YourProjectName - Structure your folder with a
README.mdandrequirements.txt - Commit:
git commit -m "Add: YourProjectName" - Push:
git push origin feature/YourProjectName - Open a Pull Request β target
main
Please read CONTRIBUTING.md and follow the Code of Conduct.
- Integrate Explainable AI (XAI) models for better understanding of predictions in complex models.
- Add Docker support for easy containerization of all projects.
- Incorporate CI/CD pipelines using GitHub Actions for automated testing and deployment.
- Migrate some projects to use streamlit for interactive dashboards.
- Explore Reinforcement Learning for game-based AI projects.
- Expand the NLP section to include text summarization, translation, and more chatbot capabilities.
- Official Python Documentation: Python.org
- Flask Documentation: Flask.palletsprojects.com
- Scikit-learn User Guide: Scikit-learn.org
- Keras Documentation: Keras.io
- TensorFlow Documentation: Tensorflow.org
- PyTorch Documentation: Pytorch.org
For a deeper understanding of AI, machine learning, and data science, I recommend the following courses:
- Coursera - Machine Learning by Andrew Ng
- Udacity - AI for Everyone
- Kaggle Learn - Data Science
- The wonderful Kaggle community, which provided open datasets and insightful discussions.
- Udemy, Coursera, and edX instructors who have helped me build a solid foundation in AI.
Distributed under the MIT License. See LICENSE for more information.
Sarvesh Sharma |
