Skip to content

shsarv/Machine-Learning-Projects

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

160 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Typing SVG

GIF

Stars Forks Contributors Issues License Last Commit

Project Overview

A curated portfolio of 26 end-to-end machine learning projects β€” spanning healthcare AI, real-time computer vision, NLP chatbots, time series forecasting, and classical ML. Each project applies theory to a practical problem, with several fully deployed as web and GUI applications.

πŸ“Š Repository at a Glance

πŸ“ Projects 🏷️ Domains πŸš€ Deployed Apps πŸ–₯️ GUI Apps ⭐ GitHub Stars
26 6 5 3 1.3k+

πŸ“š Table of Contents

Click to expand / collapse

All Projects by Category

Legend: Β  🟒 Beginner Β  🟑 Intermediate Β  πŸ”΄ Advanced Β |Β  🌐 Web App Β  πŸ–₯️ GUI App Β  πŸ““ Notebook

πŸ₯ Healthcare & Medical AI

6 Projects β€” click to collapse
Project Description Tools & Algorithms Level Type
Brain Tumor Detection Detects tumors in MRI scans using a CNN. Upload a scan and get a real-time prediction. PyTorch Β· CNN Β· Flask πŸ”΄ 🌐
Diabetes Prediction Predicts diabetes likelihood from 8 health markers (glucose, BMI, insulin, age) using the Pima Indians dataset. scikit-learn · SVM · Flask 🟑 🌐
Heart Disease Prediction Predicts cardiac risk from 13 clinical features with ~92% accuracy. scikit-learn · Logistic Reg. · Flask 🟑 🌐
Arrhythmia Classification Classifies 16 arrhythmia types from 279 ECG features (UCI dataset). SVM Β· KNN Β· Decision Tree 🟑 πŸ““
Medical Chatbot NLP chatbot mapping user-described symptoms to diagnoses via a curated medical knowledge base. NLTK Β· TF-IDF Β· Flask πŸ”΄ 🌐
MoA Prediction Predicts drug biological activity from gene expression and cell viability data (Kaggle competition). PyTorch Β· TabNet Β· Multi-label πŸ”΄ πŸ““

πŸŽ₯ Computer Vision & OpenCV

9 Projects β€” click to collapse
Project Description Tools & Algorithms Level Type
Driver Drowsiness Detection Monitors driver eye state via Eye Aspect Ratio (EAR) and triggers an audio alert on drowsiness. OpenCV Β· dlib Β· EAR 🟑 πŸ–₯️
Distracted Driver Detection Classifies 10 distracted behaviors (texting, eating, phone call, etc.) from dashboard camera images. CNN Β· Keras Β· ImageDataGenerator πŸ”΄ πŸ““
Lane Line Detection Overlays detected road lane lines on images/video using Canny edge detection and Hough transforms. OpenCV Β· Canny Β· Hough Transform 🟒 πŸ–₯️
Human Detection & Counting Detects and counts people in live video or images using HOG + SVM. OpenCV Β· HOG Β· SVM 🟒 πŸ–₯️
Gender & Age Detection Predicts gender and age group from a face image using pre-trained Caffe models. OpenCV DNN Β· Caffe Models 🟑 πŸ–₯️
Image Colorization Adds realistic color to grayscale photos using the Zhang et al. deep colorization network. OpenCV DNN Β· Zhang et al. Β· LAB space 🟑 πŸ““
Smile Selfie Capture Auto-captures a photo the instant a smile is detected in the webcam feed. No button needed. OpenCV Β· Haar Cascades 🟒 πŸ–₯️
Emoji Creator from Emotions Detects real-time facial emotions via webcam and overlays the matching emoji on screen. OpenCV Β· CNN Β· FER dataset 🟑 πŸ–₯️
Human Activity Recognition Classifies activities (walking, sitting, standing) from pose estimation keypoints over time. LSTM Β· Keras Β· 2D Pose Estimation πŸ”΄ πŸ““

πŸ“ˆ Classical ML & Prediction

7 Projects β€” click to collapse
Project Description Tools & Algorithms Level Type
Iris Flower Classification Classic benchmark β€” classifies iris species from petal/sepal measurements. Ideal for comparing classifiers side-by-side. KNN Β· SVM Β· Decision Tree Β· Naive Bayes 🟒 πŸ““
Wine Quality Prediction Predicts wine quality score (3–8) from 11 physicochemical properties like acidity, sulfates, and alcohol. Random Forest Β· XGBoost 🟑 πŸ““
Loan Repayment Prediction Predicts whether a LendingClub borrower will repay based on credit history, income, and loan purpose. Random Forest Β· XGBoost Β· Class Balancing 🟑 πŸ““
College Admission Prediction Estimates graduate admission probability from GRE, TOEFL, GPA, and research experience. Linear Reg. Β· Ridge Β· Lasso Β· SVR 🟒 πŸ““
Employee Turnover Prediction Identifies employees at high risk of leaving using HR data (satisfaction, evaluations, workload, promotions). Decision Tree Β· Random Forest 🟑 πŸ““
Property Maintenance Fines Predicts fine compliance from Detroit's blight dataset β€” a real-world class-imbalance problem (Michigan Data Science Team). Gradient Boosting Β· SMOTE Β· AUC optimization πŸ”΄ πŸ““
Research Topic Prediction Classifies academic papers into topic categories using NLP-based feature extraction on titles/abstracts. TF-IDF Β· Naive Bayes Β· SVM Β· NLTK 🟑 πŸ““

πŸ’¬ NLP & Conversational AI

2 Projects β€” click to collapse
Project Description Tools & Algorithms Level Type
AI Room Booking Chatbot Hotel room booking chatbot using IBM Watson. Handles slot-filling, availability queries, and booking confirmations through a web interface. IBM Watson Assistant · Watson Discovery 🟑 🌐
Medical Chatbot Symptom-to-diagnosis NLP chatbot with multi-turn conversation support. (Also listed under Healthcare.) NLTK Β· Flask Β· TF-IDF Β· Cosine Similarity πŸ”΄ 🌐

πŸ“Š Time Series & Business Analytics

2 Projects β€” click to collapse
Project Description Tools & Algorithms Level Type
Multi-Store Sales Prediction Forecasts daily sales for 50 items across 10 stores using three time series approaches and model ensembling. ARIMA Β· Facebook Prophet Β· LSTM (Keras) πŸ”΄ πŸ““
IPL Score Prediction Predicts first-innings T20 scores from ball-by-ball match data with deep EDA and multiple regression models. Linear/Ridge Reg. Β· Random Forest Β· ANN 🟑 πŸ““

πŸ—ΊοΈ Geospatial & Data Science

1 Project β€” click to collapse
Project Description Tools & Algorithms Level Type
The Battle of Neighborhoods IBM Capstone β€” clusters city neighborhoods using Foursquare API data to recommend optimal business locations. K-Means Β· Foursquare API Β· Folium Β· Geopy 🟑 πŸ““

πŸ› οΈ Tech Stack

Languages & Environments : Python Jupyter Google Colab

Machine Learning & Deep Learning : scikit-learn TensorFlow Keras PyTorch XGBoost

Computer Vision & NLP : OpenCV NLTK IBM Watson

Data & Visualization : Pandas NumPy Matplotlib Seaborn

Deployment : Flask Heroku Tkinter

πŸ“ Project Structure

Every project follows a consistent layout for easy navigation and reuse:

ProjectName/
β”‚
β”œβ”€β”€ πŸ“‚ data/                  # Raw and processed datasets
β”œβ”€β”€ πŸ“‚ notebooks/             # Jupyter notebooks (EDA β†’ Training β†’ Evaluation)
β”œβ”€β”€ πŸ“‚ models/                # Saved weights (.pkl / .h5 / .pt)
β”œβ”€β”€ πŸ“‚ static/                # CSS, JS, images  (Flask apps)
β”œβ”€β”€ πŸ“‚ templates/             # Jinja2 HTML templates  (Flask apps)
β”œβ”€β”€ πŸ“‚ src/
β”‚   β”œβ”€β”€ preprocess.py         # Data cleaning & feature engineering
β”‚   β”œβ”€β”€ train.py              # Model training pipeline
β”‚   └── predict.py            # Inference logic
β”œβ”€β”€ app.py                    # Flask entry point  (web apps)
β”œβ”€β”€ requirements.txt          # Python dependencies
└── README.md                 # Project-specific documentation

πŸš€ Getting Started

Prerequisites

Python 3.7+  |  pip  |  Git

Clone & Run

# Clone the repository
git clone https://github.com/shsarv/Machine-Learning-Projects.git
cd Machine-Learning-Projects

# Navigate to any project
cd "Heart Disease Prediction [END 2 END]"

# (Recommended) Create a virtual environment
python -m venv venv
source venv/bin/activate        # Linux / macOS
venv\Scripts\activate           # Windows

# Install dependencies
pip install -r requirements.txt

# For Flask web apps
python app.py
# β†’ Open http://127.0.0.1:5000

# For notebooks
jupyter notebook

Deploy to Heroku

heroku login
heroku create your-app-name
echo "web: gunicorn app:app" > Procfile
git push heroku main
heroku open

Contributions 🌱

We welcome contributions to this project! If you would like to improve the existing codebase or contribute new features, feel free to submit a pull request. Before submitting, please ensure that you adhere to the following:

  1. Fork this repo
  2. Branch: git checkout -b feature/YourProjectName
  3. Structure your folder with a README.md and requirements.txt
  4. Commit: git commit -m "Add: YourProjectName"
  5. Push: git push origin feature/YourProjectName
  6. Open a Pull Request β†’ target main

Please read CONTRIBUTING.md and follow the Code of Conduct.

Future Enhancements:

  • Integrate Explainable AI (XAI) models for better understanding of predictions in complex models.
  • Add Docker support for easy containerization of all projects.
  • Incorporate CI/CD pipelines using GitHub Actions for automated testing and deployment.
  • Migrate some projects to use streamlit for interactive dashboards.
  • Explore Reinforcement Learning for game-based AI projects.
  • Expand the NLP section to include text summarization, translation, and more chatbot capabilities.

πŸ“š Resources and References

For a deeper understanding of AI, machine learning, and data science, I recommend the following courses:

  • Coursera - Machine Learning by Andrew Ng
  • Udacity - AI for Everyone
  • Kaggle Learn - Data Science

⭐ Acknowledgments

  • The wonderful Kaggle community, which provided open datasets and insightful discussions.
  • Udemy, Coursera, and edX instructors who have helped me build a solid foundation in AI.

License

Distributed under the MIT License. See LICENSE for more information.

Maintained By


Sarvesh Sharma

About

This repository showcases a selection of machine learning projects undertaken to understand and master various ML concepts. Each project reflects commitment to applying theoretical knowledge to practical scenarios, demonstrating proficiency in machine learning techniques and tools.

Topics

Resources

Code of conduct

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages