This project is a Machine Learning based Rainfall Prediction System developed using Python in Google Colab.
The system analyzes weather-related parameters and predicts whether rainfall will occur or not using a Random Forest Classifier model.
https://your-streamlit-link.streamlit.app
- Python
- Google Colab
- Pandas
- NumPy
- Scikit-learn
- Matplotlib
- Data Preprocessing
- Handling Missing Values
- Feature Selection
- Oversampling for Data Balancing
- Random Forest Classification
- Model Evaluation
- ROC Curve & AUC Score
- Import Libraries
- Load Dataset
- Clean Dataset
- Handle Missing Values
- Convert Target Variable
- Balance Dataset using Oversampling
- Feature Selection
- Split Training & Testing Data
- Train Random Forest Model
- Predict Rainfall
- Evaluate Accuracy
- Generate ROC Curve and AUC Score
- Predicts rainfall occurrence
- Handles missing data efficiently
- Uses balanced dataset for better accuracy
- Evaluates performance using ROC Curve
- Supports prediction on random/custom input data
MLE_PROJECT_ON_RAINFALL_PREDICTION.ipynb→ Main project notebookREADME.md→ Project documentation
- Open the notebook in Google Colab
- Upload the dataset file
- Install required libraries if needed
- Run all notebook cells sequentially
Random Forest Classifier
- Deploy as a web application
- Add real-time weather API integration
- Improve prediction accuracy
- Add graphical user interface
Harshita Singh
B.Tech CSE (AI & DS) Student at Marwadi University
IITM BS Degree Student
Interested in Machine Learning and Data Science

