Skip to content

Soubhagyadev/Invisible-Pollution-Data-Engineering-Visualization

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🏭 India Air Pollution Visualization Dashboard

A full-stack data visualization and engineering project designed to track and raise awareness about harmful air pollutants across India.
Built using Tableau, Python, AWS, and Figma, this project utilizes 20M+ rows of real-time sensor data sourced from OpenAQ.


Portfolio Thumbnai

📌 Overview

This is a full visualization and data engineering project designed to visualize harmful air pollutants in India, aiming to raise public awareness.
The data, sourced from OpenAQ, contains 20 million rows and is processed through a series of steps, including compression, transformation, and outlier removal to address sensor errors.
This visualization focuses on air pollution trends for the years 2024–2025.

Tech Stack: Tableau · Python · AWS · OpenAQ API · Data Engineering · Figma · Pandas · Database Manipulation


⚙️ Methodology

  • 📥 Data Retrieval: Fetched from the OpenAQ AWS S3 bucket using a custom Python script with the Boto3 SDK.
  • 🧹 Data Cleaning: Sensor errors (extreme values) removed using Pandas, and the dataset is transformed for usability.
  • 📊 EDA: Performed for quick insights into pollutants across time and locations.
  • 📈 Tableau: Used to dive deep into data patterns and spatial pollution mapping.
  • 🎨 Figma: Used to prototype dashboard layout—scatter plots, maps, and correlation visuals were planned beforehand.
  • 📚 Research: Conducted on Indian air quality issues to ensure insights are grounded in reality.

🔍 Findings

  • 🏙️ Delhi records the highest levels of PM2.5 and PM10 pollution.
  • 📈 Multiple cities operate above WHO safety limits, posing risks even to healthy individuals.
  • 🔗 A strong correlation between PM2.5 and PM10 values is observed across states.
  • 🏗️ Construction dust and residential fuel usage are major contributing factors.

✅ Conclusions

The dashboard successfully shows pollutant trends over time, across locations, and between different air quality parameters, making the data intuitive and informative for the general public.


🔗 Tableau Visualization

👉 Click here to view the full Tableau dashboard


🖼️ Preview

Add screenshots or GIFs of your dashboard here: Linkedln Share


💡 How to Use

  • Open the Tableau dashboard via the link above.
  • Hover over visuals for detailed insights and metrics.

⭐ Feel free to star this repo if you found the project helpful or insightful!

About

A Full Visualization & Data Engineering Project Which Aims To Visualize Air Quality Data For India Using Tableau Dashboard & Plotly Dash

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors