A full-stack data visualization and engineering project designed to track and raise awareness about harmful air pollutants across India.
Built using Tableau, Python, AWS, and Figma, this project utilizes 20M+ rows of real-time sensor data sourced from OpenAQ.
This is a full visualization and data engineering project designed to visualize harmful air pollutants in India, aiming to raise public awareness.
The data, sourced from OpenAQ, contains 20 million rows and is processed through a series of steps, including compression, transformation, and outlier removal to address sensor errors.
This visualization focuses on air pollution trends for the years 2024–2025.
Tech Stack: Tableau · Python · AWS · OpenAQ API · Data Engineering · Figma · Pandas · Database Manipulation
- 📥 Data Retrieval: Fetched from the OpenAQ AWS S3 bucket using a custom Python script with the Boto3 SDK.
- 🧹 Data Cleaning: Sensor errors (extreme values) removed using Pandas, and the dataset is transformed for usability.
- 📊 EDA: Performed for quick insights into pollutants across time and locations.
- 📈 Tableau: Used to dive deep into data patterns and spatial pollution mapping.
- 🎨 Figma: Used to prototype dashboard layout—scatter plots, maps, and correlation visuals were planned beforehand.
- 📚 Research: Conducted on Indian air quality issues to ensure insights are grounded in reality.
- 🏙️ Delhi records the highest levels of PM2.5 and PM10 pollution.
- 📈 Multiple cities operate above WHO safety limits, posing risks even to healthy individuals.
- 🔗 A strong correlation between PM2.5 and PM10 values is observed across states.
- 🏗️ Construction dust and residential fuel usage are major contributing factors.
The dashboard successfully shows pollutant trends over time, across locations, and between different air quality parameters, making the data intuitive and informative for the general public.
👉 Click here to view the full Tableau dashboard
- Open the Tableau dashboard via the link above.
- Hover over visuals for detailed insights and metrics.
⭐ Feel free to star this repo if you found the project helpful or insightful!

