SQL-Based Recommendation System

This project implements a complete Data Science lifecycle focused on Applied Artificial Intelligence. It transitions from raw SQL data to a prescriptive strategy engine.

🏗️ System Architecture

The project is modularized into three distinct phases to ensure scalability and maintainability (SOLID principles):

Descriptive Phase: SQL-based performance metrics extraction and business health visualization.
Predictive Phase: Item-Item Collaborative Filtering using Cosine Similarity to identify hidden patterns in user behavior.
Prescriptive Phase: An AI-driven strategy engine that transforms similarity scores into actionable executive plans.

🛠️ Tech Stack

Language: Python 3.12 (Strict Type Hinting)
Database: SQLite (Relational Persistence)
Core Libraries: Pandas, Scikit-Learn, Pydantic, Logging
Interface: Streamlit (Dashboarding)
DevOps: Dotenv (Config Management), Pytest (Quality Assurance)

🚀 How to Run

Clone the repository.
Install dependencies: pip install -r requirements.txt.
Configure your .env file (see src/config.py).
(Optional) Run python seed_data.py to populate the database.
Launch the dashboard: streamlit run app.py.

❓ Interview FAQ

Q: Why use Cosine Similarity for the recommendation engine? A: Cosine similarity is effective for recommendation systems because it measures the orientation (angle) between item vectors rather than their magnitude. This makes it robust against variations in the number of ratings per user.

Q: How does the system handle the 'Cold Start' problem? A: In this senior implementation, the Prescriptive Phase includes a "Fallthrough" logic. If the similarity score is 0.00% due to lack of data (sparsity), the system shifts from collaborative filtering to popularity-based recommendations (Descriptive Phase).

Q: Why use Pydantic schemas in a Data Science project? A: To ensure data integrity. By validating the output of the recommendation engine before it reaches the UI, we prevent runtime errors and ensure that the business strategy always receives the expected parameters.

📄 License This project is distributed under the MIT license. Its purpose is strictly educational and research-based, developed as an Applied Data Science solution.

Note for recruiters: This project demonstrates advanced skills in Software Engineering applied to Artificial Intelligence. Modularity, dependency injection for database management, and state persistence in web applications (Streamlit) were prioritized. It provides a solid foundation for scaling to ML microservices or integrations with LLMs.

Autor: JUAN S. Contacto: https://github.com/johnyse99

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
data		data
src		src
test		test
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
preview.png		preview.png
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SQL-Based Recommendation System

🏗️ System Architecture

🛠️ Tech Stack

🚀 How to Run

❓ Interview FAQ

About

Uh oh!

Releases

Packages

Languages

License

johnyse99/SQL-Based-Recommendation-System

Folders and files

Latest commit

History

Repository files navigation

SQL-Based Recommendation System

🏗️ System Architecture

🛠️ Tech Stack

🚀 How to Run

❓ Interview FAQ

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages