al1sr

Hi, I am Alicia 👋

A results-driven data professional passionate about transforming raw, complex, and un-structured datasets into scalable data pipelines, predictive models, and strategic business insights.

About me

Graduate with a Bachelor's degree in Economics, totally studied in english at University Carlos III.
Graduate with a Master's Degree in Data Science, specializing in end to end analytical pipelines, statistical modeling, and data engineering architectures.
My core expertise lies in designing robust data curation workflows, managing database schemas, and building interactive business intelligence systems.
Currently exploring advanced distributed computing workflows, automated deployment of pipelines, and production-level machine learning architectures.

Technical stack

Category	Technologies & Tools
Data engineering
Databases & query
Programming languages
Analytics & BI
Environments & versioning

Featured repositories

End to End Ticketing BI Pipeline

Stack: Apache Hop • MySQL • Tableau
Design of a complete data warehouse infrastructure using a medallion architecture (bronze, silver, gold) to isolate data extraction and monitor service level agreements (SLA) alongside internal backlog dynamics.

Telecom Customer Churn Prediction

Stack: R • Caret • Glmnet • pROC
Application of advanced statistical inference, logistic regression, and predictive regularization models (Ridge and Lasso) using cross-validation to prevent user attrition and handle corporate dataset balancing.

Streaming Platform Content Analysis

Stack: Python • Apache Spark • Parquet
Cloud-ready big data engineering pipeline targetting compressed columnar metadata and nested arrays to track international streaming production hubs and release timeline trends.

Car Depreciation and Market Value Analysis

Stack: Python • Pandas • Seaborn
Detailed exploratory data analysis (EDA) and data cleansing pipeline on automotive marketplace transactions, implementing domain-specific outlier filtering and mathematical stabilization.

Global Emissions and Climate Change

Stack: R • Tidyverse • Plotly
Statistical transformation, custom multi-variable reshaping, and data cleaning using advanced functional pivoting techniques to isolate specific environmental pollution metrics by industry.

Traffic Accident Analysis in Madrid

Stack: Python • Pandas • Seaborn
Consolidation and deep data cleaning of a multi-year municipal open data corpus encompassing over 312,000 records to identify temporal and seasonal trends in road safety.

Let's connect

Are you interested in my profile or looking to discuss data architecture and analytics? Feel free to reach out!

LinkedIn: linkedin.com/in/aliciasantamariaroman
Email: your-email@example.com

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

al1sr

Highlights

Block or report al1sr

Hi, I am Alicia 👋

About me

Technical stack

Featured repositories

End to End Ticketing BI Pipeline

Telecom Customer Churn Prediction

Streaming Platform Content Analysis

Car Depreciation and Market Value Analysis

Global Emissions and Climate Change

Traffic Accident Analysis in Madrid

Let's connect

Popular repositories Loading

Uh oh!