Skip to content
View al1sr's full-sized avatar

Highlights

  • Pro

Block or report al1sr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
al1sr/README.md

Hi, I am Alicia 👋

A results-driven data professional passionate about transforming raw, complex, and un-structured datasets into scalable data pipelines, predictive models, and strategic business insights.

About me

  • Graduate with a Bachelor's degree in Economics, totally studied in english at University Carlos III.
  • Graduate with a Master's Degree in Data Science, specializing in end to end analytical pipelines, statistical modeling, and data engineering architectures.
  • My core expertise lies in designing robust data curation workflows, managing database schemas, and building interactive business intelligence systems.
  • Currently exploring advanced distributed computing workflows, automated deployment of pipelines, and production-level machine learning architectures.

Technical stack

Category Technologies & Tools
Data engineering Apache Hop Apache Spark
Databases & query MySQL SQL
Programming languages Python R
Analytics & BI Tableau Pandas Tidyverse
Environments & versioning Git RStudio

Featured repositories

Stack: Apache HopMySQLTableau
Design of a complete data warehouse infrastructure using a medallion architecture (bronze, silver, gold) to isolate data extraction and monitor service level agreements (SLA) alongside internal backlog dynamics.


Stack: RCaretGlmnetpROC
Application of advanced statistical inference, logistic regression, and predictive regularization models (Ridge and Lasso) using cross-validation to prevent user attrition and handle corporate dataset balancing.


Stack: PythonApache SparkParquet
Cloud-ready big data engineering pipeline targetting compressed columnar metadata and nested arrays to track international streaming production hubs and release timeline trends.


Stack: PythonPandasSeaborn
Detailed exploratory data analysis (EDA) and data cleansing pipeline on automotive marketplace transactions, implementing domain-specific outlier filtering and mathematical stabilization.


Stack: RTidyversePlotly
Statistical transformation, custom multi-variable reshaping, and data cleaning using advanced functional pivoting techniques to isolate specific environmental pollution metrics by industry.


Stack: PythonPandasSeaborn
Consolidation and deep data cleaning of a multi-year municipal open data corpus encompassing over 312,000 records to identify temporal and seasonal trends in road safety.

Let's connect

Are you interested in my profile or looking to discuss data architecture and analytics? Feel free to reach out!

Popular repositories Loading

  1. Depreciation_analysis_and_market_value Depreciation_analysis_and_market_value Public

    Depreciation analysis and the market value of second hand cars in Ukranie

    Jupyter Notebook

  2. Madrid_accidents Madrid_accidents Public

    Exploratory Data Analysis about Madrid accidents between 2019 and 2025

    Jupyter Notebook

  3. Global_emissions_analysis_R Global_emissions_analysis_R Public

    Exploratory Data Analysis of Global Emissions and Temperature Change Using R and Tidyverse

  4. al1sr al1sr Public

    Get to know me more!!!

  5. TelcoChurn_R TelcoChurn_R Public

    Telecom customer churn and predictive modeling with R

  6. Netflix_bigdata Netflix_bigdata Public

    Streaming platform content analysis and big data engineering