Allra-Fintech/data-engineer-assignment

Data Engineer System Design Assignment

Overview

You are tasked with designing a real-time analytics platform for a growing e-commerce company.

Business Context

Our e-commerce platform currently processes 50,000 orders per day, with peak traffic during sales events reaching 500,000 orders per day. The business needs insights to:

  • Monitor key business metrics (GMV, conversion rates, inventory levels)
  • Personalize user experiences in real time based on behavior
  • Generate daily/weekly executive reports
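For a sense of scale, the daily order volumes above can be converted into average per-second rates. This is only a back-of-envelope sketch: it assumes orders are spread evenly across the day, which real traffic (especially during sales events) will not be, so actual bursts will be higher than these averages.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400


def per_second(volume_per_day: int) -> float:
    """Average rate per second for a given daily volume (assumes even spread)."""
    return volume_per_day / SECONDS_PER_DAY


# Figures taken from the Business Context paragraph.
normal_orders_per_s = per_second(50_000)   # roughly 0.58 orders/s on average
peak_orders_per_s = per_second(500_000)    # roughly 5.79 orders/s on average

print(f"normal: {normal_orders_per_s:.2f}/s, peak day: {peak_orders_per_s:.2f}/s")
```

Even on a peak day the average write rate is small; the design pressure comes more from latency, burstiness, and retention than from raw order throughput.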

Current State

  • Order data: MySQL database (primary transactional system)
  • User behavior: Web analytics events via JavaScript SDK (~1M events/day)
  • Inventory: Third-party API with rate limits (100 requests/minute)
  • Team size: 3 data engineers, 2 analysts, 1 ML engineer
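The inventory rate limit of 100 requests/minute shapes how fresh inventory data can be. One possible way to stay under it is to pace calls on the client side, as in the minimal sketch below. `RequestPacer` and `full_refresh_minutes` are hypothetical helpers, not part of any vendor SDK, and the one-request-per-SKU model is an assumption because the brief does not describe the API's shape.

```python
import math
import time
from typing import Callable


class RequestPacer:
    """Spaces calls at least 60 / max_per_minute seconds apart."""

    def __init__(
        self,
        max_per_minute: int,
        clock: Callable[[], float] = time.monotonic,
        sleep: Callable[[float], None] = time.sleep,
    ) -> None:
        self.min_interval = 60.0 / max_per_minute
        self._clock = clock
        self._sleep = sleep
        self._next_allowed = clock()

    def wait(self) -> float:
        """Block until the next request may be sent; return seconds slept."""
        now = self._clock()
        delay = max(0.0, self._next_allowed - now)
        if delay > 0:
            self._sleep(delay)
        # Schedule the following slot one interval after this one.
        self._next_allowed = max(now, self._next_allowed) + self.min_interval
        return delay


def full_refresh_minutes(sku_count: int, max_per_minute: int = 100) -> int:
    """Minutes to poll every SKU once, ASSUMING one request per SKU."""
    return math.ceil(sku_count / max_per_minute)
```

Usage would be to call `pacer.wait()` before each API request. Under the one-request-per-SKU assumption, a hypothetical catalog of 10,000 SKUs would take `full_refresh_minutes(10_000)` = 100 minutes for one full pass, which suggests inventory may need prioritized or incremental polling rather than full sweeps.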

Requirements

Functional:

  • Ingest data from multiple sources in real time
  • Enable near-real-time personalization features for ML models, with latency under 5 minutes
  • Support both streaming and batch analytics
  • Handle schema evolution gracefully
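As one illustration of the schema evolution requirement, a consumer can be written as a "tolerant reader": it fills defaults for fields that older producers omit and preserves fields it does not yet know about, instead of failing. The sketch below assumes events arrive as JSON-like dictionaries; the field names in `KNOWN_FIELDS` are hypothetical, since the brief does not specify the SDK's event schema.

```python
from typing import Any

# Hypothetical event fields and their defaults; the real schema is not given.
KNOWN_FIELDS: dict[str, Any] = {
    "event_id": None,
    "user_id": None,
    "event_type": "unknown",
    "ts": None,
}
REQUIRED_FIELDS = ("event_id", "ts")


def normalize_event(raw: dict[str, Any]) -> dict[str, Any]:
    """Map a raw event onto the known schema without dropping new fields."""
    missing = [name for name in REQUIRED_FIELDS if raw.get(name) is None]
    if missing:
        raise ValueError(f"missing required fields: {missing}")

    # Known fields: take the value if present, otherwise the default.
    event = {name: raw.get(name, default) for name, default in KNOWN_FIELDS.items()}
    # Unknown fields: keep them so a later schema version can promote them.
    event["extras"] = {k: v for k, v in raw.items() if k not in KNOWN_FIELDS}
    return event
```

For example, `normalize_event({"event_id": "e1", "ts": 1, "new_field": "x"})` keeps `new_field` under `extras` and fills `event_type` with its default. A schema registry with typed formats is another common option; this sketch only shows the reader-side idea.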

Non-functional:

  • Scale to 10x current volume over the next 2 years
  • Data retention: 7 years for compliance
  • Cost-conscious solution (startup budget)
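The scale and retention requirements can be combined into a rough upper bound on stored raw data. The sketch below takes the daily volumes from this brief, applies the 10x growth target, and multiplies out 7 years. The 1,000 bytes per record is purely an assumed placeholder (the brief gives no record sizes), and the result ignores compression, replication, and the fact that volume will not be at 10x for the whole period.

```python
DAYS_PER_YEAR = 365


def retained_bytes(records_per_day: int, bytes_per_record: int, years: int = 7) -> int:
    """Raw bytes accumulated over the retention period at a constant daily rate."""
    return records_per_day * bytes_per_record * DAYS_PER_YEAR * years


ASSUMED_BYTES_PER_RECORD = 1_000  # assumption, not from the brief
GROWTH_FACTOR = 10                # from the non-functional requirements

events_bytes = retained_bytes(1_000_000 * GROWTH_FACTOR, ASSUMED_BYTES_PER_RECORD)
orders_bytes = retained_bytes(50_000 * GROWTH_FACTOR, ASSUMED_BYTES_PER_RECORD)

print(f"events: {events_bytes / 1e12:.2f} TB, orders: {orders_bytes / 1e12:.2f} TB")
```

Under these assumptions the behavior events dominate storage by a wide margin, which is relevant to the cost-conscious constraint when choosing storage tiers and file formats.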

Constraints

  • Must use cloud services (AWS, GCP, or Azure)
  • Cannot modify existing MySQL schema
  • The team has experience with Python, SQL, and basic Docker
  • Time to deliver: The solution must be implemented within 3 months

Deliverable

Write a 1-2 page system design document using the RFC template by Juan Pablo Buriticá as a starting point. You don't need to follow the exact same structure as the template, but use it as guidance. What matters is delivering your ideas in a concise document. Focus on:

  • High-level architecture
  • Technology choices and rationale
  • Data flow and processing strategy
  • Scalability and reliability considerations
  • Implementation phases/timeline

Submission

Please send your completed document via email to: dev@allra.co.kr
