Implementation of Database Migrations & Automated Backups #63

markoceri · 2026-01-23T22:12:50Z

Summary

This pull request upgrades the data management layer of the application by implementing a robust migration system. By transitioning from manual SQLite schema changes to a structured system using SQLAlchemy ORM with Alembic migrations, this approach allows for safe database evolution without the risk of data loss.

Key Drivers

Previously, changing any entity property required manual SQL intervention, which often led to data being wiped if the schema became incompatible. To support the long-term growth of the project and protect existing data, I have implemented an automated migration workflow that:

Preserves data integrity during schema changes
Provides version control for database structure
Enables safe rollback to previous states
Maintains clean separation between domain logic and persistence (DDD/Hexagonal Architecture)

Key Improvements

1. Data Preservation Through Alembic Migrations

I integrated Alembic to handle schema evolutions automatically. The system now:

Tracks all database schema changes through versioned migration files
Applies migrations incrementally, transforming the database to the new version while keeping all existing records intact
Detects pending migrations and applies only necessary changes
Enforces a migration-first approach: all schema changes must go through Alembic (no manual SQL)

Reference: See docs/ALEMBIC_MIGRATIONS.md for complete migration system documentation.

2. Automated Backups for Safety

As an extra layer of security, I introduced an automatic backup routine that:

Creates a timestamped database backup before applying any migration
Uses format: dbname_backup_YYYYMMDD_HHMMSS.db
Only triggers when there are pending migrations (skips backup if database is up-to-date)
Can be enabled/disabled via BACKUP_BEFORE_MIGRATION environment variable
Currently supports SQLite databases

Configuration:

# .env file
BACKUP_BEFORE_MIGRATION=true  # Enable automatic backups (default: true)
RUN_MIGRATIONS_ON_STARTUP=true  # Run migrations on app startup (default: true)

Reference: See Startup Workflow section in the migration guide.

3. Standardized Access with SQLAlchemy ORM

By using SQLAlchemy with imperative mapping, I have:

Moved to a robust Object-Relational Mapping (ORM) approach
Kept domain entities completely clean and independent of persistence details
Implemented a 4-phase conversion system for complex value objects:
- Load phase: Convert database primitives to domain types (EntityId, enums, value objects)
- Before insert/update: Flatten value objects to primitives or serialize to JSON
- After insert/update: Restore domain types after persistence
Maintained strict adherence to DDD and Hexagonal Architecture principles

Technical Details:

Domain entities remain pure @dataclass objects
SQLAlchemy mapping happens separately in adapter layer
Event listeners handle all type conversions transparently
Repositories work directly with domain entities

Reference: See DDD Principles section and Migration Example.

4. Future-Proofing and Developer Experience

This setup removes the "fear of updating" and enables:

Rapid feature development: Add or modify entity properties without data loss concerns
Safe experimentation: Rollback capability if migrations encounter issues
Team collaboration: Version-controlled migrations prevent conflicts
Multi-database support: Easy transition to PostgreSQL, MySQL, or other databases
CI/CD integration: Automated migration checks and testing

Developer Workflow:

# 1. Modify domain entity and table definition
# 2. Generate migration automatically
python scripts/migrate.py create "Add new field"

# 3. Review generated migration
cat alembic/versions/xxx_add_new_field.py

# 4. Apply migration (automatic on startup or manual)
python -m edge_mining  # Automatic
python scripts/migrate.py upgrade  # Manual

Reference: See Practical Example for complete step-by-step workflow.

Key Updates

1. SQLAlchemy Implementation

Replaced raw SQLite queries with SQLAlchemy models:

Implemented imperative mapping for all domain entities
Created type-safe table definitions in adapter layer
Added custom SQLAlchemy types for enums (e.g., MinerStatusType, EnergySourceType)
Implemented event listeners for value object conversions

Files Changed:

edge_mining/adapters/domain/*/tables.py - Table definitions and mappings
edge_mining/adapters/domain/*/repositories.py - SQLAlchemy repositories
edge_mining/adapters/infrastructure/persistence/sqlalchemy/ - Infrastructure layer

2. Alembic Initialization

Initialized Alembic for version control of database schema:

Configured alembic.ini with proper paths and settings
Created alembic/env.py that integrates with SQLAlchemy metadata
Configured migration template (script.py.mako)
Generated initial migration with all existing tables

3. Pre-Migration Backup Hook

Added automatic database backup before migrations:

Implemented in BaseSQLAlchemyRepository.initialize_database()
Creates timestamped backups only when migrations are pending
Integrated with startup workflow for zero-configuration safety
Configurable via environment variables

4. Automatic Migration on Startup

Updated deployment and startup scripts:

Modified bootstrap.py to use initialize_database() method
Added configuration options for migration behavior
Implemented smart migration detection (skips if up-to-date)
Created CLI tool scripts/migrate.py for manual migration management

CLI Commands:

# Check migration status
python scripts/migrate.py status

# View migration history
python scripts/migrate.py history

# Apply migrations manually
python scripts/migrate.py upgrade

# Rollback migrations
python scripts/migrate.py downgrade [n]

# Create new migration
python scripts/migrate.py create "Description"

Testing

Comprehensive test coverage added:

Unit Tests (42 tests)

tests/unit/adapters/domain/energy/test_tables_event_listeners.py
- Event listener behavior for all conversion phases
- Value object serialization/deserialization
- EntityId and enum conversions

Integration Tests (34 tests)

tests/integration/adapters/persistence/test_sqlalchemy_energy_repositories.py (21 tests)
- Full CRUD operations with real database
- Complex queries and relationships
tests/integration/adapters/persistence/test_alembic_migrations.py (9 tests)
- Migration system validation
- Upgrade/downgrade workflows
- Backup creation
tests/integration/adapters/persistence/test_e2e_persistence.py (8 tests)
- End-to-end persistence scenarios
- Multi-entity workflows

Run tests:

# All tests
pytest

# Only migration tests
pytest tests/integration/adapters/persistence/test_alembic_migrations.py

# With coverage
pytest --cov=edge_mining --cov-report=html

Documentation

Complete documentation has been added:

docs/ALEMBIC_MIGRATIONS.md
- Complete guide to the migration system
- Configuration options and environment variables
- Startup workflow and automatic migration process
- CLI commands and manual migration management
- Best practices for development and production
- Troubleshooting common issues
- CI/CD integration examples
docs/MIGRATION_EXAMPLE.md
- Practical step-by-step example: adding a "temperature" field to miners
- Domain entity modification
- Table definition updates
- Migration generation and verification
- Testing the changes
- Rollback procedures
- Complex migration scenarios (rename, NOT NULL, indexes)

Configuration

Environment Variables

Add to your .env file:

# Database configuration
DB_PATH=sqlite:///edgemining.db
# For PostgreSQL: DB_PATH=postgresql://user:pass@localhost/dbname
# For MySQL: DB_PATH=mysql+pymysql://user:pass@localhost/dbname

# Migration settings
RUN_MIGRATIONS_ON_STARTUP=true  # Automatic migrations on startup
BACKUP_BEFORE_MIGRATION=true    # Create backup before migrations (SQLite only)

# Persistence adapter
PERSISTENCE_ADAPTER=sqlalchemy

Settings

Configuration in edge_mining/shared/settings/settings.py:

class AppSettings(BaseSettings):
    persistence_adapter: str = "sqlalchemy"
    db_path: str = "sqlite:///edgemining.db"
    run_migrations_on_startup: bool = True
    backup_before_migration: bool = True

Migration from Previous Version

For existing installations:

Automatic Migration (Recommended):

Backup your current database manually (just in case)
Pull the latest code
Start the application:
```
python -m edge_mining
```

The system will automatically:

✅ Detect your existing database structure
✅ Generate initial migration if not present
✅ Create automatic backup before any changes
✅ Apply only necessary migrations
✅ Preserve all existing data

Note: If your database is already up-to-date, no migrations will be applied and no backup will be created.

Manual Migration (Development Only):

If you need to manage migrations manually for development purposes:

Generate initial migration (if not present):

alembic revision --autogenerate -m "Initial schema with all tables"

Apply migrations manually:
```
python scripts/migrate.py upgrade
```

For production use, always rely on automatic migrations that run on application startup.

Rollback Plan

If issues arise after deployment:

# Check current migration status
python scripts/migrate.py status

# Rollback to previous version
python scripts/migrate.py downgrade 1

# Or rollback to specific revision
alembic downgrade <revision_id>

# Restore from backup (if needed)
cp edgemining_backup_YYYYMMDD_HHMMSS.db edgemining.db

Benefits Summary

Aspect	Before	After
Schema Changes	Manual SQL, risk of data loss	Automated migrations, data preserved
Safety	No backup system	Automatic pre-migration backups
Rollback	Manual restore	One-command rollback
Team Workflow	Ad-hoc changes	Version-controlled migrations
Domain Purity	Mixed concerns	Clean separation (DDD)
Code Quality	Raw SQL queries	Type-safe ORM
Future Changes	High risk, slow	Low risk, fast
Database Support	SQLite only	SQLite, PostgreSQL, MySQL, etc.

Next Steps

After merging:

Monitor first production deployment with automatic migrations
Verify backup creation works as expected
Train team on migration workflow (see docs/MIGRATION_EXAMPLE.md)
Consider setting up migration testing in CI/CD pipeline
Plan migration to PostgreSQL (if desired) using existing infrastructure

This PR represents a major infrastructure improvement that will enable rapid, safe evolution of the data model while maintaining architectural integrity and data safety.

…ller domain with imperative mapping

…ovider

…HomeLoadsProfile with custom serialization

…erialization

…tory and ORM mappings

…sitory and ORM mappings

…pings

…th ORM mappings

…nitor with ORM mappings

…e adapters

… mappings

…icies persistence

…e support

… proper registration

…ings

…tion subclasses

…g conversion

… for value object conversions

…ySource and EnergyMonitor

…in ExternalService

…rces_table

…ables

…s script for Alembic setup checks

…ation in energy and miner tables

…is enabled

…e field to miners

…ations across multiple tables

…ase initialization instructions for SQLAlchemy

…iguration details

…migration handling

…l, and documentation improvements

…nterpreter path

…owngrades

… migrations

…ructions and migration enforcement details

…update related references

…omain

…ergyMonitor and EnergySource

…gration tests for SQLAlchemy event listeners and migration functionality

markoceri added 30 commits January 20, 2026 02:16

feat: implement SQLAlchemy-based repository for Miner and MinerContro…

d0ab94d

…ller domain with imperative mapping

feat: enhance error handling in miner controller config deserialization

2a6b26f

feat: add database ORM and migrations dependencies to requirements

772511b

feat: implement SQLAlchemy repository and ORM mappings for ForecastPr…

b41cf3a

…ovider

feat: implement SQLAlchemy repositories for HomeForecastProvider and …

18eb8bc

…HomeLoadsProfile with custom serialization

feat: implement SQLAlchemy repository for Notifier with custom JSON s…

b138b9c

…erialization

feat: add SQLAlchemy implementation for EnergyOptimizationUnit reposi…

7e6cff1

…tory and ORM mappings

feat: add SQLAlchemy implementation for MiningPerformanceTracker repo…

a182ee8

…sitory and ORM mappings

feat: implement SQLAlchemy repository for SystemSettings with ORM map…

1ea06f9

…pings

feat: add SQLAlchemy implementation for ExternalService repository wi…

ed8283f

…th ORM mappings

feat: exclude composite columns from mapping to avoid conflicts

07977d0

feat: implement SQLAlchemy repositories for EnergySource and EnergyMo…

b62b9df

…nitor with ORM mappings

feat: add SQLAlchemy support in bootstrap and settings for persistenc…

522a5e9

…e adapters

feat: implement SQLAlchemy repository for OptimizationPolicy with ORM…

73aa140

… mappings

feat: add SQLAlchemyOptimizationPolicyRepository to bootstrap for pol…

0209ec2

…icies persistence

feat: add aiosqlite dependency for database support in requirements

d767a19

feat: add SQLAlchemy, Alembic, and aiosqlite dependencies for databas…

30893e3

…e support

feat: add registry loader for SQLAlchemy table definitions and ensure…

389d573

… proper registration

refactor: remove aiosqlite dependency and update database URL in sett…

96bd4e8

…ings

feat: implement ConfigurationType for JSON serialization of Configura…

af497df

…tion subclasses

feat: add custom SQLAlchemy type for MinerStatus enum to handle strin…

25de6ac

…g conversion

feat: enhance Miner and MinerController mappings with event listeners…

4d24966

… for value object conversions

feat: implement event listeners for value object conversions in Energ…

e72230c

…ySource and EnergyMonitor

feat: add event listener to convert adapter_type from string to enum …

ea6be5e

…in ExternalService

fix: add foreign key constraint to forecast_provider_id in energy_sou…

dece37e

…rces_table

feat: add foreign key constraint to external_service_id in multiple t…

efc82ce

…ables

feat: update persistence_adapter to use sqlalchemy as default

ccf674f

feat: implement SQLAlchemy persistence with Alembic migrations support

0ca06da

feat: add CLI utility for managing Alembic migrations

0a864b6

feat: update migrate.py usage instructions and add validate_migration…

5e7ccae

…s script for Alembic setup checks

markoceri added 26 commits January 22, 2026 20:20

feat: add SQLAlchemy database file patterns to .gitignore

1fd2c91

feat: enhance SQLAlchemy tables docstring

202733f

fix: update type hints and add type ignore comments for JSON serializ…

3daa952

…ation in energy and miner tables

feat: add database backup option before running migrations

192fb28

fix: update comments for migration settings and ensure backup option …

645fea4

…is enabled

feat: add comprehensive migration documentation for adding temperatur…

6db7858

…e field to miners

feat: add developer warning for schema changes requiring Alembic migr…

524f035

…ations across multiple tables

feat: update README with environment variable configuration and datab…

aa4884c

…ase initialization instructions for SQLAlchemy

feat: add README for utility scripts with usage instructions and conf…

0065332

…iguration details

feat: add README for Alembic single-database configuration

3bd8d58

fix: update test migration path for automated testing

907beaa

feat: add unit tests for BaseSQLAlchemyRepository initialization and …

988b1f4

…migration handling

feat: update CHANGELOG with automatic Alembic migrations, new CLI too…

70f872e

…l, and documentation improvements

fix: update launch and settings configuration for consistent Python i…

31c09e4

…nterpreter path

fix: initializate database schema with the first migration

90258f0

feat: add Alembic migration script template for schema upgrades and d…

ac645ee

…owngrades

fix: enhance database schema initialization documentation for Alembic…

25b8300

… migrations

docs: enhance Alembic migration documentation with initial setup inst…

6db0222

…ructions and migration enforcement details

refactor: rename energy_optimization_units to optimization_units and …

de7c46b

…update related references

feat: update energy source schema to store complex value objects as JSON

5576d14

feat: add integration and unit test modules for adapters and energy d…

fe4028d

…omain

feat: add event listeners to convert and restore value objects for En…

06336a2

…ergyMonitor and EnergySource

feat: added tests for persistence

35af125

feat: enhance testing infrastructure with comprehensive unit and inte…

28e244f

…gration tests for SQLAlchemy event listeners and migration functionality

Merge branch 'dev' into sqlalchemy-and-elembic

92a35bd

fix: update changelog to reflect version 0.1.0 release

95c5d0b

markoceri added the enhancement New feature or request label Jan 23, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implementation of Database Migrations & Automated Backups #63

Implementation of Database Migrations & Automated Backups #63

Uh oh!

markoceri commented Jan 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Implementation of Database Migrations & Automated Backups #63

Are you sure you want to change the base?

Implementation of Database Migrations & Automated Backups #63

Uh oh!

Conversation

markoceri commented Jan 23, 2026

Summary

Key Drivers

Key Improvements

1. Data Preservation Through Alembic Migrations

2. Automated Backups for Safety

3. Standardized Access with SQLAlchemy ORM

4. Future-Proofing and Developer Experience

Key Updates

1. SQLAlchemy Implementation

2. Alembic Initialization

3. Pre-Migration Backup Hook

4. Automatic Migration on Startup

Testing

Unit Tests (42 tests)

Integration Tests (34 tests)

Documentation

Configuration

Environment Variables

Settings

Migration from Previous Version

Rollback Plan

Benefits Summary

Next Steps

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants