Add MS-SQL (SQL Server) support

This issue tracks the work done on the `mssql` branch to make Datafaker work end-to-end against Microsoft SQL Server databases.

### Background

Datafaker was originally written targeting PostgreSQL and DuckDB. MS-SQL differs in several ways that required systematic fixes across the codebase: different SQL dialect (no `RANDOM()`/`LIMIT`, no `EXTRACT`, no `STDDEV`), mandatory schema qualification when a non-default schema is in use, different type names, identity column behaviour, and stricter rules around foreign-key cascade paths.

---

### Changes by area

#### Foundation and initial driver support

- [`b2f14ab`](https://github.com/SAFEHR-data/datafaker/commit/b2f14ab) — Add initial MS-SQL support: `pyodbc`/`aioodbc` dependencies, async DSN rewriting, schema routing via SQLAlchemy `schema_translate_map`
- [`d872fd9`](https://github.com/SAFEHR-data/datafaker/commit/d872fd9) — Document macOS ODBC driver setup and add `.env.example` for MS-SQL connection strings
- [`fc97b2c`](https://github.com/SAFEHR-data/datafaker/commit/fc97b2c) — Add `TrustServerCertificate=yes` to MS-SQL DSN examples

#### Type system

- [`76fec75`](https://github.com/SAFEHR-data/datafaker/commit/76fec75) — Extend type parser for MS-SQL column types (issue #96): `BIGINT`, `NVARCHAR`, `DATETIME2`, etc.
- [`da4a69b`](https://github.com/SAFEHR-data/datafaker/commit/da4a69b) — Strip `SERIAL`/`IDENTITY` for MS-SQL target databases (issue #97)

#### Schema and foreign keys

- [`81a65eb`](https://github.com/SAFEHR-data/datafaker/commit/81a65eb) — Fix schema-qualified FK resolution and MS-SQL multiple cascade paths (closes #101): resolve `schema.table.column` references in `orm.yaml` and avoid `FOREIGN KEY … CASCADE` conflicts that MS-SQL rejects

#### Primary keys

- [`84a209f`](https://github.com/SAFEHR-data/datafaker/commit/84a209f) — Let the database generate integer primary keys on MS-SQL (closes #104): suppress Datafaker-generated PKs when the column has `IDENTITY`

#### Dialect-correct SQL in generators and interactive shell

MS-SQL does not support `RANDOM()` / `LIMIT n` or `EXTRACT(… FROM …)` / `STDDEV`. Every code path that emits these had to be updated:

- [`3637258`](https://github.com/SAFEHR-data/datafaker/commit/3637258) — Fix dialect-specific SQL in generator commands (closes #105): `RANDOM()`→`NEWID()`, `LIMIT`→`TOP`, `EXTRACT`→`DATEPART`, `STDDEV`→`STDEV` in the interactive shell
- [`7c0add6`](https://github.com/SAFEHR-data/datafaker/commit/7c0add6) — Fix `RANDOM()`/`LIMIT` in `ChoiceGeneratorFactory` (closes #106)
- [`2bcea2b`](https://github.com/SAFEHR-data/datafaker/commit/2bcea2b) — Fix schema-qualified table in live queries of `ChoiceGeneratorFactory`
- [`41b96f6`](https://github.com/SAFEHR-data/datafaker/commit/41b96f6) — Fix schema-missing FROM clause in `Buckets` queries
- [`29e7889`](https://github.com/SAFEHR-data/datafaker/commit/29e7889) — Fix remaining `RANDOM()`/`LIMIT` incompatibilities across `CovariateQuery`, `MissingnessType`, and `do_peek`/`print_column_data` (closes #107)

#### Schema qualification for SQL stored in `src-stats`

Queries written by `configure-generators` into `config.yaml`'s `src-stats` section are later executed by `make-stats` via raw `text()` — SQLAlchemy's `schema_translate_map` does not apply. Each code path that writes these strings needed to embed the schema-qualified table name explicitly:

- [`91578da`](https://github.com/SAFEHR-data/datafaker/commit/91578da) — Fix missing schema qualification in interactive shell `SELECT` statements (`do_peek`, `do_counts`, `print_column_data`, `_get_column_data`)
- [`4d03aa3`](https://github.com/SAFEHR-data/datafaker/commit/4d03aa3) — Fix unqualified table names in raw SQL during `configure-generators` (`ContinuousLogDistributionGeneratorFactory`, `MultivariateNormalGeneratorFactory`)
- [`173f6da`](https://github.com/SAFEHR-data/datafaker/commit/173f6da) — Fix unqualified table names in `src-stats` queries written by `configure-generators` (`_get_aggregate_query`, `PredefinedGenerator.SELECT_AGGREGATE_RE`)
- [`1132dc6`](https://github.com/SAFEHR-data/datafaker/commit/1132dc6) — Fix schema-qualified table name in `ChoiceGenerator` stored queries (closes #108): use `text(schema_qualified_name(...))` so the compiled SQL string is not bracket-quoted as `[mimic100.person]`

#### Tests

Dialect-correctness tests were added alongside each fix. A dedicated test module `tests/test_generators_dialect.py` covers:
- `DATEPART` vs `EXTRACT` in `MimesisDateTimeGenerator`
- `STDEV` vs `STDDEV` in `Buckets`
- `NEWID()`/`TOP` vs `RANDOM()`/`LIMIT` in `ChoiceGeneratorFactory`, `CovariateQuery`, `MissingnessType`
- Schema qualification in `Buckets`, `ContinuousLogDistributionGeneratorFactory`, `ChoiceGeneratorFactory`, `_get_aggregate_query`, and `PredefinedGenerator`

`tests/test_interactive_dialect.py` covers the interactive shell commands (`do_peek`, `do_counts`, `_get_column_data`, `print_column_data`) for both MS-SQL and PostgreSQL dialects.

#### Examples

- [`40fc121`](https://github.com/SAFEHR-data/datafaker/commit/40fc121) — Rename `mimic_omop` example to `omop-mssql`
- [`1857e76`](https://github.com/SAFEHR-data/datafaker/commit/1857e76) — Add `omop-postgresql` example

---

### Known limitations / deferred work

- Challenge 8 (deferred, tracked in #99): full end-to-end test against a live MS-SQL instance in CI
- UUID type mapping (#98) is documented but not fully resolved for all edge cases

---

### Summary

The `mssql` branch adds end-to-end MS-SQL support to Datafaker. The main categories of change were: (1) driver and connection plumbing, (2) SQL dialect translation (`RANDOM`→`NEWID`, `LIMIT`→`TOP`, `EXTRACT`→`DATEPART`, `STDDEV`→`STDEV`), (3) schema qualification in stored SQL strings that bypass `schema_translate_map`, and (4) MS-SQL-specific schema/type/FK constraints. All changes are covered by unit tests using mocked engines and dialect instances.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add MS-SQL (SQL Server) support #109

Background

Changes by area

Foundation and initial driver support

Type system

Schema and foreign keys

Primary keys

Dialect-correct SQL in generators and interactive shell

Schema qualification for SQL stored in `src-stats`

Tests

Examples

Known limitations / deferred work

Summary

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Add MS-SQL (SQL Server) support #109

Description

Background

Changes by area

Foundation and initial driver support

Type system

Schema and foreign keys

Primary keys

Dialect-correct SQL in generators and interactive shell

Schema qualification for SQL stored in src-stats

Tests

Examples

Known limitations / deferred work

Summary

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions

Schema qualification for SQL stored in `src-stats`