docs: Add DSQL loader operations reference#176
Conversation
90abd0d to
4161f36
Compare
Functional Eval Results (with-skill, live run)Ran evals 10-12 with the skill loaded. All 11/11 expectations pass.
The skill teaches DSQL-loader-specific operational knowledge (partition warming, tmpfs defaults, header flag semantics) that the agent cannot infer from general training data. |
| --dry-run | ||
| ``` | ||
|
|
||
| ### Going Deeper |
There was a problem hiding this comment.
going deeper feels like a strange header, how will the agent understand when it should do this?
|
|
||
| --- | ||
|
|
||
| ## Related References |
There was a problem hiding this comment.
what was the nested benefit with related references in the file as a separate section?
|
|
||
| ### [data-loading.md](references/data-loading.md) | ||
|
|
||
| **When:** Load when planning or running bulk loads with `aurora-dsql-loader`, or diagnosing loads that come in slower than expected. |
There was a problem hiding this comment.
"or diagnosing slow load times"
slower than expected is a weird statement structure. subjective eval by the agent.
Code reviewDSQL skill PR adds 20-agent fleet ran per dsql-skill-author Workflow 2. Findings post 5-gate validation:
Items considered and dropped (audit trail)
🤖 Generated with Claude Code — 20-agent fleet per dsql-skill-author Workflow 2 §1 roster, all findings 5-gate validated at head SHA If this code review was useful, please react with 👍. Otherwise, react with 👎. |
Add references/data-loading.md covering aurora-dsql-loader operations: - Fresh-vs-warm partition behavior and throughput expectations - Resume/retry mechanics (--manifest-dir, --resume-job-id) - Conflict handling (--on-conflict do-nothing) - CSV/TSV header handling (--header flag, v3.0.0 default) - Schema inference caveats and --dry-run validation - Index count impact on throughput - Diagnostic decision tree for slow loads SKILL.md changes: - Add Workflow 3: Bulk Data Loading with key constraints - Add data loading to overview and Quick Start - Add trigger keywords (aurora-dsql-loader, bulk load, etc.) - Add data-loading.md reference entry with When/Contains - Add cross-reference from connectivity-tools.md Eval coverage: - 3 should-trigger + 2 should-not-trigger entries in trigger_evals.json - 3 functional evals (IDs 10-12) in evals.json with LLM judge grading - data_loading_eval_results.md with expected with-skill vs baseline comparison demonstrating the skill teaches operational knowledge not in general training data (partition warming, tmpfs defaults, header flag semantics)
4161f36 to
d093231
Compare
5687cab to
3e1d908
Compare
- Condense data-loading.md from 242 to 166 lines (remove verbose explanations per reviewer feedback) - Add RFC 2119 directives (MUST/SHOULD) for --manifest-dir, --header, --on-conflict preconditions, and schema inference validation - Add silent-failure callout for schema inference caveats - Fix --on-conflict semantics: triggers on any unique constraint, not just PK; add precondition that source must not have changed for crash recovery - Rename 'Going Deeper' header to 'When to load the full reference' with clear agent trigger condition - Fix 'slower than expected' to 'slow load times' in SKILL.md - Trim redundant description triggers (bulk load DSQL, DSQL data loading) - Fix workflow-number drift: Workflow 6 → 7 in dsql-lint.md, Workflow 8 → 9 in README/evals/runner - Fix README assertion count 43 → 42 - Remove standalone 'Related References' section (cross-ref inlined at top) - Add eval results with baseline vs with-skill comparison
3e1d908 to
86ec0da
Compare
Add a comprehensive data-loading reference for the aurora-dsql-loader, covering:
Also adds:
By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of the project license.