2 files changed: +8 -19 lines changed
  # CodeContextBench: A Systematic Evaluation Framework for Assessing the Impact of Enhanced Code Intelligence on AI Coding Agent Performance

- **White Paper Technical Report (Derived)**
+ **White Paper Technical Report**
  **Date:** February 27, 2026

- > Derivation note: The canonical source for technical-report content is
- > `docs/technical_reports/TECHNICAL_REPORT_V1.md`. This white-paper document is
- > a derived presentation artifact and should be updated from the canonical report.
-
  ---

  ## Abstract
@@ -223,10 +219,9 @@ Repository Scale:

  | Difficulty | Tasks | Percentage | Description |
  | ---------- | ----: | ---------- | ---------------------------------------------------------- |
- | medium | ~30 | 12% | Dependency installation, straightforward fixes, unit tests |
- | hard | ~140 | 58% | Multi-file changes, cross-repo reasoning, runbooks |
- | very_hard | ~10 | 4% | Deep dependency chain analysis, architectural refactoring |
- | expert | 5 | 2% | Linux kernel fault localization |
+ | medium | 21 | 7.5% | Dependency installation, straightforward fixes, unit tests |
+ | hard | 245 | 87.8% | Multi-file changes, cross-repo reasoning, runbooks |
+ | expert | 13 | 4.7% | Kernel/debug fault localization and highest-complexity tasks |

  ---

  # CodeContextBench: A Systematic Evaluation Framework for Assessing the Impact of Enhanced Code Intelligence on AI Coding Agent Performance

- **Canonical Technical Report (Source of Truth)**
+ **White Paper Technical Report**
  **Date:** February 27, 2026

- > Canonical source policy: This document (`docs/technical_reports/TECHNICAL_REPORT_V1.md`)
- > is the authoritative source for technical report updates. Any white-paper or
- > presentation variants (including `docs/WHITE_PAPER_REPORT_V2.md`) should be
- > treated as derived artifacts synchronized from this report.
-
  ---

  ## Abstract
@@ -224,10 +219,9 @@ Repository Scale:

  | Difficulty | Tasks | Percentage | Description |
  | ---------- | ----: | ---------- | ---------------------------------------------------------- |
- | medium | ~30 | 12% | Dependency installation, straightforward fixes, unit tests |
- | hard | ~140 | 58% | Multi-file changes, cross-repo reasoning, runbooks |
- | very_hard | ~10 | 4% | Deep dependency chain analysis, architectural refactoring |
- | expert | 5 | 2% | Linux kernel fault localization |
+ | medium | 21 | 7.5% | Dependency installation, straightforward fixes, unit tests |
+ | hard | 245 | 87.8% | Multi-file changes, cross-repo reasoning, runbooks |
+ | expert | 13 | 4.7% | Kernel/debug fault localization and highest-complexity tasks |

  ---

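The percentages in the added table rows can be checked against the task counts. The following is a minimal sketch and not part of the commit. It assumes the three added rows cover the whole benchmark, and the helper name `percentages` is hypothetical.

```python
# Task counts copied from the added ("+") rows of the difficulty table.
# Assumption: these three rows account for every task in the benchmark.
new_counts = {"medium": 21, "hard": 245, "expert": 13}

# Percentages copied from the removed ("-") rows, for comparison.
old_percentages = {"medium": 12, "hard": 58, "very_hard": 4, "expert": 2}


def percentages(counts):
    """Return each level's share of the total, rounded to one decimal place."""
    total = sum(counts.values())
    return {level: round(100 * n / total, 1) for level, n in counts.items()}


total = sum(new_counts.values())
shares = percentages(new_counts)
old_total_pct = sum(old_percentages.values())

print(f"total tasks: {total}")
for level, share in shares.items():
    print(f"{level}: {new_counts[level]} tasks, {share}%")
print(f"removed rows summed to {old_total_pct}%")
```

Under that assumption the added rows total 279 tasks and the computed shares match the table (7.5%, 87.8%, 4.7%). The removed rows' percentages sum to only 76%.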