Commit 92f7be4
Fix retained x_attn capture without enabling Qwen QKV Freivalds
- Populate RetainedLayerState.x_attn_i8 with the captured GPU x_attn
  (it was always None: the prover used the captured x_attn for QKV accumulators,
  but the verifier fell back to the bridge-derived x_attn, causing a mismatch)
- Keep Qwen supports_qkv_freivalds=false until GPU validation confirms
  that the fix makes Freivalds pass
- Add skipped field to V4VerifyReport for explicit unsupported reporting
- Expose skipped in Python verify dict and Display impl
- Update roadmap: QKV Freivalds gated by profile
1 parent f397405 commit 92f7be4
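
The mismatch described in the first bullet can be illustrated with a minimal Rust sketch. Everything here is an assumption made for illustration: the struct is a one-field stand-in for RetainedLayerState, the helper names (qkv_accumulators, precompute_rt_w, freivalds_check, verifier_x_attn) are hypothetical, and the arithmetic is exact i64 rather than whatever field or modulus the real verifier uses. It shows only the general Freivalds idea: the verifier checks r · acc == (rᵀW) · x, so the check can only pass when the verifier uses the same x_attn that the prover multiplied.

```rust
/// Hypothetical one-field stand-in for the retained per-layer state.
struct RetainedLayerState {
    /// Captured GPU x_attn (int8). `None` models the pre-fix behaviour.
    x_attn_i8: Option<Vec<i8>>,
}

/// Prover side (assumed): accumulators acc = W * x, widened to i64.
fn qkv_accumulators(w: &[Vec<i8>], x: &[i8]) -> Vec<i64> {
    w.iter()
        .map(|row| {
            row.iter()
                .zip(x)
                .map(|(&a, &b)| a as i64 * b as i64)
                .sum::<i64>()
        })
        .collect()
}

/// Verifier precomputation (assumed): v = r^T W, one entry per column of W.
fn precompute_rt_w(r: &[i64], w: &[Vec<i8>]) -> Vec<i64> {
    let cols = w[0].len();
    (0..cols)
        .map(|j| {
            r.iter()
                .zip(w)
                .map(|(&ri, row)| ri * row[j] as i64)
                .sum::<i64>()
        })
        .collect()
}

/// Freivalds-style check: accept iff r . acc == v . x.
fn freivalds_check(r: &[i64], v: &[i64], x: &[i8], acc: &[i64]) -> bool {
    let lhs: i64 = r.iter().zip(acc).map(|(&a, &b)| a * b).sum();
    let rhs: i64 = v.iter().zip(x).map(|(&a, &b)| a * b as i64).sum();
    lhs == rhs
}

/// Verifier input selection: use the captured x_attn when it is present,
/// otherwise fall back to the bridge-derived value.
fn verifier_x_attn<'a>(state: &'a RetainedLayerState, bridge: &'a [i8]) -> &'a [i8] {
    state.x_attn_i8.as_deref().unwrap_or(bridge)
}
```

With x_attn_i8 left as None, verifier_x_attn returns the bridge-derived vector; if that vector differs from the captured one in any element, the two sides of the check will almost always disagree even though the prover's accumulators are correct.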
2 files changed
Lines changed: 10 additions & 6 deletions
File 1 of 2: @@ -91,7 +91,7 @@ (1 addition, 1 deletion)
File 2 of 2: @@ -148,16 +148,20 @@ (9 additions, 5 deletions)