reset recurrent state after cudagraph warmup by valtterivalo · Pull Request #579 · PufferAI/PufferLib

valtterivalo · 2026-05-29T18:06:14Z

i had parity issues between CUDA trained models and local Metal evals and noticed this. we restore all kinds of state but not recurrent for some reason. could be intentional but i'm not aware of a reason

this can be repro'd on breakout with default config for example where reading buffer_states[0] after construction shows 98304/131072 bytes nonzero. this PR makes that 0/131072

it's +5 lines for something CUDA to CUDA users will never feel but it affected me so might as well PR

Warmup advances per-buffer and frozen-bank MinGRU state but never restored it, unlike weights, momentum, and RNG state. A fresh PuffeRL now starts from zero recurrent state.

Reset recurrent state corrupted by cudagraph warmup

0bade50

Warmup advances per-buffer and frozen-bank MinGRU state but never restored it, unlike weights, momentum, and RNG state. A fresh PuffeRL now starts from zero recurrent state.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

reset recurrent state after cudagraph warmup#579

reset recurrent state after cudagraph warmup#579
valtterivalo wants to merge 1 commit into
PufferAI:5.0from
valtterivalo:fix/recurrent-state-cudagraph-warmup

valtterivalo commented May 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

valtterivalo commented May 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant