Skip to content

reset recurrent state after cudagraph warmup#579

Open
valtterivalo wants to merge 1 commit into
PufferAI:5.0from
valtterivalo:fix/recurrent-state-cudagraph-warmup
Open

reset recurrent state after cudagraph warmup#579
valtterivalo wants to merge 1 commit into
PufferAI:5.0from
valtterivalo:fix/recurrent-state-cudagraph-warmup

Conversation

@valtterivalo
Copy link
Copy Markdown

i had parity issues between CUDA trained models and local Metal evals and noticed this. we restore all kinds of state but not recurrent for some reason. could be intentional but i'm not aware of a reason

this can be repro'd on breakout with default config for example where reading buffer_states[0] after construction shows 98304/131072 bytes nonzero. this PR makes that 0/131072

it's +5 lines for something CUDA to CUDA users will never feel but it affected me so might as well PR

Warmup advances per-buffer and frozen-bank MinGRU state but never restored it, unlike weights, momentum, and RNG state. A fresh PuffeRL now starts from zero recurrent state.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant