Skip to content

Pull requests: pytorch/rl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Bugfix] Add MPS float64->float32 downcast BugFix CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3548 opened Mar 6, 2026 by bsprenger Loading…
4 of 6 tasks
[LLM] Wire MATH and Countdown into GRPO and Expert Iteration scripts CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. sota-implementations/
#3546 opened Mar 5, 2026 by vmoens Loading…
[LLM] Add Countdown numbers-game environment CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Documentation Improvements or additions to documentation llm/ LLM-related PR, triggers LLM CI tests
#3545 opened Mar 5, 2026 by vmoens Loading…
[LLM] Add MATH (competition mathematics) environment CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Documentation Improvements or additions to documentation llm/ LLM-related PR, triggers LLM CI tests
#3544 opened Mar 5, 2026 by vmoens Loading…
[LLM] Simplify IFEval reward aggregator CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. llm/ LLM-related PR, triggers LLM CI tests
#3543 opened Mar 5, 2026 by vmoens Loading…
[LLM] Rewrite GSM8K reward function to follow standard GRPO conventions CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. llm/ LLM-related PR, triggers LLM CI tests sota-implementations/
#3542 opened Mar 5, 2026 by vmoens Loading…
[CI] Install torchcodec from source CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Record
#3541 opened Mar 4, 2026 by vmoens Loading…
[Feature] PILCO CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Feature New feature sota-implementations/
#3537 opened Feb 27, 2026 by PSXBRosa Loading…
3 of 6 tasks
[Feature] Added Genesis Environments to TorchRL CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Environments Adds or modifies an environment wrapper Feature New feature
#3536 opened Feb 26, 2026 by ParamThakkar123 Loading…
6 of 10 tasks
[Feature] Added Lazy implementation of priority updates for replaybuffer prototype CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Feature New feature ReplayBuffers
#3507 opened Feb 13, 2026 by ParamThakkar123 Loading…
3 of 10 tasks
[Feature] Added support for TDMPC2 dataset CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Data Data-related PR, will launch data-related jobs Documentation Improvements or additions to documentation Environments Adds or modifies an environment wrapper Feature New feature
#3501 opened Feb 12, 2026 by ParamThakkar123 Loading…
6 of 10 tasks
[Feature] Added OpenEnv environments CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Documentation Improvements or additions to documentation Environments Adds or modifies an environment wrapper Feature New feature llm/ LLM-related PR, triggers LLM CI tests Trainers
#3470 opened Feb 9, 2026 by ParamThakkar123 Loading…
6 of 10 tasks
[Feature] Extended Support delayed spec initialization for exploration modules CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Feature New feature Modules
#3450 opened Feb 5, 2026 by ParamThakkar123 Loading…
3 of 10 tasks
[Feature] Added MCTSPolicyBase, MCTSPolicy, AlphaGoPolicy, AlphaStarPolicy, and MuZeroPolicy CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Documentation Improvements or additions to documentation Feature New feature Modules
#3449 opened Feb 5, 2026 by ParamThakkar123 Loading…
6 of 10 tasks
[Feature] Add SGLang backend support to GRPO CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Feature New feature llm/ LLM-related PR, triggers LLM CI tests sota-implementations/
#3437 opened Feb 2, 2026 by vmoens Loading…
[Algorithm] DPO CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Documentation Improvements or additions to documentation llm/ LLM-related PR, triggers LLM CI tests Objectives
#3427 opened Jan 31, 2026 by vmoens Loading…
[Feature] SDPO CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Feature New feature llm/ LLM-related PR, triggers LLM CI tests Objectives
#3425 opened Jan 30, 2026 by vmoens Loading…
5 tasks
[CI] Add path-based triggers for niche workflows CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3403 opened Jan 28, 2026 by vmoens Loading…
[BugFix] Call Transfom._call from reset BugFix CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Transforms
#3385 opened Jan 26, 2026 by ParamThakkar123 Loading…
3 of 10 tasks
[Feature] Incremental TensorStorageCheckpointer CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3344 opened Jan 19, 2026 by vmoens Loading…
[Feature] Add _Contiguous module and reshape improvements to encoders/decoders CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3306 opened Jan 8, 2026 by vmoens Loading…
[BugFix] Fix SliceSampler for torch.compile compatibility CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3298 opened Jan 8, 2026 by vmoens Loading…
Fix Habitat CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3065 opened Jul 14, 2025 by vmoens Loading…
[Algorithm] DPO CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. enhancement New feature or request
#3025 opened Jun 23, 2025 by vmoens Loading…
[Feature, Example] A3C Atari Implementation for TorchRL CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. new algo New algorithm request or PR
#3001 opened Jun 15, 2025 by simeetnayan81 Loading…
3 of 9 tasks
ProTip! Adding no:label will show everything without a label.