-
Notifications
You must be signed in to change notification settings - Fork 440
Pull requests: pytorch/rl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Bugfix] Add MPS float64->float32 downcast
BugFix
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3548
opened Mar 6, 2026 by
bsprenger
Loading…
4 of 6 tasks
[LLM] Wire MATH and Countdown into GRPO and Expert Iteration scripts
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
sota-implementations/
#3546
opened Mar 5, 2026 by
vmoens
Loading…
[LLM] Add Countdown numbers-game environment
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Documentation
Improvements or additions to documentation
llm/
LLM-related PR, triggers LLM CI tests
#3545
opened Mar 5, 2026 by
vmoens
Loading…
[LLM] Add MATH (competition mathematics) environment
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Documentation
Improvements or additions to documentation
llm/
LLM-related PR, triggers LLM CI tests
#3544
opened Mar 5, 2026 by
vmoens
Loading…
[LLM] Simplify IFEval reward aggregator
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
llm/
LLM-related PR, triggers LLM CI tests
#3543
opened Mar 5, 2026 by
vmoens
Loading…
[LLM] Rewrite GSM8K reward function to follow standard GRPO conventions
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
llm/
LLM-related PR, triggers LLM CI tests
sota-implementations/
#3542
opened Mar 5, 2026 by
vmoens
Loading…
[CI] Install torchcodec from source
CI
Has to do with CI setup (e.g. wheels & builds, tests...)
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Record
#3541
opened Mar 4, 2026 by
vmoens
Loading…
[Feature] PILCO
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Feature
New feature
sota-implementations/
#3537
opened Feb 27, 2026 by
PSXBRosa
Loading…
3 of 6 tasks
[Feature] Added Genesis Environments to TorchRL
CI
Has to do with CI setup (e.g. wheels & builds, tests...)
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Environments
Adds or modifies an environment wrapper
Feature
New feature
#3536
opened Feb 26, 2026 by
ParamThakkar123
Loading…
6 of 10 tasks
[Feature] Added Lazy implementation of priority updates for replaybuffer prototype
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Feature
New feature
ReplayBuffers
#3507
opened Feb 13, 2026 by
ParamThakkar123
Loading…
3 of 10 tasks
[Feature] Added support for TDMPC2 dataset
CI
Has to do with CI setup (e.g. wheels & builds, tests...)
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Data
Data-related PR, will launch data-related jobs
Documentation
Improvements or additions to documentation
Environments
Adds or modifies an environment wrapper
Feature
New feature
#3501
opened Feb 12, 2026 by
ParamThakkar123
Loading…
6 of 10 tasks
[Feature] Added OpenEnv environments
CI
Has to do with CI setup (e.g. wheels & builds, tests...)
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Documentation
Improvements or additions to documentation
Environments
Adds or modifies an environment wrapper
Feature
New feature
llm/
LLM-related PR, triggers LLM CI tests
Trainers
#3470
opened Feb 9, 2026 by
ParamThakkar123
Loading…
6 of 10 tasks
[Feature] Extended Support delayed spec initialization for exploration modules
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Feature
New feature
Modules
#3450
opened Feb 5, 2026 by
ParamThakkar123
Loading…
3 of 10 tasks
[Feature] Added MCTSPolicyBase, MCTSPolicy, AlphaGoPolicy, AlphaStarPolicy, and MuZeroPolicy
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Documentation
Improvements or additions to documentation
Feature
New feature
Modules
#3449
opened Feb 5, 2026 by
ParamThakkar123
Loading…
6 of 10 tasks
[Feature] Add SGLang backend support to GRPO
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Feature
New feature
llm/
LLM-related PR, triggers LLM CI tests
sota-implementations/
#3437
opened Feb 2, 2026 by
vmoens
Loading…
[Algorithm] DPO
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Documentation
Improvements or additions to documentation
llm/
LLM-related PR, triggers LLM CI tests
Objectives
#3427
opened Jan 31, 2026 by
vmoens
Loading…
[Feature] SDPO
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Feature
New feature
llm/
LLM-related PR, triggers LLM CI tests
Objectives
#3425
opened Jan 30, 2026 by
vmoens
Loading…
5 tasks
[CI] Add path-based triggers for niche workflows
CI
Has to do with CI setup (e.g. wheels & builds, tests...)
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3403
opened Jan 28, 2026 by
vmoens
Loading…
[BugFix] Call Transfom._call from reset
BugFix
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Transforms
#3385
opened Jan 26, 2026 by
ParamThakkar123
Loading…
3 of 10 tasks
[Feature] Incremental TensorStorageCheckpointer
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3344
opened Jan 19, 2026 by
vmoens
Loading…
[Feature] Add _Contiguous module and reshape improvements to encoders/decoders
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3306
opened Jan 8, 2026 by
vmoens
Loading…
[BugFix] Fix SliceSampler for torch.compile compatibility
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3298
opened Jan 8, 2026 by
vmoens
Loading…
Fix Habitat
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3065
opened Jul 14, 2025 by
vmoens
Loading…
[Algorithm] DPO
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
enhancement
New feature or request
#3025
opened Jun 23, 2025 by
vmoens
Loading…
[Feature, Example] A3C Atari Implementation for TorchRL
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
new algo
New algorithm request or PR
#3001
opened Jun 15, 2025 by
simeetnayan81
Loading…
3 of 9 tasks
Previous Next
ProTip!
Adding no:label will show everything without a label.