Pull requests: pytorch/rl

Pull requests list

[Feature] MCP and HTTP tools, agentic tutorial, see-also pointers
Labels: CLA Signed, Documentation, Feature, llm/, tutorials/
#3737 opened May 10, 2026 by vmoens (Collaborator)
[Feature] ToolCompose with parallel dispatch, builtin tools, legacy adapter
Labels: Benchmarks, CLA Signed, Feature, llm/
#3736 opened May 10, 2026 by vmoens (Collaborator)
[Feature] Agentic toolkit foundation: protocols, parsers, sandbox, REPL
Labels: CLA Signed, Documentation, Feature, llm/
#3735 opened May 10, 2026 by vmoens (Collaborator)
[Feature] Adds HERReplayBuffer and HindsightStrategy to torchrl.data.
Labels: CLA Signed, Feature, ReplayBuffers
#3734 opened May 10, 2026 by theap06 (Contributor)
Revert "[Performance] Preallocate collector rollout writes"
Labels: ci-no-td, CLA Signed, Collectors, Integrations/torch_geometric, Integrations
#3732 opened May 8, 2026 by vmoens (Collaborator)
[Performance] Preallocate collector rollout writes
Labels: CLA Signed, Collectors, Integrations/torch_geometric, Integrations, Performance
#3731 opened May 8, 2026 by vmoens (Collaborator)
[Performance] Add Isaac collector policy profiling modes
Labels: CLA Signed, Examples, Performance
#3730 opened May 8, 2026 by vmoens (Collaborator)
[BugFix] Fix KeyError in PettingZoo action mask with ParallelEnv and done_on_any=False
Labels: BugFix, CLA Signed, Environments/pettingzoo, Environments, Objectives
#3727 opened May 8, 2026 by dashitongzhi
Bump vllm from 0.14.1 to 0.20.0 in /sota-implementations/grpo
Labels: CLA Signed, Dependencies, python, sota-implementations/
#3708 opened May 6, 2026 by dependabot (Bot)
[BugFix] Check agent presence before updating action mask in PettingZoo
Labels: BugFix, Environments/pettingzoo, Environments
#3703 opened May 5, 2026 by nshoman
3 of 10 tasks
[Feature] MuJoCo custom envs with selectable physics backend
Labels: CI, CLA Signed, Documentation, Environments, Feature
#3700 opened May 1, 2026 by vmoens (Collaborator)
6 of 7 tasks
[Feature] Add functorch integration tests for TensorDictModule - Fixes #154
Labels: CLA Signed, Feature, Modules
#3697 opened Apr 30, 2026 by ParamThakkar123 (Contributor)
[CI] Add ruleset JSON requiring lint-done on protected branches
Labels: CI, CLA Signed
#3690 opened Apr 28, 2026 by vmoens (Collaborator)
3 tasks
[CI] Selective PR test matrix gated by changed-files + ciflow/* labels
Labels: CI, CLA Signed
#3674 opened Apr 27, 2026 by vmoens (Collaborator)
4 of 8 tasks
Bump transformers from 4.52.4 to 5.0.0rc3 in /sota-implementations/expert-iteration
Labels: CLA Signed, Dependencies, python, sota-implementations/
#3601 opened Apr 8, 2026 by dependabot (Bot)
[CI] Install torchcodec from source
Labels: CI, CLA Signed, Record
#3541 opened Mar 4, 2026 by vmoens (Collaborator)
[Feature] Added Lazy implementation of priority updates for replaybuffer prototype
Labels: CLA Signed, Feature, ReplayBuffers
#3507 opened Feb 13, 2026 by ParamThakkar123 (Contributor)
3 of 10 tasks
[Feature] Added support for TDMPC2 dataset
Labels: CI, CLA Signed, Data, Documentation, Environments, Feature
#3501 opened Feb 12, 2026 by ParamThakkar123 (Contributor)
6 of 10 tasks
[Feature] Added OpenEnv environments
Labels: CI, CLA Signed, Documentation, Environments, Feature, llm/, Trainers
#3470 opened Feb 9, 2026 by ParamThakkar123 (Contributor)
6 of 10 tasks
[Feature] Extended Support delayed spec initialization for exploration modules
Labels: CLA Signed, Feature, Integrations/torch_geometric, Integrations, Modules
#3450 opened Feb 5, 2026 by ParamThakkar123 (Contributor)
3 of 10 tasks
[Feature] Added MCTSPolicyBase, MCTSPolicy, AlphaGoPolicy, AlphaStarPolicy, and MuZeroPolicy
Labels: CLA Signed, Documentation, Feature, Modules
#3449 opened Feb 5, 2026 by ParamThakkar123 (Contributor)
6 of 10 tasks
[Algorithm] DPO
Labels: CLA Signed, Documentation, llm/, Objectives
#3427 opened Jan 31, 2026 by vmoens (Collaborator)
[Feature] SDPO
Labels: CLA Signed, Feature, llm/, Objectives
#3425 opened Jan 30, 2026 by vmoens (Collaborator)
5 tasks
[CI] Add path-based triggers for niche workflows
Labels: CI, CLA Signed
#3403 opened Jan 28, 2026 by vmoens (Collaborator)
[BugFix] Call Transform._call from reset
Labels: BugFix, CLA Signed, Transforms
#3385 opened Jan 26, 2026 by ParamThakkar123 (Contributor)
3 of 10 tasks