[docs] Document Gym + RL integration design #1762

ananthsub · 2026-01-12T12:57:02Z

What does this PR do ?

Part of NVIDIA-NeMo/Gym#292

This PR documents the NeMo RL + Gym integration, which includes:

The Ray actor bridge code in RL that initializes & launches Gym, and how Gym re-uses the Ray cluster info
How RL prepares its vLLM servers for Gym to proxy through to, so inference logic is contained within RL
The training loop flow for how RL sends request data to Gym and how the data is translated between Gym and RL formats

Issues

NVIDIA-NeMo/Gym#292

Usage

You can potentially add a usage example below

# Add a code snippet demonstrating how to use this

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you run the unit tests and functional tests locally? Visit our Testing Guide for how to run tests
Did you add or update any necessary documentation? Visit our Document Development Guide for how to write, build and test the docs.

Additional Information

...

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>

[docs] Add gym + rl design integration

9527953

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>

ananthsub requested a review from bxyu-nvidia January 12, 2026 12:57

ananthsub added the documentation Improvements or additions to documentation label Jan 12, 2026

ananthsub temporarily deployed to nemo-ci January 12, 2026 12:57 — with GitHub Actions Inactive

ananthsub temporarily deployed to nemo-ci January 12, 2026 13:00 — with GitHub Actions Inactive

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[docs] Document Gym + RL integration design #1762

[docs] Document Gym + RL integration design #1762

Uh oh!

ananthsub commented Jan 12, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

[docs] Document Gym + RL integration design #1762

Are you sure you want to change the base?

[docs] Document Gym + RL integration design #1762

Uh oh!

Conversation

ananthsub commented Jan 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do ?

Issues

Usage

Before your PR is "Ready for review"

Additional Information

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ananthsub commented Jan 12, 2026 •

edited

Loading