feat: Auto research skill by vinhngx · Pull Request #2419 · NVIDIA-NeMo/RL

vinhngx · 2026-05-06T02:23:57Z

What does this PR do ?

This PR adds an auto research skill that guides agents on how to do a prolonged research session with Nemo-RL and Nemo-gym. It sets some operating guidelines on how to form and test hypotheses, how to organize git branches, how to monitor and report progress, and how to explicitly check for the stopping conditions of the campaign.

Issues

N/A

Usage

You can prompt Codex, such as:

Use the @skill/auto_research skill and train the Qwen-3-VL-2B-instruct model to high accuracy in the Nemo-gym circle click environment. Time budget: 5h

For this skill to be effective, Codex should have sufficient knowledge of the local operating environment (e.g. Slurm or local machine). A prerequisite to using the auto research skill is therefore, for the agent to be able to automatically run a baseline workload on the given environment.

copy-pr-bot · 2026-05-06T02:24:01Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

terrykong · 2026-05-07T20:11:39Z

/claude review

terrykong

Review: PR #2419 — feat: Auto research skill

Nice work — this skill provides a well-structured framework for running iterative RL experiments with git as the experiment journal. The exploration-ideas guide and git-workflow reference are thorough and practical. The safety guardrails in git-workflow.md (no stash/reset/overwrite without consent) are particularly good.

A few suggestions to align with existing repo conventions and improve consistency:

Directory naming mismatch

All other skill directories use hyphens (build-and-dependency, config-conventions, launch-nemo-rl, etc.), but this one uses an underscore (auto_research). The frontmatter name field is auto-research (with hyphen), creating an inconsistency. Consider renaming the directory to auto-research/ to match the convention.

Nemo-gym coverage

The PR description mentions guiding agents on research with "Nemo-RL and Nemo-gym", but the SKILL.md workflow (step 3) only references NeMo-RL paths (examples/run_grpo.py, nemo_rl/models/, etc.). The Nemo-gym entrypoints (examples/nemo_gym/) are not mentioned. Consider either adding Nemo-gym paths to the workflow or adjusting the PR description to match the actual scope.

See inline comments for additional suggestions.

Generated by Claude Code

Signed-off-by: Vinh Nguyen <vinhn@nvidia.com>

vinhngx · 2026-05-12T01:17:11Z

Thanks, @terrykong, for the prompt review. Fixed the reported issues and tightened all 3 skills. Add best practices and gotchas observed with Codex, but could happen with other agents.

Signed-off-by: Vinh Nguyen <vinhn@nvidia.com>

terrykong · 2026-05-15T05:31:04Z

/ok to test a8b8513

terrykong · 2026-05-18T18:19:53Z

/ok to test 54c71ec

yuki-97 · 2026-05-19T12:22:24Z

/ok to test 9880d5e

chtruong814 · 2026-05-20T12:37:46Z

/ok to test 39de685

vinhngx requested a review from a team as a code owner May 6, 2026 02:23

github-actions Bot added the community-request label May 6, 2026

vinhngx force-pushed the vinhn/autoresearch branch from fafbf11 to ca90191 Compare May 6, 2026 02:28

vinhngx changed the title ~~Auto research skill~~ feat: Auto research skill May 6, 2026

claude Bot reviewed May 7, 2026

View reviewed changes

Comment thread skills/auto-research/SKILL.md Outdated

claude Bot reviewed May 7, 2026

View reviewed changes

Comment thread skills/auto-research/SKILL.md

svcnvidia-nemo-ci added the waiting-on-maintainers Waiting on maintainers to respond label May 9, 2026

terrykong reviewed May 11, 2026

View reviewed changes

Comment thread skills/auto-research/SKILL.md Outdated

Comment thread skills/auto-research/SKILL.md Outdated

Comment thread skills/auto-research/SKILL.md Outdated

Comment thread skills/auto-research/references/git-workflow.md Outdated

Comment thread skills/auto-research/references/git-workflow.md

svcnvidia-nemo-ci added waiting-on-customer Waiting on the original author to respond and removed waiting-on-maintainers Waiting on maintainers to respond labels May 11, 2026

revise auto research skill

bd35c3e

Signed-off-by: Vinh Nguyen <vinhn@nvidia.com>

vinhngx force-pushed the vinhn/autoresearch branch from 29bb26f to 7aee365 Compare May 12, 2026 01:09

vinhngx added 15 commits May 12, 2026 01:22

add Brev skill. Rename auto-research

d5b497c

Signed-off-by: Vinh Nguyen <vinhn@nvidia.com>

add session memory skill

3455e96

Signed-off-by: Vinh Nguyen <vinhn@nvidia.com>

docs: address auto research skill review feedback

6e3250a

Signed-off-by: Vinh Nguyen <vinhn@nvidia.com>

docs: clarify auto research execution environment

c4b9013

Signed-off-by: Vinh Nguyen <vinhn@nvidia.com>

docs: refine auto research skill triggers

41ba60d

Signed-off-by: Vinh Nguyen <vinhn@nvidia.com>

docs: clarify auto research objectives

55d5034

Signed-off-by: Vinh Nguyen <vinhn@nvidia.com>

docs: clarify auto research experiment count stop rule

b623125

Signed-off-by: Vinh Nguyen <vinhn@nvidia.com>

docs: rename auto research experiment count target

7184baf

Signed-off-by: Vinh Nguyen <vinhn@nvidia.com>

docs: link auto research to Brev etiquette

e90907f

Signed-off-by: Vinh Nguyen <vinhn@nvidia.com>

docs: clarify Brev detection for auto research

397de2c

Signed-off-by: Vinh Nguyen <vinhn@nvidia.com>

docs: trigger Brev etiquette on user mention

06e97b3

Signed-off-by: Vinh Nguyen <vinhn@nvidia.com>

docs: require session memory for auto research

c856914

Signed-off-by: Vinh Nguyen <vinhn@nvidia.com>

docs: tidy research support skills

a16b0a9

Signed-off-by: Vinh Nguyen <vinhn@nvidia.com>

docs: add auto research gotchas

43e16a3

Signed-off-by: Vinh Nguyen <vinhn@nvidia.com>

docs: preserve auto research context across handoffs

4cdf4a1

Signed-off-by: Vinh Nguyen <vinhn@nvidia.com>

terrykong approved these changes May 15, 2026

View reviewed changes

terrykong added the CI:docs Run doctest label May 15, 2026

copy-pr-bot Bot had a problem deploying to nemo-ci May 15, 2026 05:31 Failure

svcnvidia-nemo-ci added waiting-on-maintainers Waiting on maintainers to respond and removed waiting-on-maintainers Waiting on maintainers to respond labels May 15, 2026

Merge branch 'main' into vinhn/autoresearch

54c71ec

copy-pr-bot Bot temporarily deployed to public May 18, 2026 18:20 Inactive

copy-pr-bot Bot had a problem deploying to nemo-ci May 18, 2026 18:20 Failure

copy-pr-bot Bot temporarily deployed to public May 18, 2026 18:21 Inactive

copy-pr-bot Bot temporarily deployed to public May 18, 2026 18:25 Inactive

svcnvidia-nemo-ci removed the waiting-on-maintainers Waiting on maintainers to respond label May 18, 2026

Merge branch 'main' into vinhn/autoresearch

9880d5e

copy-pr-bot Bot temporarily deployed to public May 19, 2026 12:22 Inactive

copy-pr-bot Bot had a problem deploying to nemo-ci May 19, 2026 12:22 Failure

copy-pr-bot Bot temporarily deployed to public May 19, 2026 12:22 Inactive

copy-pr-bot Bot temporarily deployed to public May 19, 2026 12:23 Inactive

copy-pr-bot Bot temporarily deployed to public May 19, 2026 12:26 Inactive

copy-pr-bot Bot had a problem deploying to nemo-ci May 20, 2026 09:32 Error

Merge branch 'main' into vinhn/autoresearch

39de685

copy-pr-bot Bot temporarily deployed to public May 20, 2026 12:38 Inactive

copy-pr-bot Bot temporarily deployed to nemo-ci May 20, 2026 12:38 Inactive

copy-pr-bot Bot temporarily deployed to public May 20, 2026 12:38 Inactive

copy-pr-bot Bot temporarily deployed to public May 20, 2026 12:42 Inactive

terrykong merged commit 012bf17 into NVIDIA-NeMo:main May 20, 2026
42 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Auto research skill#2419

feat: Auto research skill#2419
terrykong merged 21 commits into
NVIDIA-NeMo:mainfrom
vinhngx:vinhn/autoresearch

vinhngx commented May 6, 2026

Uh oh!

copy-pr-bot Bot commented May 6, 2026

Uh oh!

terrykong commented May 7, 2026

Uh oh!

Uh oh!

Uh oh!

terrykong left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

vinhngx commented May 12, 2026

Uh oh!

terrykong commented May 15, 2026

Uh oh!

terrykong commented May 18, 2026

Uh oh!

yuki-97 commented May 19, 2026

Uh oh!

chtruong814 commented May 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

vinhngx commented May 6, 2026

What does this PR do ?

Issues

Usage

Uh oh!

copy-pr-bot Bot commented May 6, 2026

Uh oh!

terrykong commented May 7, 2026

Uh oh!

Uh oh!

Uh oh!

terrykong left a comment

Choose a reason for hiding this comment

Review: PR #2419 — feat: Auto research skill

Directory naming mismatch

Nemo-gym coverage

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

vinhngx commented May 12, 2026

Uh oh!

terrykong commented May 15, 2026

Uh oh!

terrykong commented May 18, 2026

Uh oh!

yuki-97 commented May 19, 2026

Uh oh!

chtruong814 commented May 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants