Evaluation of an RSL RL policy #333
Conversation
Looks great! Thanks for doing this!
I have some questions about the success term in the LiftObjectTask which I think that we need to sort out.
Suggestion also to add a test. It could just run the policy_runner with the default configs that we have and check that it doesn't crash.
    return [
        LiftSuccessMetric(
            minimum_height=self.minimum_height_to_lift,
            object_name=self.lift_object.name,
        )
    ]
Suggestion: also return SuccessRateMetric(). Actually, all tasks should return that; I'm not sure of a good way of enforcing it.
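One way to enforce this would be a template-method hook in the task base class, so individual tasks can't forget the metric. This is only a sketch; `BaseTask`, `Metric`, and the `get_extra_metrics` hook are assumed names for illustration, not the actual Arena API:

```python
# Sketch: force every task to include SuccessRateMetric by making
# get_metrics() final in spirit and letting tasks override only
# get_extra_metrics(). All class names here are illustrative.

class Metric:
    """Placeholder metric interface."""

class SuccessRateMetric(Metric):
    """Fraction of episodes that reach the success condition."""

class LiftSuccessMetric(Metric):
    """Task-specific metric, e.g. height/goal-reaching details."""

class BaseTask:
    def get_metrics(self) -> list[Metric]:
        # SuccessRateMetric is always present, by construction.
        return [SuccessRateMetric(), *self.get_extra_metrics()]

    def get_extra_metrics(self) -> list[Metric]:
        # Tasks override this instead of get_metrics().
        return []

class LiftObjectTask(BaseTask):
    def get_extra_metrics(self) -> list[Metric]:
        # The real task would construct LiftSuccessMetric with its
        # minimum height and object name.
        return [LiftSuccessMetric()]
```

With this pattern, every task gets SuccessRateMetric for free, and forgetting it is impossible rather than merely discouraged.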
Edit: looking at the code closely, this task doesn't define a success term. This is an (apparently unenforced) assumption in Arena.
What's the reason for that? Is there anything preventing us from defining a success term? Right now the LiftObjectTask won't really work in an IL setting.
The second question is: if we define success, do we need LiftSuccessMetric? I.e., is SuccessRateMetric the same thing? Potentially not, because here success may also entail reaching an (x, y, z) goal, rather than just z > min_height. If that's the case, LiftSuccessMetric does indeed provide more information.
It's a bit tricky in this case.
- The LiftObjectTask actually uses the command term to get its goal location. It's not a certain height but a certain position in space. In RL training, when it reaches the position it waits until timeout to reset.
- As the goal location comes from the command term, it lives in the RL problem and can't be included in the termination config, which is in the IL problem.
- This particular problem is therefore ill-defined in the IL setting. We would need an observation term and to convert the problem into setting a random goal pose as well.
I'm going to write a note here. When using a command manager, the terminations don't have a success term. That makes sense, but how we combine them in our setting is going to be a different question. I would say we need to override the termination cfg by removing the success term in the RL setting. I will implement that.
I have made some changes here, and I think it's the neatest way forward to solve this issue. I'll try to explain below.
- The IL base class needs success.
- The RL class should not have success during training, but needs success during policy evaluation (by our script).
- The RL env sets the goal pose from the command manager, which the IL base class does not have access to.
- Solution: a smarter success condition. It returns False during RL training (enforced with a flag), gets the goal position from the command manager during RL evaluation, and takes the goal pose directly for IL training if the command manager does not exist.
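A rough sketch of that "smarter" success condition, under stated assumptions: `env.command_manager`, `get_command("object_pose")`, and `env.get_object_position` are illustrative names, not the actual Arena/Isaac Lab API, and the tolerance is made up:

```python
# Sketch of the three-branch success term described above.
# All env attribute names here are assumptions for illustration.
import numpy as np

def lift_success(env, rl_training_mode: bool = False,
                 goal_pose=None, pos_tolerance: float = 0.02) -> bool:
    """Success term usable in both RL and IL settings."""
    # 1. During RL training, never terminate on success: let the
    #    episode run to timeout so the policy keeps holding the goal.
    if rl_training_mode:
        return False

    # 2. During RL evaluation, read the goal from the command manager.
    command_manager = getattr(env, "command_manager", None)
    if command_manager is not None:
        goal = np.asarray(command_manager.get_command("object_pose"))[:3]
    # 3. For IL training, fall back to an explicitly provided goal pose.
    elif goal_pose is not None:
        goal = np.asarray(goal_pose)[:3]
    else:
        raise ValueError("No command manager and no goal_pose given.")

    object_pos = env.get_object_position("lift_object")  # assumed helper
    return bool(np.linalg.norm(object_pos - goal) < pos_tolerance)
```

The flag keeps the RL training behavior (wait until timeout) intact, while evaluation and IL reuse the same term with different goal sources.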
    def get_events_cfg(self):
        return self.events_cfg


    def make_termination_cfg(self, rl_training: bool = False, use_command_goal: bool = False):
Is this behavior general to all RLTasks? That is, is the RL task termination always (or at least usually) the IL task termination minus the success condition?
Can we push this to a function or a base class? Maybe the simplest is a function remove_success_condition(super().get_termination_cfg()). Actually, I think that @peterd-NV did this for sequential tasks: he had to remove the success conditions from subtasks when forming the composite.
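A minimal sketch of the suggested helper. The termination-cfg layout is an assumption: a config object whose terms are attributes, where setting a term to None disables it (the usual manager-based config pattern):

```python
# Sketch: derive the RL termination cfg from the IL one by
# stripping the success term, instead of per-task flags.
# The `success` attribute name is an assumption.
import copy

def remove_success_condition(termination_cfg):
    """Return a copy of the termination cfg without its success term."""
    cfg = copy.deepcopy(termination_cfg)  # don't mutate the IL cfg
    if hasattr(cfg, "success"):
        # Setting to None follows the common pattern of disabling a
        # term rather than deleting the attribute.
        cfg.success = None
    return cfg
```

An RL task could then build its termination config as `remove_success_condition(super().get_termination_cfg())` without duplicating the IL definition.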
        target_x_delta: Target range deltas for x [min_delta, max_delta] relative to initial pose (m).
        target_y_delta: Target range deltas for y [min_delta, max_delta] relative to initial pose (m).
        target_z_delta: Target range deltas for z [min_delta, max_delta] relative to initial pose (m).
        rl_training_mode: If True, disables success termination. Set to False for evaluation.
Can we use the underlying (IL) task (LiftObjectTask) for evaluation, rather than passing this flag?
Summary
RSL RL policy evaluation class