Hi,
After successfully reproducing the code for AnyTracker and reviewing the codebase, I noticed that the current implementation appears to use a simple MLP actor–critic architecture trained with PPO on multiple motions, along with domain randomization during training.
I wanted to ask whether the Motion Clustering and Specialist-to-Generalist methods described in the paper are actually implemented in AnyTracker. Additionally, the paper seems to suggest that domain randomization was not included in AnyTracker, which seems to differ from what I observed. In my reproduction, the robot also struggles with contact-rich actions such as fallAndGetup, which the paper reported as being successfully performed.
Please feel free to correct me if I’ve misunderstood any part of the codebase or the paper’s implementation details.
Thank you so much.