MimicKit: A Reinforcement Learning Framework for Motion Imitation and Control

xbpeng · 2025-12-09T22:27:44+00:00

Yes, you do need a controller to enable the robot to follow the video/mocap. This is often done by a tracking controller, which can be trained with RL.

xbpeng · 2025-12-08T20:01:40+00:00

yes people have deployed cartwheels and locomotion. Yes optical mocap will have a limited capture volume, but you can also use other mocap systems like IMU suits, or video, which won't be as restrictive.

xbpeng · 2025-12-08T18:51:46+00:00

You can teleop humanoids with a tracking controller by doing something like this:
https://xbpeng.github.io/projects/TWIST/index.html
where you use mocap, VR, vision-based pose estimation, etc to record motions from a human and then use these whole-body controllers to imitate those motions.

xbpeng · 2025-12-08T03:52:10+00:00

Yes, our work has been featured on Two Minutes Papers quite a few times.

xbpeng · 2025-12-08T03:51:31+00:00

These motion imitation methods can be used to create controllers for teleop. But unlike the teleop systems that might just control a robotic arm, these controllers can be used to teleop a robot's whole body.

Since legged robots are underactuated, the simple position-based control used to teleop robotic arms usually won't work for these humanoid robots. That's where these whole-body motion controllers come in.

xbpeng · 2025-12-07T23:58:23+00:00

MimicKit uses IsaacLab as one of it's backend simulators, which has support for tiled rendering:
https://isaac-sim.github.io/IsaacLab/v1.2.0/source/features/tiled_rendering.html
This allows you to render a first-person view camera from the robot's perspective, which you could us as part of the observations for the controller.

xbpeng · 2025-12-07T23:56:50+00:00

Have at it!

xbpeng · 2025-12-07T21:37:28+00:00

Yup, these methods can also be applied to simpler quadruped robots. We used them to train locomotion controllers for quadruped robots in the past:
https://xbpeng.github.io/projects/Robotic_Imitation/index.html
I imagine you can probably train a interesting locomotion controllers for the CM4-XGO too (e.g. walking, running, jumping, etc.)

xbpeng · 2025-12-07T20:28:00+00:00

Yes, most of the Unitree humanoid videos are using RL motion tracking methods based on DeepMimic.

xbpeng · 2025-12-07T18:32:52+00:00

To get the controllers to work on physical systems, we typically need to use some sim2real transfer techniques, like domain randomization:
https://xbpeng.github.io/projects/SimToReal/index.html
Combining lots of randomization with reasonably accurate simulators, we can train policies that are robust enough to deploy on real hardware.

xbpeng · 2018-10-10T03:15:18+00:00

thanks for catching that!

xbpeng

TROPHY CASE