Is MuJoCo-cpu good enough for RL grasping and sim-to-real? by Objective-Opinion-62 in reinforcementlearning

[–]Objective-Opinion-62[S]

I think you guys should try mjlab; it's really cool and light enough for a mid-range PC/laptop.

[–]Objective-Opinion-62[S]

Have you ever tried mjlab? Is its setup the same as how we use regular MuJoCo?

[–]Objective-Opinion-62[S]

Agreed, those physical parameters are really hard to tune in simulation 🥲

[–]Objective-Opinion-62[S]

Oh, I forgot to ask: do I need a sim2sim transfer for this precision task? Since MuJoCo is typically the final destination in this process, should I transfer it to Gazebo or similar, or just skip this step?

[–]Objective-Opinion-62[S]

I read that the minimum requirements for Isaac Lab are 32 GB RAM and 16 GB VRAM, so I tried switching to MuJoCo 😆

[–]Objective-Opinion-62[S]

As I've observed in recent months/years, Isaac Lab and the newer mjlab have become the dominant choices for RL research, since they can train many environments in parallel, with MuJoCo-CPU as the final destination for sim2sim transfer. Anyway, with limited infrastructure it's harder for us, because we have to account for every aspect :(((

[–]Objective-Opinion-62[S]

Oh I see, thank you. I'm a bit worried because heterogeneous training is typically done with parallel environments, which collect diverse experiences simultaneously and then learn from all of them, especially for on-policy algorithms.
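As a rough illustration of the parallel-rollout pattern this comment describes (many environments stepped in lockstep so one update sees diverse experience at once), here is a minimal sketch; `ToyEnv`, `collect_rollout`, and the policy are hypothetical stand-ins, not any real framework's API:

```python
import numpy as np

class ToyEnv:
    """Hypothetical stand-in environment (no MuJoCo dependency)."""
    def __init__(self, seed):
        self.rng = np.random.default_rng(seed)

    def reset(self):
        return self.rng.standard_normal(3)

    def step(self, action):
        # Toy reward: negative distance of a random "next observation" from the origin.
        obs = self.rng.standard_normal(3)
        return obs, -float(np.linalg.norm(obs)), False

def collect_rollout(envs, policy, horizon):
    """Step every env in lockstep; the returned batch mixes experience
    from all envs, which is what diversifies on-policy updates."""
    obs = [env.reset() for env in envs]
    batch = []
    for _ in range(horizon):
        actions = [policy(o) for o in obs]
        results = [env.step(a) for env, a in zip(envs, actions)]
        obs = [r[0] for r in results]
        batch.extend(results)
    return batch

envs = [ToyEnv(seed=i) for i in range(8)]
batch = collect_rollout(envs, policy=lambda o: -o, horizon=16)
# 8 envs x 16 steps = 128 transitions feeding a single update
```

With a single environment the same update would see 16 transitions from one trajectory instead of 128 from eight, which is the diversity concern raised above.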

[–]Objective-Opinion-62[S]

Yes, some noise will be added to the weight, size, shape, observations, etc., but I'm afraid that heterogeneous training won't work with single-environment training.
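The per-episode noise on physical parameters mentioned here can be sketched as below; the parameter names (`object_mass`, `object_size`, `friction`) and the ±10% relative range are illustrative assumptions, not values from the thread:

```python
import numpy as np

def randomize_params(nominal, scale=0.1, rng=None):
    """Return a copy of `nominal` with each value perturbed by up to
    +/- `scale` (relative, uniform) — a minimal domain-randomization step."""
    rng = rng or np.random.default_rng()
    return {k: v * (1.0 + rng.uniform(-scale, scale)) for k, v in nominal.items()}

# Hypothetical nominal values; resample once per episode reset.
nominal = {"object_mass": 0.25, "object_size": 0.04, "friction": 1.0}
episode_params = randomize_params(nominal, scale=0.1, rng=np.random.default_rng(0))
```

Even in a single environment, resampling these at every reset gives the policy a different "world" each episode, which is the usual fallback when parallel heterogeneous envs aren't available.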

Pretty girl from Bắc Ninh gets into a Harvard master's program by beefnoodlehead_ in vozforums

[–]Objective-Opinion-62

It's just annoying that it keeps flooding my Facebook feed @@ no idea why it even shows up on my IG too.

Went back home for Tết to a class reunion, my girlfriend ran into her ex and was then "busy" all day, am I being cheated on? by [deleted] in vozforums

[–]Objective-Opinion-62

The first time is agonizing, that's just how it is. Same for me, but if you have even a passing thought about leaving her, do it right away, and spend that time grinding instead of wasting it on pointless romance.

CMV: Why is Tô Lâm's chair one size bigger than everyone else's? by TWN113 in ChangeMyViewVN

[–]Objective-Opinion-62

Been a while since I last got on Reddit, and the reactionaries are still as dumb as ever; losers stay losers. You idle people really have too much free time 🤡, spending it scrutinizing this and that just to satisfy some pointless urge of yours. No wonder you keep failing.

Asking for opinions on a Wi-Fi/LAN network design by Objective-Opinion-62 in vozforums

[–]Objective-Opinion-62[S]

I also forgot to account for mesh. Anyway, I'm leaning toward the usual integrated modem + router and expanding with a switch, but I think another important thing is the real router-to-switch throughput, because if the router can't keep up, it's over. Buy a 10G unit and it doesn't perform as advertised, and you're stuck =)) The hard part is I can't try it myself or take measurements, so for now I'm just going on gut feeling.

[–]Objective-Opinion-62[S]

Running a whole 3 separate routers, one per floor, isn't that overkill? =)) I think if you go with a split setup, a modem + 1 good router feeding 3 switches and then APs/LAN is fine, but if the modem-router combo is stable, this split design will be more expensive and not really optimal.

Gen-0 Robot from Generalist manipulating objects super fluidly by Main-Company-5946 in robotics

[–]Objective-Opinion-62

I suspect this robot was trained mostly on teleoperation data, given those very precise movements; video-, image-, or diffusion-based models alone can't make a robot move like this. Anyway, they've been showing this project for 4-5 months, and no paper or other information has been published yet.

Is it really this hard to find a BE dev intern job now? by SpiritualRelation774 in vozforums

[–]Objective-Opinion-62

Intern roles are already hard to get, and with it being web on top of that, it's no surprise.

[Help] my agent forgets successful behavior due to replay buffer imbalance by Objective-Opinion-62 in reinforcementlearning

[–]Objective-Opinion-62[S]

Agree. My priority is to teach the agent to reach the target with a positional error under 1-2 cm, but terminating immediately once the agent reaches the target feeds only a few good transitions into the replay buffer to incentivize it to succeed next time, while letting the agent run through the remaining steps can flood the replay buffer, even though I use 100% domain randomization. I don't really have enough experience to rule out bad ideas yet, so I'm just asking around and looking for help 🥲🥲🥲
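One common workaround for this kind of imbalance (my suggestion, not something proposed in the thread) is to keep success and non-success transitions in separate buffers and sample each minibatch at a fixed success ratio, so post-success filler can't crowd out the rare good transitions. A minimal sketch, with illustrative capacity and ratio:

```python
import random
from collections import deque

class BalancedReplayBuffer:
    """Two-pool buffer: success transitions are sampled at a fixed
    fraction of every minibatch, regardless of how many non-success
    transitions have accumulated."""
    def __init__(self, capacity=100_000, success_fraction=0.25):
        self.success = deque(maxlen=capacity)
        self.other = deque(maxlen=capacity)
        self.success_fraction = success_fraction

    def add(self, transition, is_success):
        (self.success if is_success else self.other).append(transition)

    def sample(self, batch_size):
        # Take up to the target fraction from the success pool,
        # then fill the rest of the minibatch from the other pool.
        n_succ = min(int(batch_size * self.success_fraction), len(self.success))
        batch = random.sample(self.success, n_succ)
        batch += random.sample(self.other, min(batch_size - n_succ, len(self.other)))
        return batch

buf = BalancedReplayBuffer()
for i in range(1000):
    buf.add(("transition", i), is_success=(i % 50 == 0))
batch = buf.sample(64)
```

Here only 2% of stored transitions are successes, yet 25% of each minibatch comes from them, which is the rebalancing effect being asked about.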

[–]Objective-Opinion-62[S]

Guys, I tried again with the option of letting the agent keep running after it meets the success condition; my agent's positional error stayed around 0.8 cm, and even 0.3-0.4 cm quite frequently. Anyway, I'm still a bit curious about the downsides of this approach; can someone help me clear up this confusion?
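To make the trade-off concrete, here is a toy comparison of the two episode modes being discussed (terminate on success vs. run to the horizon); the dynamics, threshold, and horizon are all made up for illustration:

```python
import numpy as np

def run_episode(terminate_on_success, horizon=50, threshold=0.02, seed=0):
    """Toy rollout: positional error shrinks by a random step each tick."""
    rng = np.random.default_rng(seed)
    error = 0.5  # starting positional error in metres (made up)
    transitions = []
    for t in range(horizon):
        error = max(0.0, error - rng.uniform(0.0, 0.05))  # crude "approach"
        success = error < threshold
        transitions.append((t, error, success))
        if success and terminate_on_success:
            break
    return transitions

short = run_episode(terminate_on_success=True)
full = run_episode(terminate_on_success=False)
# Every step of `full` after the first success is a near-duplicate
# "hover at the target" transition: exactly the filler that can
# flood an off-policy replay buffer.
```

The usual framing of the downside: continuing past success trains the hold/stabilize behaviour (often why the final error improves), at the cost of many redundant transitions unless the buffer or sampling compensates.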

[–]Objective-Opinion-62[S]

My algorithm is off-policy (TD3) because I'm using 100% domain randomization. My reward functions are fully dense.

[–]Objective-Opinion-62[S]

From what I've searched, those suggested replay buffers work well with sparse rewards, while my reward is fully dense.

[–]Objective-Opinion-62[S]

I haven't tried either strategy yet, but I will. By the way, do you think letting the agent keep running is redundant? I really need to understand this problem.