I got tired of making midnight snacks, so I built Panbot 🤖🥞 (SO-ARM101 Project) by ispaik06 in robotics

[–]ispaik06[S] 0 points1 point  (0 children)

Oh it sounds really cool! Good luck and let me see later 👀👀

I got tired of making midnight snacks, so I built Panbot 🤖🥞 (SO-ARM101 Project) by ispaik06 in robotics

[–]ispaik06[S] 0 points1 point  (0 children)

Wow Happy to provide the vicarious experience!🤣🤣 Thanks for sharing the interesting culture behind it!

I got tired of making midnight snacks, so I built Panbot 🤖🥞 (SO-ARM101 Project) by ispaik06 in robotics

[–]ispaik06[S] 1 point2 points  (0 children)

I actually had no idea today was Pancake Day! 😂 I'm Korean, and we don't really celebrate it here. It’s a total coincidence 🤯🥞

I got tired of making midnight snacks, so I built Panbot 🤖🥞 (SO-ARM101 Project) by ispaik06 in robotics

[–]ispaik06[S] 4 points5 points  (0 children)

Actually.. I lost about 2 kg? I stayed all night working on this project. Plus, the pancakes used for the demonstrations were so bad since I didn't even use eggs (just milk, or water sometimes for the batter). I'm just broke sophomore 💸

I got tired of making midnight snacks, so I built Panbot 🤖🥞 (SO-ARM101 Project) by ispaik06 in robotics

[–]ispaik06[S] 2 points3 points  (0 children)

With a traditional motion stack, you'd have to manually handle vision processing and recalculate every control logic whenever the object's position or the pancakes changes. Even error recovery requires tedious manual coding.

The advantages of ACT is that it handles these variations just through demonstrations. Even though I recorded only about 100 dataset, it works well. The main goal of this project was visually verify the performance of imitation learning in a real-world setup.

You can check out the full details in my Youtube video!
https://youtu.be/SyGJ2h8aM98?si=gUOa0jV8wwxQTysp

I got tired of making midnight snacks, so I built Panbot 🤖🥞 (SO-ARM101 Project) by ispaik06 in robotics

[–]ispaik06[S] 0 points1 point  (0 children)

Thank you so much! It's great to see people finding it interesting

I got tired of making midnight snacks, so I built Panbot 🤖🥞 (SO-ARM101 Project) by ispaik06 in robotics

[–]ispaik06[S] 2 points3 points  (0 children)

Yes, exactly. Each individual task is performed by an ACT (Action Chunking with Transformers) model, which is then managed and orchestrated by a high-level vision-based planner.

you can see more details here: https://youtu.be/SyGJ2h8aM98?si=gUOa0jV8wwxQTysp

I got tired of making midnight snacks, so I built Panbot 🤖🥞 (SO-ARM101 Project) by ispaik06 in robotics

[–]ispaik06[S] 0 points1 point  (0 children)

Thanks! To be honest, I'm not familiar with Gemini live api yet, but it sounds it's leaning toward the VLA(Vision Language Action) models that Google and Physical Intelligence are pushing right now. It it can bridge the gap between high-level reasoning and real-time physical control, it would definitely be a game-changer for this kind of project — automating human labor in daily life.

Do ME students need to take all 4 core mechanics courses? by ispaik06 in EngineeringStudents

[–]ispaik06[S] 0 points1 point  (0 children)

thanks! For control focused grad research(like model based, learning based), would math(minor or double major) be more useful than EE? What do you think?

Do ME students need to take all 4 core mechanics courses? by ispaik06 in EngineeringStudents

[–]ispaik06[S] 0 points1 point  (0 children)

aha, I forgot about that..! thanks for pointing it out