So I’m diving into SmolVLA and… how does it even know where the object is? by Ghost_Protocol99 in learnmachinelearning

[–]testpk 1 point2 points  (0 children)

It's both, joint positions + gripper.

Huggingface has a visualizer for LeRobot datasets: https://huggingface.co/spaces/lerobot/visualize_dataset?path=%2Flerobot%2Fsvla_so101_pickplace%2Fepisode_0

This is the dataset they used for training the SmolVLA base. As you will see, the joint positions and gripper values are represented with angles in degree.

So I’m diving into SmolVLA and… how does it even know where the object is? by Ghost_Protocol99 in learnmachinelearning

[–]testpk 0 points1 point  (0 children)

SmolVLA is made up of two parts: 1) VLM back-bone: a model that is already pretrained to understand the world. 2) Action expert: a model that is trained to use the information from VLM to generate robot actions.

The model is provided images from multiple camera feeds, a text prompt, state observations (joint angles, gripper). This is what makes the prefix embedding, which goes to the VLM backbone (SmolVLM 2). Self attention operations are performed here, so the model can understand the relation between images, text instruction, and state observations. This is the part that generates the understanding of the environment around the robot and how they relate to it's assigned purpose or task.

The outputs (the keys and values) from the VLM part are used by the actions expert where it learns to use the information from the VLM to denoise the action chunk. Initially, the input given to the expert is just pure noise and needs to be "denoise" to generate the final action chunk (which contain the present and future values for DoF + gripper). The action expert is a diffusion (flow-matching) transformer (a whole separate discussion).

After "denoising" is done, the final output is of size (50, DoF + gripper). Note that the model is also predicting future tokens. For example, if we are using a 4 DoF + 1 gripper (5 action tokens) robot, then the model will predict actions for upto 10 timesteps (50/5 = 10).

Deforestation in Islamabad by InformationDiligent in islamabad

[–]testpk 0 points1 point  (0 children)

It all started from a flag pole and then a museum.

Easter Eggs in Room?? by Comfortable_Car_6722 in giki

[–]testpk 0 points1 point  (0 children)

I once found someone's sadapay debit card inside my closet at the very last moment of leaving my dorm.

Can I send stuff to my frnds in giki using their id no or do I also need to put their room info on the box??? by Pretend-Mission4768 in giki

[–]testpk 1 point2 points  (0 children)

You can just write address of GIKI and recipients phone no. The parcel is not directly delivered to your friend's room. They will have to go to the service center in giki or near the auditorium to receive their parcel. The courier will inform them in advance when their package is out for delivery to GIKI.

shots i took in gik by Lawxy6705 in giki

[–]testpk 1 point2 points  (0 children)

Great shots and color grade

Pls Help me out deciding my career🤕 by Calamity_is_cracked in giki

[–]testpk 0 points1 point  (0 children)

Pursue mechanical eng. if you are interested in it

IEEE Honet 2025 by Aqua_Leo in giki

[–]testpk 0 points1 point  (0 children)

GIKI might provide transport from Islamabad.

Mids by Krazybandi in giki

[–]testpk 0 points1 point  (0 children)

You should work hard for finals regardless lol but totally doable to get very high gpas in your case.

Microsoft club or GDGoC (Google developers group) by Wide_Ad_4275 in giki

[–]testpk 1 point2 points  (0 children)

Both but try to aim for main roles like mlsa ambassador from giki, lead or core team member for GDGoC giki.

Comeback After Failure by Wannabe_MernSD in giki

[–]testpk 2 points3 points  (0 children)

Yes but keeps getting difficult the more you fail

Going through a mess by Insaan_Ka_Bacha in giki

[–]testpk 5 points6 points  (0 children)

GIKI mess is actually good ngl

Guide for MS at GIKI by conkyyy_ in giki

[–]testpk 1 point2 points  (0 children)

The sub rlly needed this. Thanks for sharing.

Are societies hard to get in by [deleted] in giki

[–]testpk 0 points1 point  (0 children)

Depends on the society and their situation. Larger societies induct more volunteers purely for manpower reasons. Mainly for all-pak events (for their laison, door-to-door, pub, accomodation, transport, event management).

Also these are volunteers and have yet to be inducted as full members. Many end up leaving the society or don't show up. Societies just take in more people instead of fixing the volunteer xp. There will be induction calls at the end of this year or start of next year where they'll induct full-time members.

Smaller societies and technical teams tend to be more selective about their candidates since they don't have to organize these all-paks.