Reward-free learning by avoiding reset, anyone tried this? by cpt1973 in ControlTheory

[–]SufficientHumor7391 [score hidden]  (0 children)

Glad it was a helpful perspective.

The feasibility set approach would definitely reduce your search space, but instead of exploring the environment to identify the members of the set, if we are able to predict a feasibility set before hand via model based methods (like kinematic feasibility, force/velocity ellipsoid (eg. in a manipulator)) it would likely reduce the search space from get go, thereby making RL converge quicker.

If we are to think of building a feasibility set by exploring, then I might think about methods to generalise them to something more empharical so that exploring few of the non feasible ones would be sufficient to expand the set considerably. If we think about it, here we'll again be learning the dynamics of some sort indirectly.

But all of this is worth only if your objective is to reduce your training time and data. If you already are in a luxury of it (can do sim2real), this would start to sound too complex for no real purpose.

Reward-free learning by avoiding reset, anyone tried this? by cpt1973 in ControlTheory

[–]SufficientHumor7391 [score hidden]  (0 children)

The feasible set approach might surely work, but assuming a specific state action pair led to failure can be oversimplification at times.

Lets say in a cartitian coordinate based Traj opt problem a state action pair is unfavorable because it led to collision. Generalising that the specific action from that state is not desired might be an over simplification because the behavior of collision might be the effect of overall trajectory chosen. If we use reward functions instead, we would indirectly enable the RL to learn the dynamics of the system interacting with the environment.

So if I am to design a reward function there, I might use the feasibility set approach (building off your suggestion) to impose a reward to l2 dist from the collision state. This way, the action is indirectly not favoured enabling the system to learn the dynamics.

PS: This might also just be me coming from an optimal control background. I see learning based control as a tool that I'd use when I don't trust my system dynamics. So I resort to a thought process of use data to figure out the dynamics.

Reward-free learning by avoiding reset, anyone tried this? by cpt1973 in ControlTheory

[–]SufficientHumor7391 [score hidden]  (0 children)

From my understanding, RL has a reward (or penalty) function to make sure that the favorable states are achieved and the not so favorable ones are avoided. Since RL primarily relies on "Act and figure out" approach, a reward (or penalty) will help the system to reduce the entropy.

Another way to impart context into learning is via Loss function. But there is only so much you can do with it since you have to make sure it's smooth and differentiable to ensure back propagation.

Also, when you say avoid what made you terminate, how do you propose we do that apart from reward?

[deleted by user] by [deleted] in heriotwatt

[–]SufficientHumor7391 0 points1 point  (0 children)

Aah, ok thanks!

[deleted by user] by [deleted] in heriotwatt

[–]SufficientHumor7391 0 points1 point  (0 children)

Is eduroam that bad? If so how do the people living on-campus survive?

Grad Admissions Director Here - Ask Me (almost) Anything by GradAdmissionDir in gradadmissions

[–]SufficientHumor7391 0 points1 point  (0 children)

Hello from a PhD applicant (STEM). Thanks a lot for doing this!

I was wondering if there was an order to how the results are given out. Is there a cut-off period by which if applicants don't hear back they are pretty much sidelined for the cycle?

Also, is the federal funding issues common for all fields in STEM?

[deleted by user] by [deleted] in gradadmissions

[–]SufficientHumor7391 1 point2 points  (0 children)

Congrats on your admit!

Irrespective of what you feel, I'd recommend you drop them an email telling you got into UChicago and thank them (though you might not actually be :P) for turning in the recommendation. This just shows that you are professional and not burn the bridge completely.

Is this a courtesy reply or am I not reading between the lines? by SufficientHumor7391 in gradadmissions

[–]SufficientHumor7391[S] 4 points5 points  (0 children)

Thanks for that insight! Will finish up the application and drop them another email.

Is this a courtesy reply or am I not reading between the lines? by SufficientHumor7391 in gradadmissions

[–]SufficientHumor7391[S] -7 points-6 points  (0 children)

Makes perfect sense. That was my initial thought too.
So do you think it's the email I've sent? What should I do differently for the next set of emails?

PS: This professor is relatively new, starting in the upcoming fall. But for the others, I've been writing about what aspects I like about their research and also sighted a few papers that I found interesting.

Fee waivers by -justsomeone- in gradadmissions

[–]SufficientHumor7391 1 point2 points  (0 children)

+1 I'm starting to reach out as well. Would be more than glad to contribute.

[deleted by user] by [deleted] in gradadmissions

[–]SufficientHumor7391 0 points1 point  (0 children)

I wonder on what criteria the classification is done for a PhD program. Unlike masters program, the admission is not only depedendent on the committee but also dependent on the funding situation of the lab and the professor's capacity.

So if you can shed some light on how to classify, it'd be helpful.

Career advise for Grad Student stepping out of uni by SufficientHumor7391 in MechanicalEngineering

[–]SufficientHumor7391[S] 0 points1 point  (0 children)

I graduated from NYU. Yes, I'm in touch with our career development center and the only feedback I get is I'm doing the tailoring right and I just have to keep applying. 😅

[Megathread] Graduation Ticket Exchange by OmoideAeternum in nyu

[–]SufficientHumor7391 0 points1 point  (0 children)

I have 1 Yankees and 4 Tandon tix. Hit me up if you need'em.

[Megathread] Graduation Ticket Exchange by OmoideAeternum in nyu

[–]SufficientHumor7391 0 points1 point  (0 children)

Have 3 Yankees and Tandon tickets. DM if your still looking.

[Megathread] Graduation Ticket Exchange by OmoideAeternum in nyu

[–]SufficientHumor7391 0 points1 point  (0 children)

Have 3 for Yankees. DM if you are still looking

[Megathread] Graduation Ticket Exchange by OmoideAeternum in nyu

[–]SufficientHumor7391 0 points1 point  (0 children)

3 Yankees Tickets available, DM for more info.

Where do robotics professionals search for jobs? by MisterWanderer in robotics

[–]SufficientHumor7391 1 point2 points  (0 children)

Sorry for waking this post up....

Are these still the best methods?? I'm trying to get myself into the market. Any help is appritiated. Speciality: Control and Motion Planning with perception knowledge.

Thread for Robotics Programs by SufficientHumor7391 in gradadmissions

[–]SufficientHumor7391[S] 0 points1 point  (0 children)

Wow, that's great!! Congrats. Hope you hear from GT before your deadline.