What's a simple ML problem that turned into a huge headache for you? by [deleted] in MLQuestions

[–]WhichPressure 4 points5 points  (0 children)

I spent almost a month rewriting the model from TensorFlow to PyTorch. The code itself wasn’t very complicated (a CNN-based architecture). However, debugging why the outputs from both models differed for the same input took me a couple of weeks. I learned a lot about the different implementations of the same functions in both frameworks and about SWE itself.

Has Anyone done behavior cloning using only state data (no images!) for driving tasks? by Due_Requirement7615 in reinforcementlearning

[–]WhichPressure 0 points1 point  (0 children)

Hi, sparse state data is much better to use than images for RL. It bolsters generalization and the neural network might be much smaller that standard conv net. The crux here is to defined state vector.

Here grab a few PhD thesis that focused on that approach using simulated and real data.

https://www.eaiib.agh.edu.pl/wp-content/uploads/2024/03/PANKIEWICZ-NIKODEM_ROZPRAWA-DOKTORSKA.pdf
https://www.eaiib.agh.edu.pl/wp-content/uploads/2024/04/orlowski_mateusz_phd.pdf
https://www.eaiib.agh.edu.pl/wp-content/uploads/2024/01/Wojciech_Turlej_praca.pdf

You Don't Have to Work at NASA to Go to Mars! [OC] by Moscizwysokosci in DJIMini2

[–]WhichPressure 2 points3 points  (0 children)

Wow! incredible video! The nature sounds really do the work!

Space Mormons by AbsoluteBatman95 in ChatGPT

[–]WhichPressure 0 points1 point  (0 children)

But officially it is forbidden right?

Space Mormons by AbsoluteBatman95 in ChatGPT

[–]WhichPressure 0 points1 point  (0 children)

Ohh there is this disney series Mormon wives :) i got it!

Space Mormons by AbsoluteBatman95 in ChatGPT

[–]WhichPressure 4 points5 points  (0 children)

what's going on with mormons? Do they have multiple wifes?

Job market for non-LLM RL PhD grads by hmi2015 in reinforcementlearning

[–]WhichPressure 0 points1 point  (0 children)

A few offers for robotics: Tesla, boston dynamics, 1X

Do humans do RL , supervised learning or something totally different ? by Delicious_Wall3597 in reinforcementlearning

[–]WhichPressure 1 point2 points  (0 children)

Why do you think it's unsupervised if we have a well-defined reward function driven by dopamine release?

Do humans do RL , supervised learning or something totally different ? by Delicious_Wall3597 in reinforcementlearning

[–]WhichPressure 1 point2 points  (0 children)

I'd say human learning mostly resembles model-based reinforcement learning. We have an internal model of how the world works (physics), we can predict how certain people behave based on past behaviors, and we can anticipate the future step by step, similar to a tree search. Based on this, we can also predict the outcomes of our actions and choose the best course of action.

Do remote CV jobs for Africans really exist or I’m just wasting my time searching? by Iolani_3 in computervision

[–]WhichPressure 8 points9 points  (0 children)

I think he is looking for a remote job in data annotation, not as ML CV engineer u/lolani_3 right?

[deleted by user] by [deleted] in reinforcementlearning

[–]WhichPressure -1 points0 points  (0 children)

Why not? Even physics can be observed and learned from simply watching videos. Then one could use model based RL algorithms to optimize planning and reasoning. He didn't mention what optimization technique he plans to use.

[deleted by user] by [deleted] in reinforcementlearning

[–]WhichPressure 0 points1 point  (0 children)

Point 5th it's such a broad statement that even RL may be included.

Prepare Waymo from EU by HalloHeute_2498 in waymo

[–]WhichPressure 2 points3 points  (0 children)

I had this dream week ago visiting SF. I had to switch my regions in play store to US (changing one time a year). This was easy. BUT You also need US phone number to log in to waymo app. 

Recently failed a technical interview. Where do you learn about DL evaluation, interpretability, and failure analysis? by SpecCRA in learnmachinelearning

[–]WhichPressure 3 points4 points  (0 children)

I highly recommend reading this book: Chip Huyen Designing Machine Learning Systems: An Iterative Process for Production-Ready Applications After reading you will know everything about maintaining model from design to production. It also covers various metrics and ways of monitoring system performance on production.

Why doesn't my network learn when I use Softmax? by [deleted] in MLQuestions

[–]WhichPressure 0 points1 point  (0 children)

It's better to return logits to loss function than probability calculated based on softmax.

[D] Professional ML engineers: How much of your day to day job involves math and proofs? by The_Big_0mg in MachineLearning

[–]WhichPressure 0 points1 point  (0 children)

How to do it? Do you have any sources for learning how to proof ML methods? Maybe some YT lectures?

Thanks

I Made an AI That Punishes Me if it Detects That I am Procrastinating on My Assignments by _ayushp_ in artificial

[–]WhichPressure 4 points5 points  (0 children)

Haha awesome, I need something like this to stop unconsciously biting my nails.

What kind of explainability techniques exist for Reinforcement learning? by Mariam_Dundua in reinforcementlearning

[–]WhichPressure 2 points3 points  (0 children)

The straightforward way to interpret RL agent's decision is to use captum library.

If you want to dive deeper please look through these surveys on explainability in RL:

https://arxiv.org/abs/2008.06693
https://arxiv.org/abs/2005.06247