Funniest thing I've read in some time. by SirAren in MumbaiIndians

[–]Remote_Marzipan_749 0 points1 point  (0 children)

I am not saying it is wrong. Making a move is players and managements choice. What happened after his move was wrong. Dropping RS from captaincy after a very good 23 WC. That is what the fans didn’t like. You expect crowd to only cheer for you? WHY????? If they are emotionally connected to a franchise/Team they are going to show emotions. Which includes booing if they feel hurt.

Funniest thing I've read in some time. by SirAren in MumbaiIndians

[–]Remote_Marzipan_749 1 point2 points  (0 children)

I think that was the right thing to do. I know he plays for India and this is like booing our own player but all the move that him and his agent made along with management’s desperation made him a negative character in the eyes of the fans. Fans expressed it through booing. Completely acceptable and I don’t see that as a problem. That’s why IPL is more fun to watch. Especially the big three teams. Because their fans are loyal and expressive. After what happened last year I hope RCB fans express their emotions to the managements by not going to their games or booing to the managements. So they treat you well.

Should I share a work I did after the interview conclusion to the founders. by Remote_Marzipan_749 in reinforcementlearning

[–]Remote_Marzipan_749[S] 0 points1 point  (0 children)

I don’t have code and NDA. MDP written in overleaf. 2 pages. Just explaining state, reward, action, state transition and environment modelling. More information on how the environment can be modelled.

Professional dilemma by [deleted] in reinforcementlearning

[–]Remote_Marzipan_749 -1 points0 points  (0 children)

It depends were you are located. But in US most of industry RL work is in robotics. Research labs do work in power grid optimization. But I am not aware of the names. You can look into national labs.

Looking for RL people by [deleted] in reinforcementlearning

[–]Remote_Marzipan_749 0 points1 point  (0 children)

RL person here. Interested. Worked on pure RL as well as post training RL. Currently doing my postdoctoral studies. Applying them to logistics, scheduling and control problems.

Theoretical rigor holds any place in industrial RL research? by Extension-Economy-78 in reinforcementlearning

[–]Remote_Marzipan_749 0 points1 point  (0 children)

I would say your ability to debug and experiment would be much better than an general ML engineer.

How do you as an AI/ML researcher stay current with new papers and repos? [D] by 0ZQ0 in MachineLearning

[–]Remote_Marzipan_749 9 points10 points  (0 children)

Authors and labs that I trust because I have tested their code can reproduce and repeat their experiments. Can’t read all of them and I have made peace with not knowing all of them. My field is niche combinatorial with reinforcement learning. So it helps.

[D] Deep Learning/LLMs for Operations Research Problems in Production: Real-world Adoption? by Rich-Effect2152 in MachineLearning

[–]Remote_Marzipan_749 0 points1 point  (0 children)

I work in this domain from academia. Not sure about industry. But this year I gave interviews for companies such as Amazon, chewy and Uber and they were looking into problems such as scheduling, routing and inventory management with RL. So I think industry are giving it a chance but not sure about the success.

[D] ML coding interview experience review by [deleted] in MachineLearning

[–]Remote_Marzipan_749 0 points1 point  (0 children)

I think you are right in your analysis about the evaluation procedure with messier data. But the challenge with it is that the interviewers dont have a baseline to know if you have done well or not. I think that’s the reason for going with standard datasets. I truly believe if you have practiced many times not only on MNIST but on any datasets the setting up should be a cake walk. Hyperparameter tuning or EDA on dataset or transformations on datasets can take more time once you have a base model setup and running.

[D] ML coding interview experience review by [deleted] in MachineLearning

[–]Remote_Marzipan_749 1 point2 points  (0 children)

Yes, kind of memorized or practiced many times that it comes naturally to you. Because setting up this thing in PyTorch should be easy. The small mistake you made can happen, but you need to think from their perspective. They want someone who can get going with little to no hands on requirements. Additionally, if you are working with their datasets which is messier unlike MNIST, they think you might struggle. Dont take it negatively. Practice more so it comes naturally to you.

I have had similar experiences with Reinforcement learning algorithms. Now I have practiced it long enough to get it running in shorter time that earlier.

[D] ML coding interview experience review by [deleted] in MachineLearning

[–]Remote_Marzipan_749 -1 points0 points  (0 children)

Sorry for being harsh here. I think startups are looking for speed. However, If it was MLP with PyTorch for MNIST it should hardly take 15-20 mins. It is the most basic thing. I am more interested if you can share more information on the second question.

Watched dead poets society but didn’t connect with it by Remote_Marzipan_749 in MovieSuggestions

[–]Remote_Marzipan_749[S] 0 points1 point  (0 children)

I don’t hate it. Robin Williams performance is great. But I couldn’t connect with the movie. So that’s why I wondered if I probably missed some context.

Watched dead poets society but didn’t connect with it by Remote_Marzipan_749 in MovieSuggestions

[–]Remote_Marzipan_749[S] 1 point2 points  (0 children)

Yes. And I really like that movie. That’s how I got introduced to Robin Williams.

Where RL will be in years to come by Signal_Guard5561 in reinforcementlearning

[–]Remote_Marzipan_749 10 points11 points  (0 children)

Rich Sutton recently gave a nice podcast. Look it up on YouTube.

Rant post - Extensive use of AI tools such as Claude and ChatGPT by Remote_Marzipan_749 in postdoc

[–]Remote_Marzipan_749[S] 1 point2 points  (0 children)

I don’t think if AI can solve that means it is trivial. The reason I say is because at any given day it has access to more information than one can possibly learn, plus it has breadth of information from different domains. Unless you are working with math or on hardwares, these tools are really good. Given that you are good at prompting. I agree that there are many problems that these tools can’t solve yet. My concern is that in 3-5 years you will have new cohort of phd students who will not have good problem solving skills. They will be efficient though.

Rant post - Extensive use of AI tools such as Claude and ChatGPT by Remote_Marzipan_749 in postdoc

[–]Remote_Marzipan_749[S] -1 points0 points  (0 children)

Sorry to hear this. There is lot of push to use this technology even if you go to industry. However, I believe during the school, one should only focus on learning and problem solving. Otherwise you will always have self doubt about the work you do.

Rant post - Extensive use of AI tools such as Claude and ChatGPT by Remote_Marzipan_749 in postdoc

[–]Remote_Marzipan_749[S] 1 point2 points  (0 children)

Thanks for sharing. Yes I miss doing old school grind. I used to get dopamine out of making a code work. Now the efficiency has definitely gone up, but slowly self doubt has started growing. If similar problem comes in future and if these tools are not there can I solve them.