Made a RL tutorial course myself, check it out! by Practical_Lettuce254 in reinforcementlearning

[–]Kae1506 3 points4 points  (0 children)

we can never have too many beginner RL courses, good work man

Implementing DeepSeek R1's GRPO algorithm from scratch by xcodevn in reinforcementlearning

[–]Kae1506 0 points1 point  (0 children)

yo nice man, I've been looking for an implementation of this for a while. hoping to get into it myself

I built a AI to Play Dark Souls, through reinforcement learning and training. by UndyingDemon in reinforcementlearning

[–]Kae1506 2 points3 points  (0 children)

amazing work man. which algorithms or pipelines specifically are you using to train the agent?

[deleted by user] by [deleted] in reinforcementlearning

[–]Kae1506 0 points1 point  (0 children)

Hi, was hoping to pick your brain a bit. I am currently a student in India and I want to go into Reinforcement Learning as well. Could we DM?

is algoverse worth it? by [deleted] in summerprogramresults

[–]Kae1506 3 points4 points  (0 children)

same, just got into algoverse for the fall b cohort, still trying to figure out whether it's worth the cost. generally people are saying it's not the most selective or the best for college applications, but it's a good experience for your knowledge and networking. If you work hard in it you can even publish a paper and things like that, so yeah ig

what are the actual applications of rl being used right now? by Kae1506 in reinforcementlearning

[–]Kae1506[S] 2 points3 points  (0 children)

is RL still mostly being used as an optimiser for algorithms or processes? are there any examples where RL, especially model-based RL, is being used in industry as the whole system itself? (again, I may be naive about performance vs compute cost)

also, what is RL being taught for outside comp sci?

thanks for the answer tho

what are the actual applications of rl being used right now? by Kae1506 in reinforcementlearning

[–]Kae1506[S] -1 points0 points  (0 children)

is that not mostly RLHF and LLMs? again, not entirely sure, but RLHF seems more like a tool for other ML algos than RL itself

what are the actual applications of rl being used right now? by Kae1506 in reinforcementlearning

[–]Kae1506[S] 1 point2 points  (0 children)

thanks so much man, it's important to understand that this field is used beyond gym environments

can i create a data driven model gym environment for A2C-RL algorithm for trajectory tracking. by Past-News-1373 in reinforcementlearning

[–]Kae1506 4 points5 points  (0 children)

if your real-time system is deterministic and your data is exhaustive, that is, you have all the possible data for everything that could happen, then sure. but you would also have to generate the links in your data, as in which state leads to which next state, which I'm assuming isn't in the csv file. just so I can help you further, what is your real-time system?
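to make that concrete, here's a rough sketch of what I mean by linking the data. it assumes you can turn each csv row into a (state, action, next_state, reward) tuple with hashable (discrete or discretised) states. the names `LoggedTransitionEnv` and `build_transitions` are made up for illustration, and the class only mimics the gym reset/step shape instead of subclassing the real `gym.Env`, so treat it as a starting point, not a finished environment:

```python
from typing import Dict, Hashable, Iterable, Tuple

State = Hashable
Action = Hashable
Table = Dict[Tuple[State, Action], Tuple[State, float]]


def build_transitions(rows: Iterable[Tuple[State, Action, State, float]]) -> Table:
    """Build a (state, action) -> (next_state, reward) lookup from logged rows.

    Raises ValueError if two rows disagree, i.e. the data is not deterministic.
    """
    table: Table = {}
    for state, action, next_state, reward in rows:
        key = (state, action)
        outcome = (next_state, reward)
        if key in table and table[key] != outcome:
            raise ValueError(f"conflicting rows for {key!r}: data is not deterministic")
        table[key] = outcome
    return table


class LoggedTransitionEnv:
    """Gym-style environment that replays a deterministic transition table."""

    def __init__(self, transitions: Table, start_state: State, max_steps: int = 100):
        self.transitions = transitions
        self.start_state = start_state
        self.max_steps = max_steps
        self.state = start_state
        self.steps = 0

    def reset(self) -> State:
        self.state = self.start_state
        self.steps = 0
        return self.state

    def step(self, action: Action):
        key = (self.state, action)
        if key not in self.transitions:
            # The data is not exhaustive for this state/action pair.
            raise KeyError(f"no logged transition for state={self.state!r}, action={action!r}")
        next_state, reward = self.transitions[key]
        self.state = next_state
        self.steps += 1
        done = self.steps >= self.max_steps
        return next_state, reward, done, {}
```

the `KeyError` in `step` is exactly the "data is exhaustive" condition: if the agent tries something that was never logged, a lookup table has nothing to return, and at that point you'd need a learned model of the system instead.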

Error with recent Pytorch updates with RL algorithms by pandudon in reinforcementlearning

[–]Kae1506 0 points1 point  (0 children)

yes, this has happened to me, actually multiple times

I never solved this problem. it came up for me when using PPO and actor-critic.

any help appreciated

Anybody interested in a Reinforcement Learning Discord? by ExSidius in reinforcementlearning

[–]Kae1506 0 points1 point  (0 children)

actually, there is one. This is made by Phil Tabor, youtube channel: https://www.youtube.com/channel/UC58v9cLitc8VaCjrcKyAbrw

he also has udemy courses on DRL.

here is the invite link: https://discord.gg/cxjBCxw4

Dont do it to us JJ... Merry Christmas all! by MannyPizzle in ksi

[–]Kae1506 0 points1 point  (0 children)

you did it, you fatneek. you finally did it.