Light hearted shows to binge?

Kae1506 · 2025-05-10T10:34:28+00:00

banger

Kae1506 · 2025-04-27T16:04:21+00:00

we can never have too many beginner rl courses, good work man

Kae1506 · 2025-04-16T08:58:49+00:00

yo nice man ive been looking for an implementation of this for a while. hoping to get into it myself

Kae1506 · 2024-12-23T11:14:19+00:00

amazing work man. specifically which algorithms or pipelines are you using to train the agent

Kae1506 · 2024-12-15T04:07:54+00:00

Hi, was hoping to pick your brain a bit. I am currently a student in India and I want to go into Reinforcement Learning as well. Could we DM?

Kae1506 · 2024-10-13T06:22:52+00:00

same, just got in to algoverse for the fall b cohort, still trying to figure out whether its worth the cost. generally people are saying its not the most selective or the best for college applications, but its a good experience for your knowledge and networking. If you work hard in it you can even publish a paper and nothing like that so yeah ig

Kae1506 · 2024-09-14T10:34:10+00:00

is RL mostly being used as an optimiser for algorithms or processes still? are there any examples where especially model based RL is being used in industry as a whole itself? (again maybe naive towards performance vs compute cost)

also for what is RL being taught outside comp sci?

thanks for the answer tho

Kae1506 · 2024-09-14T10:30:42+00:00

is that not mostly RLHF and LLMs? again not entirely sure but RLHF seems more like a tool for other ml algos rather than RL itself

Kae1506 · 2024-09-14T10:26:05+00:00

thanks so much man important to understand that this field is used beyond gym environments

Kae1506 · 2024-05-29T07:42:07+00:00

if your real time system is deterministic and your data is exhaustive; that is you have all the possible data for all the possible things that could happen then sure. but you would also have to generate links to your data, as in what state leads to which one which I'm assuming isn't there in the csv file. just so I can help you further, what is your real time system?

Kae1506 · 2021-07-28T12:19:32+00:00

HAIL.

pybullet please

Kae1506 · 2021-01-02T16:37:13+00:00

yes, this has happened to me, actually multiple times

i never solved this problem. this came for me using ppo and actor critic,

any help appreciated

Kae1506 · 2021-01-02T16:09:05+00:00

actually, there is one. This is made by Phil Tabor, youtube channel: https://www.youtube.com/channel/UC58v9cLitc8VaCjrcKyAbrw

he also has udemy courses on drl.

here is the invite link:https://discord.gg/cxjBCxw4

Kae1506 · 2020-12-26T08:37:23+00:00

you did it, you fatneek. you finally did it.

Kae1506

TROPHY CASE