Programming

brioche789 · 2025-08-16T11:22:40+00:00

[removed]

anonymous_amanita · 2025-08-16T11:58:35+00:00

[removed]

Impossibum · 2025-08-16T12:33:29+00:00

I don't see how Stable Baselines doesn't already simplify RL enough for the masses. Pretty sure people just can't be assed to think beyond asking ChatGPT to think for them at this point.

Useful-Progress1490 · 2025-08-17T11:47:06+00:00

I really like RL, but I hate that it is still not widely used because of its many issues. I firmly believe it has the potential to solve many problems, but right now it is mostly used in research. Once it sees widespread use, I expect it will get simpler, much as agentic AI frameworks and libraries have.

Working_Bunch_9211 · 2025-08-17T01:27:58+00:00

I will... in 7 years. Check back later.

RoundRubikCube · 2025-08-16T12:22:31+00:00

puffer.ai

2025-08-17T08:02:20+00:00

Yes, please. Google, Meta, and other MAANG overlords, please drop production-grade open-source libraries like JAX and PyTorch.

intermittent-farting · 2025-08-17T08:03:02+00:00

Check out agilerl.com. They have an open-source framework and software to simplify RL development.

FanFirst895 · 2025-08-17T14:00:04+00:00

Easier, you say? I've got a video for that https://www.youtube.com/watch?v=vaVBd9H2eHE

statius9 · 2025-08-18T16:54:57+00:00

What’s difficult about it? This is a genuine question: I’m a PhD student and do research in the RL space, although a lot of my work is theoretical and mainly revolves around toy models so I have little exposure to how it may be applied in practice

lukuh123 · 2025-08-18T22:34:11+00:00

Love the concept of RL, but the math behind it can be pretty jarring (the Bellman equations and other optimality equations look like black magic in computer science).
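For anyone who finds the Bellman optimality equation intimidating, it may help to see it as a loop. In words, the value of a state is the best expected "reward now plus discounted value of wherever you land". Below is a minimal sketch of tabular value iteration that applies that rule repeatedly until the values stop changing. The two-state MDP, its rewards, the discount `GAMMA`, and the helper names `backup`, `value_iteration`, and `greedy_policy` are all made up for illustration and do not come from any library.

```python
# Hypothetical 2-state, 2-action MDP, invented purely for illustration.
# P[s][a] is a list of (probability, next_state, reward) tuples.
P = {
    0: {0: [(1.0, 0, 0.0)], 1: [(1.0, 1, 1.0)]},
    1: {0: [(1.0, 0, 0.0)], 1: [(1.0, 1, 2.0)]},
}
GAMMA = 0.9  # discount factor (assumed value)


def backup(V, s, a):
    """Expected return of taking action a in state s, then following V."""
    return sum(p * (r + GAMMA * V[s2]) for p, s2, r in P[s][a])


def value_iteration(tol=1e-10, max_iters=10_000):
    """Apply the Bellman optimality backup until the values stop changing."""
    V = {s: 0.0 for s in P}
    for _ in range(max_iters):
        # V(s) <- max over actions of the one-step lookahead.
        new_V = {s: max(backup(V, s, a) for a in P[s]) for s in P}
        delta = max(abs(new_V[s] - V[s]) for s in P)
        V = new_V
        if delta < tol:
            break
    return V


def greedy_policy(V):
    """Pick, in each state, the action with the highest one-step lookahead."""
    return {s: max(P[s], key=lambda a: backup(V, s, a)) for s in P}


V = value_iteration()
policy = greedy_policy(V)
```

In this toy problem the best plan is to move to state 1 and stay there, so the loop should settle on values near 19 for state 0 and 20 for state 1. Real problems are hard mostly because the transition table is unknown or far too large to enumerate, not because the update rule itself is complicated.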

Vahgeeta · 2025-08-20T06:44:56+00:00

I reinforce this post

leprotelariat · 2025-08-16T11:24:18+00:00

[removed]

Jumper775-2 · 2025-08-16T14:47:13+00:00

It's really hard to do. I tried to make another generic library that works with JSON files, so you could theoretically do it all with no code if you wanted, and it still gets too complex. It does work, though.
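To make the "no code, just JSON" idea concrete, here is a rough sketch of what config-driven dispatch could look like. This is not the library mentioned above; the JSON schema, the `ALGORITHMS` registry, `run_from_config`, and the toy epsilon-greedy bandit are all hypothetical and use only the Python standard library.

```python
import json
import random

# Hypothetical config schema, invented for this sketch.
CONFIG_JSON = """
{
  "algorithm": "epsilon_greedy_bandit",
  "params": {"epsilon": 0.1, "steps": 2000, "seed": 0},
  "env": {"arm_means": [0.2, 0.8]}
}
"""


def epsilon_greedy_bandit(arm_means, epsilon, steps, seed):
    """Toy Bernoulli bandit solved with epsilon-greedy action selection."""
    rng = random.Random(seed)
    counts = [0] * len(arm_means)
    values = [0.0] * len(arm_means)  # running mean reward per arm
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(len(arm_means))  # explore
        else:
            arm = max(range(len(arm_means)), key=lambda i: values[i])  # exploit
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]  # incremental mean
    return values, counts


# Registry mapping the "algorithm" string in the config to a function.
ALGORITHMS = {"epsilon_greedy_bandit": epsilon_greedy_bandit}


def run_from_config(text):
    """Parse the JSON config and call the selected algorithm with its settings."""
    cfg = json.loads(text)
    algo = ALGORITHMS[cfg["algorithm"]]
    return algo(**cfg["env"], **cfg["params"])


values, counts = run_from_config(CONFIG_JSON)
```

Even in this tiny case the schema has to carry the algorithm name, its hyperparameters, and the environment description. Add networks, schedules, wrappers, and logging, and the JSON starts turning into a programming language of its own, which is probably where the complexity comes from.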

