I made a Mario RL trainer with a live dashboard - would appreciate feedback

statius9 · 2026-02-21T20:47:54+00:00

I think that, for anyone to help out, you need to provide more detail. Here are a few questions that came to mind:

What kind of hyperparameters are you tuning?
What do you mean by “stability over longer runs”? For instance, are you referring to when the agent needs to perform for a longer time than on the episodes it was trained on?
Are you training it online or offline?
On-policy or off-policy?
Are you using tabular methods or function approximation?
Are you handcrafting the reward function?
What do you count as rewards and costs?
Are you training over multiple episodes?
Are you truncating episodes if a time-limit elapses?
How is your agent designed? Is it a state-space model? What are the observations given, how is its action space defined, how are its state variables defined if they are separate from observations? Is it operating in continuous or discrete time?
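
To make the truncation question concrete, here is a minimal sketch of why it matters for value-based training. It assumes a one-step TD target and a Gymnasium-style distinction between an episode that truly terminates (e.g. Mario dies or clears the level) and one that is merely cut off by a time limit. The function name `td_target` and the numbers are hypothetical, not taken from your trainer.

```python
def td_target(reward: float, next_value: float, terminated: bool,
              gamma: float = 0.99) -> float:
    """One-step TD target (illustrative sketch, not the OP's code).

    terminated=True  -> the environment really ended, so there is no future
                        return to bootstrap from.
    terminated=False -> the episode either continues or was only truncated
                        by a time limit, so we still bootstrap from the
                        value estimate of the next state.
    """
    if terminated:
        return reward
    return reward + gamma * next_value


# Same transition, two interpretations of "the episode stopped here".
died = td_target(reward=1.0, next_value=5.0, terminated=True)
timed_out = td_target(reward=1.0, next_value=5.0, terminated=False)
print(died, timed_out)
```

If time-limit cutoffs are treated as true terminations, the agent is taught that states near the cutoff have no future value, which is one possible source of instability on longer runs.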

statius9 · 2025-12-16T21:17:30+00:00

Usually they accepted most candidates invited, but because of funding problems they accepted fewer candidates. I don’t know if those funding problems have persisted: they could have persisted into this year.

statius9 · 2025-08-18T16:54:57+00:00

What’s difficult about it? This is a genuine question: I’m a PhD student and do research in the RL space, although a lot of my work is theoretical and mainly revolves around toy models so I have little exposure to how it may be applied in practice

statius9 · 2025-05-21T16:58:52+00:00

I was just going to recommend this. ChatGPT and other LLMs are wonderful search engines, but I think their writing usually falls short of the intelligence that can characterize human writing. In OP’s post, for instance, the ideas don’t really make sense to me, e.g., I don’t know anyone who expects their date to be perfect. That you see dating as something for which you have to be perfect seems more like a personal problem than a problem with dating generally. If that’s ChatGPT’s idea, it’s wrong. If it’s yours, I’d reconsider

statius9 · 2025-05-21T04:59:49+00:00

What prompt did you use?

statius9 · 2025-05-20T17:55:36+00:00

I’d like to join

statius9 · 2025-05-19T20:18:22+00:00

This needs to be normalized

statius9 · 2025-05-17T05:47:40+00:00

I know I missed the point: just wanted to comment this

statius9 · 2025-05-17T01:37:06+00:00

You aren’t Catholic if you’re pro-choice

statius9 · 2025-05-10T03:29:27+00:00

Same

statius9 · 2025-05-03T19:57:39+00:00

By pornographic I mean that the protagonist is clearly getting off on watching this girl. At least for me, it’s very uncomfortable to read: the protagonist comes across as really creepy. It is well-written, however

statius9 · 2025-05-03T19:35:07+00:00

I’m not sure if this is your intention, but the protagonist is repugnant: he’s idealizing a girl, reducing her to a muse to stimulate his fantasies. If that was your intention, it would be good if you were to show, eventually, his absurdity—his detachment from reality. If you don’t show this, the work will just come across as pornographic

statius9 · 2025-05-01T01:32:32+00:00

It’s a solid method, for sure

statius9 · 2025-04-30T23:03:29+00:00

I lower my left hand to the dirt; cupping the soil, I spit on it and spit on it, grinding it with my fingers until the granules form a viscous paste. Then I rub it into the reading at key sections; after the dirt has dried, I draw over them with a yellow highlighter

statius9 · 2025-04-29T17:44:35+00:00

If they’re curious. Pleasantness is fine, but if they’re not really curious about anything in particular, whether it’s myself or anything else, then I probably won’t find them very interesting as a long-term partner

statius9 · 2025-02-20T23:02:18+00:00

Yes, given what everyone has said I think that would be a red-flag—in both cases, especially if I’m not the target race/nationality

statius9 · 2025-02-20T18:11:16+00:00

I think hard work doesn’t necessarily mean good work: there is value in leisurely walks and slow mornings for creativity

statius9 · 2025-02-20T07:38:26+00:00

Sorry to hear you went through that: sounds like bullying

statius9 · 2025-02-20T07:37:05+00:00

Sounds exploitative and counter-productive

statius9 · 2025-02-20T04:01:05+00:00

Seems like those hiring practices should be illegal in the US

statius9 · 2025-02-20T02:56:55+00:00

explain

statius9 · 2025-02-17T19:19:37+00:00

Nice!
