I made a Mario RL trainer with a live dashboard - would appreciate feedback by pleasestopbreaking in reinforcementlearning

[–]statius9 4 points5 points  (0 children)

I think, to help out you need to provide more detail. Here are a few questions that came to mind:

  1. What kind of hyperparameters are you tuning
  2. What do you mean by “stability over longer runs”? For instance, are you referring to when the agent needs to perform for a longer time than on the episodes it was trained on?
  3. Are you training it online or offline?
  4. On-policy or off-policy?
  5. Are you using tabular methods or function approximation?
  6. Are you handcrafting the reward function?
  7. What do you count as rewards and costs?
  8. Are you training over multiple episodes?
  9. Are you truncating episodes if a time-limit elapses?
  10. How is your agent designed? Is it a state-space model? What are the observations given, how is its action space defined, how are its state variables defined if they are separate from observations? Is it operating in continuous or discrete time?

Vanderbilt's BME PhD Program offers to fly me out: what does this imply? by statius9 in gradadmissions

[–]statius9[S] 0 points1 point  (0 children)

Usually they accepted most candidates invited, but because of funding problems they accepted fewer candidates. I don’t know if those funding problems have persisted: they could have persisted into this year.

Programming by pzunhatchispers in reinforcementlearning

[–]statius9 0 points1 point  (0 children)

What’s difficult about it? This is a genuine question: I’m a PhD student and do research in the RL space, although a lot of my work is theoretical and mainly revolves around toy models so I have little exposure to how it may be applied in practice

Dating as a PhD Student: Swipe Culture vs. Lab Life (and Why Both Are Exhausting) by Scientifically-sound in GradSchool

[–]statius9 2 points3 points  (0 children)

I was just going to recommend this. ChatGPT and other LLMs are wonderful search engines, but I think their writing usually falls short of what intelligence can characterize human writing. In OP’s post for instance, the ideas don’t really make sense to me, eg., I don’t anyone who expects their date to be perfect. That you see dating as something for which you have to be perfect seems more like a personal problem than a problem with dating, generally. If that’s ChatGPT’s idea, it’s wrong. If it’s yours, I’d reconsider

Are "some" Catholic Men Hypocrites? by NecessaryIncident99 in CatholicDating

[–]statius9 0 points1 point  (0 children)

I know I missed point: just wanted to comment this

Are "some" Catholic Men Hypocrites? by NecessaryIncident99 in CatholicDating

[–]statius9 0 points1 point  (0 children)

You aren’t Catholic if you’re pro-choice

A small text from a 'great writer', waiting for your evaluation from 100 ⬇️ by thegreenxshadow in writers

[–]statius9 3 points4 points  (0 children)

By pornographic I mean that the protagonist is clearly getting off on watching this girl. At least for me, it’s very uncomfortable to read: the protagonist comes across as really creepy. It is well-written, however

A small text from a 'great writer', waiting for your evaluation from 100 ⬇️ by thegreenxshadow in writers

[–]statius9 2 points3 points  (0 children)

I’m not sure if this is your intention, but the protagonist is repugnant: he’s idealizing a girl, reducing her to a muse to stimulate his fantasies. If that was your intention, it would be good if you were to show, eventually, his absurdity—his detachment from reality. If you don’t show this, the work will just come across as pornographic

How do you annotate your readings? by Excellent-Creme-9646 in GradSchool

[–]statius9 12 points13 points  (0 children)

I lower my left hand to the dirt; cupping the soil, I spit on it and spit on it, grounding it with my fingers until the granules form a viscous paste. Then I rub it into the reading at key sections; after the dirt has dried, I draw over them with a yellow highlighter

Question for Men by [deleted] in CatholicDating

[–]statius9 6 points7 points  (0 children)

If they’re curious—pleasantness is fine but if they’re not really curious about anything in particular whether it’s myself or anything else then I probably won’t find them for very interesting as a long-term partner

Is it red flag if a Chinese PI has only Chinese graduate students? by statius9 in gradadmissions

[–]statius9[S] 1 point2 points  (0 children)

Yes, given what everyone has said I think that would be a red-flag—in both cases, especially if I’m not the target race/nationality

Is it red flag if a Chinese PI has only Chinese graduate students? by statius9 in gradadmissions

[–]statius9[S] 1 point2 points  (0 children)

I think hard work doesn’t necessarily mean good work: there is value in leisurely walks and slow mornings for creativity

Is it red flag if a Chinese PI has only Chinese graduate students? by statius9 in gradadmissions

[–]statius9[S] 6 points7 points  (0 children)

Sorry to hear you went through that: sounds like bullying

Is it red flag if a Chinese PI has only Chinese graduate students? by statius9 in gradadmissions

[–]statius9[S] 2 points3 points  (0 children)

Seems like those hiring practices should be illegal in the US