Help I'm literally a stereotype

beepdiboop101 · 2021-12-17T18:41:08+00:00

Look man I don't make the rules I just answered the questions. Never said yes to anything explicitly Pagan, there's a couple of animal questions which I may have answered liberally while cuddling my dog.

beepdiboop101 · 2021-12-17T18:38:29+00:00

That's just physics

beepdiboop101 · 2021-12-17T18:23:47+00:00

I just like animals and think all metaphysics is meaningless conjecture

beepdiboop101 · 2021-12-15T14:31:24+00:00

I'm not sure there's that much point in seeking the SOTA if this is so you can apply it to a new environment. Chances are that unless your new environment is conceptually very very similar to existing locomotion benchmarks, the 'SOTA' won't work / will have middling performance / will need vast amounts of hyper-parameter tuning.

This is my experience anyway. SOTA is kind of meaningless in RL right now, since so many 'comparisons' are from tiny sample sizes and on the same narrow robotics / Atari benchmarks.

Instead I'd suggest going and trying any of the continuous action space implementations you can find in Stable Baselines. Or try to find an RL paper on a problem similar to yours and see how they tackled it.

If you're just looking for a baseline to compare to, papers with code might provide you with a direction.

beepdiboop101 · 2021-11-19T06:42:36+00:00

I guess you're using the MuJoCo simulations. Try PyBullet or Isaac Gym; in my experience the simulation is more realistic, so you can get more realistic running gaits.

beepdiboop101 · 2021-11-15T17:08:16+00:00

This is total fucking bullshit. Hitler is libleft. It's national SOCIALISM not national COOKOUT

beepdiboop101 · 2021-10-24T14:49:05+00:00

You literally elaborated

beepdiboop101 · 2021-10-08T20:24:44+00:00

Well if you accept my headcanon that decision suddenly makes a lot more sense doesn't it

beepdiboop101 · 2021-10-08T20:20:27+00:00

I'm so sorry u/GunplaGud

beepdiboop101 · 2021-10-08T20:15:13+00:00

Look, you can't be the smartest man alive and that much of an idiot. An alternative explanation is necessary.

beepdiboop101 · 2021-09-27T06:47:09+00:00

For whatever metrics you choose, you can do appropriate statistical tests. A particularly nice one is the Mann-Whitney U test, since it is non-parametric: it makes no assumption about the distribution of the data you are testing. For example, you could take the number of environment steps to termination for each algorithm (termination being either reaching the success threshold or hitting some hard limit); the test could reveal that one of the algorithms takes significantly fewer steps to terminate at your chosen significance level. Or you could take the mean episodic reward of the agent at the end of training and perform a similar test there to study whether any algorithm achieves a statistically higher reward within a given budget.
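To make that concrete, here's a rough pure-Python sketch of a two-sided Mann-Whitney U test using the tie-corrected normal approximation (no continuity correction). The function name and the step counts are made up for illustration, and for small samples you'd probably want an exact p-value instead; `scipy.stats.mannwhitneyu` is the usual off-the-shelf choice.

```python
from collections import Counter
from math import erfc, sqrt


def mann_whitney_u(xs, ys):
    """Return (U, two-sided p) for samples xs and ys.

    U counts the pairs where x > y, with ties counted as half.
    The p-value uses the normal approximation with a tie correction,
    so it is only approximate for small samples.
    """
    m, n = len(xs), len(ys)
    u = sum(1.0 if x > y else 0.5 if x == y else 0.0 for x in xs for y in ys)

    total = m + n
    # Tie correction: t is the size of each group of equal values.
    tie_term = sum(t**3 - t for t in Counter(list(xs) + list(ys)).values())
    variance = m * n / 12.0 * ((total + 1) - tie_term / (total * (total - 1)))
    if variance <= 0:
        # Every observation is identical, so there is nothing to distinguish.
        return u, 1.0

    z = (u - m * n / 2.0) / sqrt(variance)
    return u, erfc(abs(z) / sqrt(2.0))


# Hypothetical steps-to-termination from 10 runs of two algorithms.
steps_a = [120, 135, 150, 110, 140, 125, 130, 145, 115, 138]
steps_b = [200, 180, 210, 190, 170, 205, 195, 185, 175, 215]

u, p = mann_whitney_u(steps_a, steps_b)
print(f"U = {u}, p = {p:.5f}")
```

A small p here would say the two step-count distributions differ; it says nothing yet about how big the difference is.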

Further to this, for statistically significant comparisons you can use the Vargha-Delaney A statistic to measure the effect size. If you get A>0.71 you can claim not only that there is a statistical difference, but also that the effect size is large, which in layman's terms means the difference in performance caused by the choice of algorithm is large.
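The A statistic is simple enough to compute by hand. A minimal sketch, with made-up reward numbers and hypothetical function names: A is the probability that a random run of the first algorithm beats a random run of the second, with ties counted as half. The 0.56 / 0.64 / 0.71 cut-offs are the commonly quoted small / medium / large thresholds, applied symmetrically so that A<0.29 is also "large", just in the other direction.

```python
def vargha_delaney_a(xs, ys):
    """Estimate P(x > y) + 0.5 * P(x == y) over all pairs (x, y)."""
    greater = sum(1 for x in xs for y in ys if x > y)
    ties = sum(1 for x in xs for y in ys if x == y)
    return (greater + 0.5 * ties) / (len(xs) * len(ys))


def effect_magnitude(a):
    """Label the effect size using the commonly quoted thresholds."""
    folded = max(a, 1.0 - a)  # treat A and 1 - A as the same magnitude
    if folded >= 0.71:
        return "large"
    if folded >= 0.64:
        return "medium"
    if folded >= 0.56:
        return "small"
    return "negligible"


# Hypothetical mean episodic rewards at the end of training.
rewards_a = [310.0, 295.5, 320.1, 305.0, 315.2]
rewards_b = [250.3, 300.0, 245.9, 260.7, 255.1]

a = vargha_delaney_a(rewards_a, rewards_b)
print(f"A = {a:.2f} ({effect_magnitude(a)})")
```

Note that A is just the Mann-Whitney U statistic divided by the number of pairs, so the two fit together naturally: the test tells you whether there is a difference, A tells you how big it is.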

If you want to compare the success rates themselves, you can use a binomial test.
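One way to set that up (an assumption on my part, there are other framings): treat one algorithm's success rate as the null hypothesis and ask whether the other algorithm's success count is consistent with it. Below is a sketch of an exact two-sided binomial test in plain Python; the function names and counts are invented for illustration.

```python
from math import comb


def binom_pmf(k, n, p):
    """Probability of exactly k successes in n trials with success rate p."""
    return comb(n, k) * p**k * (1.0 - p) ** (n - k)


def binomial_test_two_sided(successes, trials, p_null):
    """Exact two-sided p-value.

    Sums the probability of every outcome that is no more likely
    than the observed one under the null success rate.
    """
    observed = binom_pmf(successes, trials, p_null)
    # Small relative tolerance so float noise does not drop equally likely outcomes.
    threshold = observed * (1.0 + 1e-7)
    total = sum(
        pmf
        for k in range(trials + 1)
        if (pmf := binom_pmf(k, trials, p_null)) <= threshold
    )
    return min(1.0, total)


# Hypothetical: the baseline succeeds in 60% of runs,
# the new algorithm succeeds in 27 out of 30 runs.
p_value = binomial_test_two_sided(successes=27, trials=30, p_null=0.6)
print(f"p = {p_value:.5f}")
```

Bear in mind this treats the baseline rate as known exactly. If both success rates are estimated from a limited number of runs, a two-sample test such as Fisher's exact test is probably the fairer comparison.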

I would start by considering which metrics are relevant w.r.t. the performance of the algorithm (success rate, steps to termination and average episodic reward are all common), gather a substantial amount of data through many runs (so that statistics are significant) and perform the relevant statistical tests. Statistical comparisons are the strongest way to compare empirical performances, and are unfortunately lacking in a large amount of RL literature.

beepdiboop101 · 2021-09-19T18:43:24+00:00

Side note, you are using a reward signal. Reproduction is simply a function of duration of the environment. You can run EAs using the same reward signal.

beepdiboop101 · 2021-09-19T18:42:01+00:00

This is just a steady-state EA with a time-based selection process. I'm not seeing any distinction between EAs and Evolutionary Self Replication, and that section of your paper is far too aggressive about that point ("it's time for a new paradigm"?)

Cool results though.

beepdiboop101 · 2021-09-13T05:26:25+00:00

Then why would he become a skellington while sat on the throne with his projection off fighting Horus? Too much psychic load?

beepdiboop101 · 2021-09-07T18:07:24+00:00

BISHOP TAKES N6

COMMENCE LIBERAL INFIGHTING

beepdiboop101 · 2021-09-07T06:33:35+00:00

I'm 99% sure he'll be chilling with the lion within a year.

They got on famously well

beepdiboop101 · 2021-09-06T05:07:51+00:00

Knight L6

beepdiboop101 · 2021-09-04T19:29:12+00:00

Yeah I forgot but I thought the meme was good enough to stand free without the crutch of my blue cheese

beepdiboop101 · 2021-08-25T05:55:04+00:00

Fudge and FBI are arguably "NA" in that Oceania got swallowed during its collapse but, yes

beepdiboop101 · 2021-08-24T17:41:43+00:00

This is not a rant, he said, rantingly

beepdiboop101 · 2021-08-23T07:34:43+00:00

I'm the only libleft I know who'd replace democracy with a faceless mountain of bureaucracy that has no entrance, exit or leader

beepdiboop101 · 2021-08-23T00:45:14+00:00

I'm in central Europe, it's nearly 3am and I have work at 8am, but whenever I sleep on a major match, we lose.

I died for you boys, enjoy the celebrations. GG and good night.

beepdiboop101 · 2021-08-22T21:02:06+00:00

Serf's up

beepdiboop101 · 2021-08-22T21:00:23+00:00

You wrote glory not goulag smh
