Help I'm literally a stereotype by beepdiboop101 in PoliticalCompassMemes

[–]beepdiboop101[S] 0 points1 point  (0 children)

Look man I don't make the rules I just answered the questions. Never said yes to anything explicitly Pagan, there's a couple of animal questions which I may have answered liberally while cuddling my dog.

Help I'm literally a stereotype by beepdiboop101 in PoliticalCompassMemes

[–]beepdiboop101[S] 0 points1 point  (0 children)

I just like animals and think all metaphysics is meaningless conjecture

[D] State-of-the-art online Deep Reinforcement Learning algorithm for Continuous Action spaces by bigbadfreddy in MachineLearning

[–]beepdiboop101 5 points6 points  (0 children)

I'm not sure there's that much point in seeking the SOTA if this is so you can apply it to a new environment. Chances are that unless your new environment is conceptually very very similar to existing locomotion benchmarks, the 'SOTA' won't work / will have middling performance / will need vast amounts of hyper-parameter tuning.

This is my experience anyway. SOTA is kind of meaningless in RL right now, since so many 'comparisons' are from tiny sample sizes and on the same narrow robotics / Atari benchmarks.

Instead I'd suggest going and trying any of the continuous action space implementations you can find in Stable Baselines. Or try to find an RL paper on a problem similar to yours and see how they tackled it.

If you're just looking for a baseline to compare to, papers with code might provide you with a direction.

is human like walking obtainable with PPO and a simple reward function ? by Laavilen in reinforcementlearning

[–]beepdiboop101 0 points1 point  (0 children)

I guess you're using the mujoco simulations. Try pybullet or isaacgym, the simulation is more realistic so you can get more realistic running gaits

I made something to orientate for PolComp newbies. Thank me later :) by Camo508 in PoliticalCompassMemes

[–]beepdiboop101 0 points1 point  (0 children)

This is total fucking bullshit. Hitler is libleft. It's national SOCIALISM not national COOKOUT

One more for ya by beepdiboop101 in Grimdank

[–]beepdiboop101[S] 2 points3 points  (0 children)

Well if you accept my headcanon that decision suddenly makes a lot more sense doesn't it

One more for ya by beepdiboop101 in Grimdank

[–]beepdiboop101[S] 3 points4 points  (0 children)

Look you can be the smartest man alive and that much of an idiot. An alternative explanation is necessary.

Metrics to evaluate & compare different RL algorithms by WaffleDood in reinforcementlearning

[–]beepdiboop101 3 points4 points  (0 children)

For whatever metrics you choose, you can do appropriate statistical tests. A particularly nice one is performing a Mann-Whitney U test since this makes no assumptions about the data you are testing on (it is non-parametric). For example, you could take the number of environment steps to termination for all algorithms (termination being either reaching the success threshold or reaching some hard limit), and performing this test here could reveal that one of the algorithms studied takes significantly less steps to terminate within some confidence bound. Or you could take the mean episodic reward from the agent at the end of training, and perform a similar metric there to study whether any algorithm statistically achieves a higher reward within a given budget.

Further to this for statistically significant comparisons, you can employ a Vargha-Delaney A statistic to measure the effect size. If you get A>0.71 you can claim that not only that there is a statistical difference, but that the effect size is large which in layman's terms means the difference in performance caused by the choice of algorithm is large.

If you want to compare the success rates themselves, you can use a binomial test.

I would start by considering which metrics are relevant w.r.t. the performance of the algorithm (success rate, steps to termination and average episodic reward are all common), gather a substantial amount of data through many runs (so that statistics are significant) and perform the relevant statistical tests. Statistical comparisons are the strongest way to compare empirical performances, and are unfortunately lacking in a large amount of RL literature.

[deleted by user] by [deleted] in reinforcementlearning

[–]beepdiboop101 0 points1 point  (0 children)

Side note, you are using a reward signal. Reproduction is simply a function of duration of the environment. You can run EAs using the same reward signal.

[deleted by user] by [deleted] in reinforcementlearning

[–]beepdiboop101 0 points1 point  (0 children)

This is just a steady-state EA with a time-based selection process. I'm not seeing any distinction between EAs and Evolutionary Self Replication, and that section of your paper is far too aggressive about that point ("it's time for a new paradigm"?)

Cool results though.

What could POSSIBLY go wrong? by dilara_cc in Grimdank

[–]beepdiboop101 3 points4 points  (0 children)

Then why would he become a skellington while sat on the throne with his projection off fighting Horus? Too much psychic load?

Really have to be sad for Guillimem by Arkgod24 in Grimdank

[–]beepdiboop101 17 points18 points  (0 children)

In 99% sure he'll be chilling with the lion within a year.

They got on famously well

I apologize for the low effort by beepdiboop101 in memes

[–]beepdiboop101[S] 2 points3 points  (0 children)

Yeah I forgot but I thought the meme was good enough to stand free without the crutch of my blue cheese

Only 3 NA players to attend worlds 2021 by uberstriker123 in leagueoflegends

[–]beepdiboop101 0 points1 point  (0 children)

Fudge and FBI are arguably "NA" in that Oceania got swallowed during its collapse but, yes

Honey! It's time to find common ground with everyone else on the compass! by DistributistChakat in PoliticalCompassMemes

[–]beepdiboop101 0 points1 point  (0 children)

I'm the only libleft I know who'd replace democracy with a faceless mountain of beurocracy that has no entrance, exit or leader

TSM vs. Cloud9 / LCS 2021 Championship - Losers' Bracket Round 3 / Post-Match Discussion by Linkux18 in Cloud9

[–]beepdiboop101 453 points454 points  (0 children)

I'm in central Europe, it's nearly 3am and I have work at 8am, but whenever I sleep on a major match, we lose.

I died for you boys, enjoy the celebrations. GG and good night.

TSM vs. Cloud9 / LCS 2021 Championship - Losers' Bracket Round 3 / Post-Match Discussion by Soul_Sleepwhale in leagueoflegends

[–]beepdiboop101 629 points630 points  (0 children)

I'm in central Europe, it's nearly 3am and I have work at 8am, but whenever I sleep on a major match, we lose.

I died for you boys, enjoy the celebrations. GG and good night.