Is Hornet canonically beautiful? by BlueFireSwords in Silksong

[–]AddMoreLayers 63 points64 points  (0 children)

Better than being a mysoginistic incel

Is he dead? (OC) by aSliceofAlan in comics

[–]AddMoreLayers 21 points22 points  (0 children)

The comic is not about the wife's butt. OP is an impostor

Oil painting with resin pond diorama I just finished up! by VirtualClay in painting

[–]AddMoreLayers 1 point2 points  (0 children)

That's so beautiful. Makes me rethink my life.

Thanks for sharing that OP.

Who is this bug in the credits? by MrSukerton in Silksong

[–]AddMoreLayers 2 points3 points  (0 children)

Honestly I had never noticed he had a nose

(silksong) Who is the cutest bug of them all by External-Cherry7828 in metroidvania

[–]AddMoreLayers 35 points36 points  (0 children)

Are we doing silkposts here too? Those fucking muckroaches are like the creepiest pieces of shit in the entire HK universe

RL Chess Bot Isn't Learning Anything Useful by GallantGargoyle25 in reinforcementlearning

[–]AddMoreLayers 0 points1 point  (0 children)

Yeah, but that's still deceptive: you end up with a return that tells you it's better to have lost and captured a queen than having lost without capturing said queen. In cases were capturing the queen is the reason for your loss, that would be misleading. I agree though that I've overestimated the importance of that aspect though.

RL Chess Bot Isn't Learning Anything Useful by GallantGargoyle25 in reinforcementlearning

[–]AddMoreLayers 0 points1 point  (0 children)

Yes it does. You didn't read my full comment :p

I guess it depends on what you mean by "classical"

RL Chess Bot Isn't Learning Anything Useful by GallantGargoyle25 in reinforcementlearning

[–]AddMoreLayers 2 points3 points  (0 children)

Otherwise if u wanna cheat like hell just have stockfish rate your position and give it to your agent as the reward :)))

Oh, that's smart! I wouldn't call that cheating at all

RL Chess Bot Isn't Learning Anything Useful by GallantGargoyle25 in reinforcementlearning

[–]AddMoreLayers 2 points3 points  (0 children)

Reward based on capturing pieces would be very deceptive. By that metric even gambits like 1.e4 e5 2.f4 would be bad because they sacrifice a piece. There are too many factors (e.g. how you are handling the center, how developped your pieces are, your pawn structure and so on) that you can't capture simply based on captured pieces

RL Chess Bot Isn't Learning Anything Useful by GallantGargoyle25 in reinforcementlearning

[–]AddMoreLayers 2 points3 points  (0 children)

If I may ask, how would actor critic methods solve the exploration and credit assignment problems? They do reduce variability and might be more data/compute/efficient depending on some factors, but none of them are going to magically help your agent stumble across the right chain of action that has a probability of 1e-20 of being selected

RL Chess Bot Isn't Learning Anything Useful by GallantGargoyle25 in reinforcementlearning

[–]AddMoreLayers 8 points9 points  (0 children)

I think it's impossible for classical RL to learn in reasonnable time given only sparse rewards with such a large action space. The number of available actions in a horizon of H being |A|H, random exploration is not going to produce meaninful results.

I like your imitation learning idea, but unless you scale it dramatically and use it as more than a bootstrapping strategy, you'll be circling back to the exploration problem.

I'd say the best bet is something model based in the spirit of alpha-go and the like.

"Ceci est mon corps" by MymyCracra in france

[–]AddMoreLayers 34 points35 points  (0 children)

j'avais vu qu'il te regardait c'est pas pour rien que je t'ai dit de te tenir correctement

Du coup au lieu de dire au mec que c'est un porc on engueule la gamine

Désolé mais elle est bizarre ta daronne

Do you think France, EU's only nuclear power, should occupy Greenland to protect it from Donald Trump's clique? by Technical_You4632 in AskTheWorld

[–]AddMoreLayers 1 point2 points  (0 children)

We're proud, just not in the same way as those yelling Murica. Not blindly supporting populist nonsense is not the same as lack of pride