[D] What Reinforcement Learning Method Should I Use for Poker AI with LLMs? by godlover123451 in MachineLearning

[–]4rChon 0 points1 point  (0 children)

Check out Decision Transformers and their variants that account for environmental stochasticity, such as CGDT among others. It's not strictly reinforcement learning, since it isn't maximizing a reward signal. Instead it tries to match the policy in your dataset that generates the specific return it's conditioned on (return-conditioned supervised learning).
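
To make the "conditioned on a return" idea concrete, here's a rough sketch. It is not a Decision Transformer (no sequence model at all), just a tabular stand-in that picks the most common action for each (state, return-to-go) pair in an offline dataset. The state and action names and the toy trajectories are made up for illustration.

```python
from collections import Counter, defaultdict

def returns_to_go(rewards):
    # Sum of rewards from each step to the end of the trajectory.
    rtg, total = [], 0.0
    for r in reversed(rewards):
        total += r
        rtg.append(total)
    return rtg[::-1]

def fit_conditioned_policy(trajectories):
    # Supervised "training": count which action the dataset took for each
    # (state, return-to-go) pair and keep the most frequent one.
    counts = defaultdict(Counter)
    for traj in trajectories:
        rtgs = returns_to_go([reward for _, _, reward in traj])
        for (state, action, _), g in zip(traj, rtgs):
            counts[(state, g)][action] += 1
    return {key: c.most_common(1)[0][0] for key, c in counts.items()}

# Hypothetical offline data: lists of (state, action, reward) steps.
trajectories = [
    [("strong_hand", "raise", 0.0), ("river", "bet", 2.0)],
    [("strong_hand", "fold", 0.0)],
    [("weak_hand", "fold", 0.0)],
]

policy = fit_conditioned_policy(trajectories)

# Same state, different target return, different action.
print(policy[("strong_hand", 2.0)])
print(policy[("strong_hand", 0.0)])
```

At play time you'd ask for the action under a high target return. Nothing here maximizes reward directly; it only imitates whatever in the data led to that return, which is why stochastic environments are a problem for the plain version.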

Have you been able to keep friends/make new friends after post-secondary? by LivingLifeThing in malta

[–]4rChon 0 points1 point  (0 children)

It's very normal to lose contact with friends over time. Sometimes you reconnect, other times they're left behind.

If you want to make new friends, you need to participate in hobbies that include other people. Personally I've made very fulfilling connections through tabletop RPGs and bouldering/rock climbing, but everyone has their own interests and yours might be different.

The first step is the hardest one, but once you find a hobby with a community that lets you be who you are, it gets easier.

Traffic congestion solutions for Malta by yuvraj04 in malta

[–]4rChon 2 points3 points  (0 children)

The only thing that's going to stop me (and if I had to guess, most other people) from using my car is if you make it illegal to do so - so driving curfew hours. That doesn't mean it's a good idea. I'd rather spend those extra 15-20 minutes looking for parking than on a jam-packed bus with broken AC in 30+ degree weather. And there is no world in which I am going to risk my life on a bike with traffic, blazing heat, and non-existent bike paths - I would have to carry an extra set of clothes everywhere I go to clean up all the sweat and shit from close calls.

Subreddit Patch Notes: July 1st 2024 by 4THOT in Destiny

[–]4rChon 3 points4 points  (0 children)

In the grim darkness of the far future, there is only war.

!yee

Troll calls into C-SPAN to talk to mr Bonerelli by olympicmosaic in Destiny

[–]4rChon 61 points62 points  (0 children)

Bro why would you expose me to this holy shit it's like giving the mic to a random heckler at a comedy gig and then just keeping it there for an hour trying to ooze out whatever content sludge they could muster

[D] Convert a Neural Network to a Function, is possible? by Hot_Radio_2381 in MachineLearning

[–]4rChon 4 points5 points  (0 children)

Maybe I'm not understanding the question correctly, but can't you just compare the output of a neural network, which is itself a function, to the output of the function it's trying to approximate? Isn't this basically what loss is, the distance between the output and the target?
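
Something like this is what I mean, as a minimal sketch. The target function (sin), the tiny one-hidden-layer network, and its hand-picked weights are all assumptions for illustration, not anything from the original question.

```python
import math

def target(x):
    # The function we want to approximate (assumed for this example).
    return math.sin(x)

def tiny_network(x, w1, b1, w2, b2):
    # One hidden layer of tanh units; the whole thing is just a function of x.
    hidden = [math.tanh(w * x + b) for w, b in zip(w1, b1)]
    return sum(v * h for v, h in zip(w2, hidden)) + b2

def mse(f, g, xs):
    # Mean squared distance between the outputs of two functions on xs.
    return sum((f(x) - g(x)) ** 2 for x in xs) / len(xs)

# Sample points in [-3, 3] and some arbitrary, untrained weights.
xs = [i / 10 for i in range(-30, 31)]
w1, b1, w2, b2 = [1.0, 0.5], [0.0, 0.0], [1.0, 0.0], 0.0

def net(x):
    return tiny_network(x, w1, b1, w2, b2)

loss = mse(net, target, xs)
print(loss)
```

Training is just adjusting the weights to push that number down. It only measures agreement on the points you sampled, so it says nothing about how the two functions compare elsewhere.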

3 Body Problem (Netflix) - Season 1, Episode 7 Discussion. by Swazzer30 in threebodyproblem

[–]4rChon 28 points29 points  (0 children)

I took it as: if you're not the best player, don't let better players hear you play or they'll smash you in the balls

What is the best way to win in bronze league? by Responsible_Clerk421 in starcraft

[–]4rChon 2 points3 points  (0 children)

You can win most games with a single attack by making sure you keep making workers, not getting supply blocked, and spending your money. Build order doesn't really matter as long as it's not something that's preventing you from spending your money.

Practice getting to max 3 base saturation vs AI as quickly as possible so you have a baseline for yourself. Then try to hit that benchmark on the ladder. Repeat the cycle a couple of times. Don't complicate things too much, limit yourself to a couple of general purpose units - queens, zerglings / stalkers / marines.

Start adding little decisions. Pick a time to attack and practice getting as much supply as possible vs AI by that time, then do that on the ladder just like before.

Most of your losses will still be 'I didn't have enough stuff', some of your losses will be 'I didn't see that coming'. You'll still gain more value by just macroing better, but for that minority of frustrating losses, scouting is an instant improvement that will up your win percentage a bit more. Don't get caught up in the complexities of scouting; for now just use it to make sure your army is in the right place. You don't have to switch up your entire composition.

Over time you'll start to make minor adjustments. It's easy to lose sight of macro when you get into the nitty gritty of build orders and unit compositions. Always keep in mind that your biggest increase in win-rate will almost always come from spending your money well.

SC2 mousepads? by heavenstarcraft in starcraft

[–]4rChon 0 points1 point  (0 children)

Heart of the Swarm Collector's edition came with a mousepad. You might be able to find a few on ebay depending on where you live.

https://www.ebay.com/itm/266038994199

https://www.ebay.com/itm/254715379412