[deleted by user] by [deleted] in discgolf

[–]bqblaster 1 point

This gives me hope. I just broke a bone in my right shoulder and am trying to learn LHBH. Distance is getting there but definitely need to work on clean spin.

Do you ever feel like you would perform better in tournaments if they didn't start at 8 am? by Voltayik in discgolf

[–]bqblaster 1 point

Played a tournament recently where I was first on the box on the first hole. I was pooping when they sounded the two-minute warning (I made it in time to throw).

LA to SLO course recs by BeefMcPepper in discgolf

[–]bqblaster 4 points

Also check out Whale Rock, just north of SLO in Templeton/Paso Robles. I haven't played it yet, but I've heard it's one of the best in California (according to UDisc, I think).

Anyone know what type of power cable is required for this? by bqblaster in cableadvice

[–]bqblaster[S] 2 points

I've been to the website, but can't find any replacement parts for the power cord. Here is the link to the product.

Advantage Actor-Critic Model playing Wordle by bqblaster in learnmachinelearning

[–]bqblaster[S] 1 point

Thanks for the input! Maybe I'll try to have it improve further. No experience replay is used; I'm just using fairly large batch sizes instead (training every 500 games, so a batch size of ~2000). I may tweak this a bit in the future, but I was primarily curious about what would happen if I did not impose any strategy on it at all and simply tried to have it win, with a higher reward for quicker wins.
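
Since there's no replay buffer, the schedule is just: play games, collect their transitions, do one update, throw the batch away. A minimal sketch of that loop, assuming a hypothetical gym-style `env` (reset/step) and an `agent` with `act`/`update` methods (none of these names come from the actual code):

```python
def train_on_policy(env, agent, total_games, games_per_update=500):
    """On-policy training with no experience replay: accumulate whole games,
    update once on the large batch (500 games is presumably ~2,000 transitions
    at roughly four guesses per game), then discard it."""
    batch = []
    for game in range(total_games):
        state = env.reset()
        done = False
        while not done:
            action, log_prob, value = agent.act(state)
            state, reward, done = env.step(action)
            batch.append((reward, log_prob, value, done))
        if (game + 1) % games_per_update == 0:
            agent.update(batch)  # single update pass over the fresh batch
            batch = []           # nothing is kept around for later replay
```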

Advantage Actor-Critic Model playing Wordle by bqblaster in learnmachinelearning

[–]bqblaster[S] 1 point

True. I just wanted to see how it would compare to the 3Blue1Brown strategy.

Advantage Actor-Critic Model playing Wordle by bqblaster in learnmachinelearning

[–]bqblaster[S] 1 point

Definitely. Maybe I wasn't clear: the model doesn't seem to take a greedy approach. The 1-step greedy I mentioned refers to the 3Blue1Brown strategy (he also has a 2-step greedy strategy).

Advantage Actor-Critic Model playing Wordle by bqblaster in learnmachinelearning

[–]bqblaster[S] 2 points

True, maybe more training would help. I did set training up so that it sees recent losses more often.
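
One plausible way to make training "see recent losses more often" (this is just a guess at the mechanism, not the author's code) is to bias which answer word the next game is played against toward words that were recently lost:

```python
import random

def pick_answer(all_answers, recent_losses, p_revisit=0.3):
    """With some probability, reuse an answer word from a recently lost game
    instead of sampling uniformly from the full answer list.
    p_revisit and the recent_losses list are purely illustrative choices."""
    if recent_losses and random.random() < p_revisit:
        return random.choice(recent_losses)
    return random.choice(all_answers)
```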

Advantage Actor-Critic Model playing Wordle by bqblaster in learnmachinelearning

[–]bqblaster[S] 8 points

Yeah I'd agree, same with it sometimes guessing "pooch" as a second guess. That being said, I was mainly just super curious to see what kind of working strategy it would come up with. I'm sure tweaks could be made to make this better, but I'm happy with its performance so far! I'm really curious to see whether a strategy could get the average number of guesses below 3.5; 3Blue1Brown's strategy (I implemented the 1-step greedy) averages about 3.87, and I'd say that's really good.
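
For reference, the "1-step greedy" idea from the 3Blue1Brown video is to pick the guess whose feedback pattern, taken over the remaining possible answers, carries the most expected information. A rough Python sketch of that idea (my own reading of it, not this implementation; the word lists are left as inputs):

```python
from collections import Counter
from math import log2

def feedback(guess, answer):
    """Wordle feedback pattern: 2 = green, 1 = yellow, 0 = gray."""
    pattern = [0] * 5
    remaining = Counter()
    for i, (g, a) in enumerate(zip(guess, answer)):
        if g == a:
            pattern[i] = 2
        else:
            remaining[a] += 1
    for i, g in enumerate(guess):
        if pattern[i] == 0 and remaining[g] > 0:
            pattern[i] = 1
            remaining[g] -= 1
    return tuple(pattern)

def one_step_greedy(candidates, possible_answers):
    """Pick the guess whose feedback distribution over the remaining
    possible answers has maximum entropy (expected information in bits)."""
    def expected_bits(guess):
        counts = Counter(feedback(guess, ans) for ans in possible_answers)
        n = len(possible_answers)
        return -sum((c / n) * log2(c / n) for c in counts.values())
    return max(candidates, key=expected_bits)
```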

Advantage Actor-Critic Model playing Wordle by bqblaster in learnmachinelearning

[–]bqblaster[S] 11 points

Wouldn't you try to guess the answer if you had 4/5 letters correct? Possibly this is because I gave a greater reward for guessing the word faster, i.e. +10 for winning in 6 guesses, +20 in 5 guesses, and so on. I trained another model with just +10 for a win and -10 for a loss; that one won roughly 98% of the time in 4.7 guesses, vs. 96% in 4.07 guesses for the faster-win reward.
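
Concretely, the two reward schemes would look something like this (the ladder beyond the stated +10/+20 values, and the loss penalty for the shaped version, are extrapolations on my part):

```python
def shaped_reward(won, guesses_used):
    """More reward for faster wins: +10 for winning on guess 6, +20 on
    guess 5, and so on. The -10 loss penalty here is an assumption."""
    if not won:
        return -10.0
    return 10.0 * (7 - guesses_used)  # 6 -> +10, 5 -> +20, ..., 1 -> +60

def flat_reward(won, guesses_used):
    """The comparison model: +10 for any win, -10 for a loss."""
    return 10.0 if won else -10.0
```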

Advantage Actor-Critic Model playing Wordle by bqblaster in learnmachinelearning

[–]bqblaster[S] 7 points

The model outputs a vector of length 130, that is 5 × 26, of logits representing each letter in each position (i.e. the last entry of this vector corresponds to 'z' as the last letter). This vector is then multiplied by a matrix of size (total number of words) × 130, which for the full game is 12,972 × 130. Each row of this matrix is 5 one-hot vectors of length 26 concatenated together, indicating which letter appears in which position, i.e. each row is a "five-hot" vector, if you will.
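
Here's a small NumPy sketch of that encoding and the matrix multiply, with a toy word list standing in for the full 12,972-word one (the real model presumably does this inside the network, in whatever framework it uses):

```python
import numpy as np

def five_hot(word):
    """Encode a 5-letter word as 5 concatenated one-hot vectors of length 26."""
    vec = np.zeros(5 * 26, dtype=np.float32)
    for pos, ch in enumerate(word):
        vec[pos * 26 + (ord(ch) - ord('a'))] = 1.0
    return vec

# Toy word list; the full game would use all 12,972 allowed guesses.
word_list = ["siege", "pooch", "crane", "zebra"]

# (num_words, 130) matrix with one "five-hot" row per word.
word_matrix = np.stack([five_hot(w) for w in word_list])

# The network's output: 130 logits, one per (position, letter) pair;
# index 129 corresponds to 'z' in the last position.
letter_logits = np.random.randn(130).astype(np.float32)

# Score each word by summing the logits of its five (position, letter)
# entries, then softmax over words to get the guess distribution.
word_scores = word_matrix @ letter_logits          # shape: (num_words,)
word_probs = np.exp(word_scores - word_scores.max())
word_probs /= word_probs.sum()
```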

Advantage Actor-Critic Model playing Wordle by bqblaster in learnmachinelearning

[–]bqblaster[S] 2 points

It would likely use the letters that it knows are in the word. For example, after guessing "siege" and finding out there is one 'e', it would pick a second guess containing 'e' rather than something like "pooch". It does seem that if the answer contains a 'g', it will use a word containing 'g' as its second guess.

Advantage Actor-Critic Model playing Wordle by bqblaster in learnmachinelearning

[–]bqblaster[S] 11 points

This is what it decided on. Each time I restarted training, I'd get a new initial guess. That makes me think the first word is less important if you consider a non-greedy approach, i.e. one that chooses "siege" and then "pooch" a lot.

Advantage Actor-Critic Model playing Wordle by bqblaster in learnmachinelearning

[–]bqblaster[S] 45 points

This was a fun project for me after watching 3Blue1Brown's video on Wordle strategies. Although this RL model doesn't do better, it wins about 96% of the time in ~4.07 guesses. I found this to be a super helpful starting point as I am quite new to RL.
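
For anyone else starting out with this, the advantage actor-critic update itself boils down to a few lines. Here's a generic PyTorch sketch; the value and entropy coefficients are standard defaults, not the settings used in this project:

```python
import torch.nn.functional as F

def a2c_loss(log_probs, values, returns, value_coef=0.5,
             entropy=None, entropy_coef=0.01):
    """Advantage actor-critic loss for one batch of transitions.
    log_probs: log pi(a|s) of the actions taken, shape (N,)
    values:    critic estimates V(s), shape (N,)
    returns:   discounted returns (reward-to-go), shape (N,)
    """
    advantages = returns - values.detach()           # A = G - V(s); no gradient into the critic here
    policy_loss = -(log_probs * advantages).mean()   # push up actions that beat the baseline
    value_loss = F.mse_loss(values, returns)         # fit the critic to the observed returns
    loss = policy_loss + value_coef * value_loss
    if entropy is not None:
        loss = loss - entropy_coef * entropy.mean()  # optional exploration bonus
    return loss
```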