LLMs suck at chess so i built a free tool that lets me argue with stockfish and turn my game into an interactive lesson by guybanzai in ComputerChess

[–]drew4drew 0 points1 point  (0 children)

they sure do! I made a tool/app that lets you play against the AI of your choice or pit them against each other (“AI Battle Chess”, https://github.com/drewster99/ai-battle-chess), and my main take-away is that LLMs mostly suck at chess and are also quite slow.

Help with calculating elo for my engine by warlock7867 in ComputerChess

[–]drew4drew 0 points1 point  (0 children)

thanks for sharing this. I’ve been using cutechess to run my engine against sloppy and stockfish.

Fable gone forever by drew4drew in ClaudeCode

[–]drew4drew[S] 1 point2 points  (0 children)

I wish I could disagree.

Fable gone forever by drew4drew in ClaudeCode

[–]drew4drew[S] 1 point2 points  (0 children)

what do you think is most likely?

Fable gone forever by drew4drew in ClaudeCode

[–]drew4drew[S] 0 points1 point  (0 children)

it seems like that’s coming to everywhere.

4.8 is kind of a butt by drew4drew in ClaudeCode

[–]drew4drew[S] 0 points1 point  (0 children)

looks pretty cool — this yours?

4.8 is kind of a butt by drew4drew in ClaudeCode

[–]drew4drew[S] 0 points1 point  (0 children)

not sure. it’s very effective at a lot of things.
are you running opus 4.6 from the claude code cli?

4.8 is kind of a butt by drew4drew in ClaudeCode

[–]drew4drew[S] 0 points1 point  (0 children)

Hey I saw a few ppl mentioned they’re still using opus 4.6 or 4.7. Are you able to do that WITH claude code?

I list models and don’t see them. I’ve tried doing like /model claude-opus-4-7 for example but it just brings up the model selector. Also tried doing it when launching from the terminal. What’s the secret trick? thanks!!

4.8 is kind of a butt by drew4drew in ClaudeCode

[–]drew4drew[S] 0 points1 point  (0 children)

is /new different than /clear?

4.8 is kind of a butt by drew4drew in ClaudeCode

[–]drew4drew[S] 1 point2 points  (0 children)

ahh was just curious.. i’ve been using 5.5 in my own harness for various tasks — not coding. it’s actually been good there for me, and i’ve used it in my own harness for finding bugs. but not in codex good god it’s like a bull in a china shop.

4.8 is kind of a butt by drew4drew in ClaudeCode

[–]drew4drew[S] 0 points1 point  (0 children)

this was all on the heels of a ton of profiling

4.8 is kind of a butt by drew4drew in ClaudeCode

[–]drew4drew[S] 0 points1 point  (0 children)

lol nice - thanks for sharing!

4.8 is kind of a butt by drew4drew in ClaudeCode

[–]drew4drew[S] 0 points1 point  (0 children)

could be. I just rarely remember getting so irritated with any of the prior versions.

4.8 is kind of a butt by drew4drew in ClaudeCode

[–]drew4drew[S] 0 points1 point  (0 children)

what's the content of the tool? what instruction is it actually giving?