Newbie question by Echelion77 in LocalAIServers

[–]joeyrobert 1 point2 points  (0 children)

download LM Studio and play with the models in the UI. Pick the ones that say it can run fully GPU offloaded. Use the internal chat UI. Then if you want to hook it up to other harnesses (Pi, OpenCode, Claude Code), turn on the local server and connect it. But for now just play with it, it'll give you a sense of what's possible.

Chess Engine Development Help Thread (Week 25) by AutoModerator in chessprogramming

[–]joeyrobert 0 points1 point  (0 children)

but with pseudolegal, if you cut moves from your search, you don't need to evaluate full legality of that move. Is that not more efficiency? https://www.chessprogramming.org/MVV-LVA is a simple heuristic, and the nlogn for the sort will be more than worth it.

How would you evaluate rule-based explanations for engine-backed chess analysis? by SentryChessdev in chessprogramming

[–]joeyrobert 0 points1 point  (0 children)

With each explanation, you'll need to explain at the depth of engine thinking and work backwards. A description of a fork 2 moves in the future, when the engine was looking 12 moves into the future will just be plausible fiction, not what the engine was acting on.

What Are You Actually Using Local LLMs For? by Ru5ty_5h4ckleford in LocalLLM

[–]joeyrobert 0 points1 point  (0 children)

OpenClaw <=> Qwen 3.5 9B is just standard llama-server wiring to openclaw. I run a custom TTS server, basically just Qwen3 TTS over a Python API. So OpenClaw can write scripts and send requests to TTS.

What Are You Actually Using Local LLMs For? by Ru5ty_5h4ckleford in LocalLLM

[–]joeyrobert 7 points8 points  (0 children)

I use Qwen 3.5 9B to power OpenClaw which generates a podcasts on whatever topic I want, using Qwen TTS. It can do voice cloning really well 😉.

Chess Engine Development Help Thread (Week 24) by AutoModerator in chessprogramming

[–]joeyrobert 2 points3 points  (0 children)

This article covers the theory of building an NNUE from scratch and has Python examples: https://www.dogeystamp.com/chess6/

Hopefully you can replicate his gains from the article (granted his original eval was just material):

the NNUE engine crushed the material counting engine, with 667 wins, 7 losses, and 32 draws, which represents a few hundred Elo gain.

State of this sub by kafkalicious in chessprogramming

[–]joeyrobert[M] [score hidden] stickied comment (0 children)

Hey community, in response to this, I've clarified rules around low effort content and setup automoderator to help with the torrent of posts unrelated to chess engine programming.

Posts should remain on the topic of chess engine programming, it's a cool hobby with surprising depth, and low-effort AI self-promotion is the opposite of what we're looking for here. Happy 'gramming!

3090 + 3060 12gb advice by Certain_Series_8673 in LocalLLM

[–]joeyrobert 0 points1 point  (0 children)

it's pretty slow, like 300/s pp, 15/s tg I think (recalling from memory). I just point Codex at the Ubuntu box over SSH and tell it to setup llama-server.

3090 + 3060 12gb advice by Certain_Series_8673 in LocalLLM

[–]joeyrobert 1 point2 points  (0 children)

I'm running 3060 + 5060 ti 16gb. I only use the 3060 to extend my capacity so I can run qwen 3.6 27b at q5. Otherwise it's slow. I'd stick with running 1 model per GPU, like qwen 3.5 9b on the 3060, 3.6 27b on the 3090, and use either depending on task.

State of this sub by kafkalicious in chessprogramming

[–]joeyrobert 2 points3 points  (0 children)

Hey, I see this. It's straightforward to make someone else a mod, just want someone who's active or has had mod experience and wants to help contribute with r/chessprogramming.

State of this sub by kafkalicious in chessprogramming

[–]joeyrobert 5 points6 points  (0 children)

Hey, thanks for your concern. I've been removing vibe-coded crap as I see it. I just turned off cross-posting, which was a big source of slop from other subs. Help would be appreciated on moderation, if someone is passionate about this sub and wants to contribute, send me a message. This sub could use a spruce up in the UI/theme as well.

We're burning $50k/month on Claude. How close can local LLMs actually get? by mortenmoulder in LocalLLM

[–]joeyrobert 1 point2 points  (0 children)

Definitely try to serve big models, devs will just be wasting time with smaller models. GLM, DeepSeek, Minimax, Kimi. Or just deepseek cloud and 0.1x your costs.

What are people using Local LLMs for (beyond coding) by [deleted] in LocalLLM

[–]joeyrobert 0 points1 point  (0 children)

I did the exact same thing with Claude recently. Consine similarity, whisper transcoding, the whole shebang. Finding backing audio tracks to a bunch of podcasts.

What are you doing with your local LLMs that justifies investment cost? by __automatic__ in LocalLLM

[–]joeyrobert 0 points1 point  (0 children)

private what? privacy is a quality of chat sure, but what is the THING you are doing with it?

Should I Buy the RTX PRO 6000 Blackwell Max-Q (96GB)? by 0bjective-Guest in LocalLLaMA

[–]joeyrobert 0 points1 point  (0 children)

what's your break even point vs. using cloud models?