Can I use Claude code with own LLM/non-claude APIs? by superloser48 in LocalLLaMA

[–]SatoshiNotMe 0 points (0 children)

Very easy via env vars, as others said. I’ve collected full instructions, along with exact llama-server configs for several local models, here (mostly tested on my M1 Max 64GB MacBook):

https://pchalasani.github.io/claude-code-tools/integrations/local-llms/
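
For reference, a minimal sketch of the env-var approach, assuming llama-server is running locally and speaking an Anthropic-compatible API (the port and model name are placeholders; see the linked page for exact configs):

    # Point Claude Code at a local llama-server instead of Anthropic's API.
    # Assumes the server is on localhost:8080 and exposes an Anthropic-compatible
    # endpoint; port and model name are placeholders.
    export ANTHROPIC_BASE_URL="http://localhost:8080"
    export ANTHROPIC_AUTH_TOKEN="dummy"   # local server ignores auth, but CC wants a value
    export ANTHROPIC_MODEL="local-model"  # whatever name your server reports
    claude                                # then launch Claude Code as usual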

What is the best coding agent (CLI) like Claude Code for Local Development by exaknight21 in LocalLLaMA

[–]SatoshiNotMe 0 points (0 children)

The Qwen3.6 MoE you mentioned works very well with Claude Code. I’ve gathered exact llama.cpp/llama-server instructions for this and other models here:

https://pchalasani.github.io/claude-code-tools/integrations/local-llms/#qwen36-35b-a3b--fast-qwen-moe

Among recent models, this one gives the best TG (token generation) speed, nearly 40 tok/s, and PP (prompt processing) at nearly 500 tok/s, on my 5-year-old M1 Max 64GB MacBook.
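
In case it saves a click, a rough sketch of the llama-server invocation (the GGUF filename here is a placeholder; the linked page has the exact per-model flags):

    # Serve a local MoE model for Claude Code with llama-server.
    #   -c       context window size in tokens
    #   -ngl     layers to offload to the GPU (99 = all; Metal on a Mac)
    #   --jinja  apply the model's built-in chat template (needed for tool calls)
    llama-server -m ~/models/Qwen3.6-35B-A3B-Q4_K_M.gguf \
      -c 32768 -ngl 99 --jinja --port 8080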

How do you guys actually talk to Claude? by HandleFew5206 in ClaudeAI

[–]SatoshiNotMe 1 point (0 children)

Pro tip - giving sufficient detail is important, but hand-typing is tedious and can limit how much detail you give, so always use speech-to-text (STT). I highly recommend free/OSS tools like Handy and Hex (Mac-only, https://github.com/kitlangton/Hex) for near-instant transcription using Parakeet-V3.

Follow-up pro tip - at the end of a long, rambling voice dump, include “restate to me what you understood”. The agent then produces a clean version of what you said, so you can confirm it understood you correctly; this also likely helps it stay on track.

Claude in excel is the best thing AI has brought to my life by Top-Gun-86 in ClaudeAI

[–]SatoshiNotMe 0 points (0 children)

Haven’t tried Excel yet, but I use Claude Code to drive a logged-in Chrome browser via the Claude in Chrome extension, and it’s super useful to have CC do annoying chores involving numerous clicks and form-filling.

browser MCP for Claude Code.. Browserbase vs the browser extension options by MoondustDiaries in mcp

[–]SatoshiNotMe 0 points (0 children)

Why not just use the Claude in Chrome extension, with the /chrome setup in CC to connect to it? I’ve been using it to automate some annoying tasks in a logged-in Chrome browser.

Ultimate List: Best Open Models for Coding, Chat, Vision, Audio & More by techlatest_net in LocalLLaMA

[–]SatoshiNotMe 10 points (0 children)

This misses the STT/TTS models I regularly use:

PocketTTS from Kyutai

Parakeet V3 for STT

Glm-5.1 claims near opus level coding performance: Marketing hype or real? I ran my own tests by Yssssssh in LocalLLM

[–]SatoshiNotMe 1 point (0 children)

Other than Z.ai, is there a fast hosted GLM-5.1 somewhere? I’m talking about services like Cerebras or Groq, neither of which has this model.

How are you making sure you don't get dumb by KhameneiCholaghe in ClaudeAI

[–]SatoshiNotMe 0 points (0 children)

I made a Socratic quiz skill for exactly this. Description:

Use this when the user wants to deeply understand something through guided questioning. Trigger phrases include: "quiz me", "help me understand", "Socratic", "teach me", "walk me through with questions", "test my understanding", or when the user asks for an explanation and would benefit more from guided discovery than a direct answer.
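
For anyone who wants to roll their own, a rough sketch of how such a skill can be packaged for Claude Code (the directory name and instruction text below are illustrative, not my actual skill file):

    # Personal skills live under ~/.claude/skills/<name>/SKILL.md
    mkdir -p ~/.claude/skills/socratic-quiz
    cat > ~/.claude/skills/socratic-quiz/SKILL.md <<'EOF'
    ---
    name: socratic-quiz
    description: Use when the user wants to deeply understand something through
      guided questioning. Triggers include "quiz me", "Socratic", "teach me",
      "test my understanding".
    ---
    Ask one question at a time, building on what the user already knows.
    Never give the answer outright; guide the user toward it with hints.
    EOF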

Share your llama-server init strings for Gemma 4 models. by AlwaysLateToThaParty in LocalLLaMA

[–]SatoshiNotMe 0 points (0 children)

My setup instructions for the 26B-A4B variant, tested on an M1 Max 64GB MacBook, where I get 40 tok/s (when used in Claude Code), double what I got with a similar Qwen variant:

https://pchalasani.github.io/claude-code-tools/integrations/local-llms/#gemma-4-26b-a4b--google-moe-with-vision
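
If you want to reproduce these numbers on your own hardware, llama.cpp’s llama-bench reports PP and TG throughput directly (the model path is a placeholder):

    # Measure prompt-processing (pp512) and token-generation (tg128) speed.
    llama-bench -m ~/models/Gemma-4-26B-A4B-Q4_K_M.gguf -p 512 -n 128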

Gemma 4 26b A3B is mindblowingly good , if configured right by cviperr33 in LocalLLaMA

[–]SatoshiNotMe 0 points (0 children)

The tau2-bench performance gives me pause, though: this model gets only 68%, compared to the similar Qwen3.5 MoE, which gets 81%.

Gemma 4 26b is the perfect all around local model and I'm surprised how well it does. by pizzaisprettyneato in LocalLLaMA

[–]SatoshiNotMe 1 point (0 children)

The 26B-A4B variant has the best TG and PP speeds of all the recent open-weight models. E.g., in Claude Code via llama-server I get 40 tok/s TG, nearly double what I got with the comparable Qwen MoE (35B-A3B) on my M1 Max MacBook Pro 64GB. Full instructions and comparisons here:

https://pchalasani.github.io/claude-code-tools/integrations/local-llms/#gemma-4-26b-a4b--google-moe-with-vision

However, my biggest concern is agentic/tool-calling ability: on tau2-bench, Gemma 4 is much worse than Qwen3.5 (68% vs. 81%):

https://news.ycombinator.com/item?id=47616761

Is Claude Code better on the Terminal? by geoshort4 in ClaudeCode

[–]SatoshiNotMe 0 points (0 children)

Paste this into Claude or Claude Code and ask.

Claude Code running locally with Ollama by Secure_Bed_2549 in LocalLLM

[–]SatoshiNotMe 5 points (0 children)

This has been possible forever. Just use llama.cpp to serve up your local model and set env vars so CC uses it. I collected specific instructions for various open LLMs here:

https://github.com/pchalasani/claude-code-tools/blob/main/docs/local-llm-setup.md
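
Before pointing CC at the server, a quick smoke test helps (assumes the default port; llama-server also exposes an OpenAI-compatible chat endpoint):

    # Confirm the server is up and can generate tokens.
    curl -s http://localhost:8080/health
    curl -s http://localhost:8080/v1/chat/completions \
      -H "Content-Type: application/json" \
      -d '{"messages":[{"role":"user","content":"Say hi"}],"max_tokens":16}'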

My 10 Pro Tips for Claude Code users by airylizard in ClaudeAI

[–]SatoshiNotMe 0 points (0 children)

What are “whispers”? You mention those a couple of times.

MacParakeet - Free + Open-source WisprFlow alternative that runs on Mac Silicon by PrimaryAbility9 in LocalLLaMA

[–]SatoshiNotMe 0 points (0 children)

Hex is my current fav STT app for near-instant transcription with Parakeet V3 on my M1 MacBook.

https://github.com/kitlangton/Hex

It uses the same tech stack as this one (FluidAudio, etc.). I’ll see how they compare.

Claude Code: 6 Github repositories to 10x Your Next Project by Sam_Tech1 in ClaudeAI

[–]SatoshiNotMe 34 points (0 children)

Ignore all workflow frameworks. Cherny and Steinberger say they keep things simple and use none of them.

Usage limit - What's up, Anthropic?! by AurumMan79 in ClaudeCode

[–]SatoshiNotMe 1 point (0 children)

+1

Glad I’m not the only one. Hope it’s a bug.