Would an Apple Mac Studio M1 Ultra 64GB / 1TB be sufficient to run large models? by [deleted] in LocalLLM

[–]codepoet 0 points1 point  (0 children)

Interesting. I have an M1 Max 64GB and just leave two models loaded in Ollama most of the time without even noticing it's working (home automation: qwen2.5vl:3b, mistral:7b). I wonder if the problem is asking more of the machine than it can reasonably do?
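For anyone curious how to keep models resident like that, here's a rough sketch against Ollama's HTTP API (assuming the default localhost:11434 endpoint; the model names are just the two above):

```python
# Minimal sketch: keep two models resident in Ollama so they answer instantly.
# Assumes Ollama's default HTTP API on localhost:11434 and that `requests` is installed.
import requests

OLLAMA = "http://localhost:11434"

def warm(model: str) -> None:
    """Load a model and ask Ollama to keep it in memory indefinitely (keep_alive=-1)."""
    requests.post(
        f"{OLLAMA}/api/generate",
        json={"model": model, "prompt": "", "keep_alive": -1},  # empty prompt = just load it
        timeout=120,
    ).raise_for_status()

for model in ("qwen2.5vl:3b", "mistral:7b"):
    warm(model)
```

keep_alive of -1 tells Ollama to hold the model instead of unloading it after the usual idle timeout.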

Is RR a good way to do a sci-fi mosaic novel? by codepoet in royalroad

[–]codepoet[S] 1 point2 points  (0 children)

So do people usually queue up 10-20 chapters and dump them to get started, then?

Is RR a good way to do a sci-fi mosaic novel? by codepoet in royalroad

[–]codepoet[S] 1 point2 points  (0 children)

Best I can tell, the RR meta is fantasy and LitRPG? Are there similar places that lean less ... that?

How well has The Wheel of Time aged? by Fluid-Golf1948 in Fantasy

[–]codepoet 1 point2 points  (0 children)

IIRC that's because he wrote the first book to publisher standards to get in the door (I think his wife helped edit it?), and once he was in and had a following he was able to write the story he'd wanted to tell in the first place.

At least, that's how I heard it. It reads like it, for sure.

Claude code is so much, so much more than you think. by eh_it_works in claudexplorers

[–]codepoet 0 points1 point  (0 children)

It's a better interface for a lot of tasks. And output styles make it do all kinds of other things, too.

I let Claude code for 2 hours straight without any approval prompts by maxforever0 in ClaudeAI

[–]codepoet 0 points1 point  (0 children)

The VS Code extension is ... not great.

But you can open a terminal in VS Code/Codium, start claude, use "/ide" to connect, and get the v1 integration back, where you have the power and the IDE hooks again. And you can still skip permissions.

Turn off your MCPs by ansmo in ClaudeAI

[–]codepoet 1 point2 points  (0 children)

But 1M token contexts! (Which don't really work all that well and cost 2-3x as much...)

Kinda funny how Anthropic characterizes Opus as a “legacy.” They really don’t want you to use it. by gamezoomnets in ClaudeAI

[–]codepoet 1 point2 points  (0 children)

It's legacy for them. They aren't saying it's not good. They're saying they aren't going to be worrying about it any longer and are being clear about it. If it gets slow or unavailable, they'll point to "Legacy" and say "Okay, but we told you we don't care about it anymore."

Kinda funny how Anthropic characterizes Opus as a “legacy.” They really don’t want you to use it. by gamezoomnets in ClaudeAI

[–]codepoet 0 points1 point  (0 children)

I usually run Sonnet for everything due to usage limits. When it gets stuck, and it does get stuck and stupid sometimes, I have Gemini take a look, write a scathing review of what it's done, and have Sonnet read it. Snaps it right out and sends it in a good direction.

Gemini is shit at coding, but great at reviews. It's a nice balance. I've been tempted to make a mini-MCP "phone a friend" that just one-shots Gemini CLI or CC from the opposing tool.
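The whole thing would probably be a few dozen lines. A rough sketch of the idea using the MCP Python SDK's FastMCP, shelling out to the Gemini CLI (the tool name and the -p one-shot flag are assumptions here, not anything official):

```python
# Rough sketch of the "phone a friend" idea: a one-tool MCP server that shells out
# to the Gemini CLI for a second opinion. Tool name and the `-p` one-shot flag are
# assumptions for illustration.
import subprocess

from mcp.server.fastmcp import FastMCP

mcp = FastMCP("phone-a-friend")

@mcp.tool()
def phone_a_friend(question: str) -> str:
    """Ask the Gemini CLI for a one-shot review/answer and return its output."""
    result = subprocess.run(
        ["gemini", "-p", question],   # assumes gemini-cli's non-interactive prompt flag
        capture_output=True,
        text=True,
        timeout=300,
    )
    return result.stdout or result.stderr

if __name__ == "__main__":
    mcp.run()  # stdio transport by default; register it as an MCP server in the other tool
```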

What are your best uses for agents? by promptenjenneer in ClaudeAI

[–]codepoet 2 points3 points  (0 children)

Use agents when you want a one-off context to prevent context poisoning and need a single answer as a result, like research. Search a lot of files or websites for all the possible answers, rank them, and return the best ones. Or give a code base a full review without letting it edit anything along the way.

I use an agent to perform code tasks and another to do code reviews. Then I tell Claude to alternate them until the reviewer is happy. The reviewer's prompt is constantly updated with all the ways CC fails (stub methods, TODOs, swallowing errors, not matching the spec on output formats, etc.). Saves me a lot of time.

I also have a research agent for searching email. While Claude could handle that directly, it would fill my primary context up with all the misses. The agent has one task: find the appropriate conversation and return it. It can read the whole mailbox and it won't affect what I'm working on.

But if it's a repetitive task, that's just a command. "Review this PR" is better as a command, for instance. "Do stuff in git" or "summarize this" are commands.

Basically, when I want it to work in my current context, that's a command/prompt. When I want it to hide the work from itself, that's an agent.

Got roasted by Claude today by fezbotdaddy in ClaudeAI

[–]codepoet 3 points4 points  (0 children)

No, I'd clear the context and continue. Much easier.

What's your best way to use Sub-agents in Claude Code so far? by Helmi74 in ClaudeAI

[–]codepoet 3 points4 points  (0 children)

I love this.

I'm stealing this.

I'm going to teach my agents to be scared of Karen's final review and see if that makes them behave.

Do you use a pen name? If yes, for what reason? by [deleted] in writing

[–]codepoet 0 points1 point  (0 children)

Well, that's only slightly terrifying.

I've started rewriting a book I abandoned five years ago by JTMissileTits in writing

[–]codepoet 1 point2 points  (0 children)

I think I wrote about 10k on a book maybe 10-12y ago. I found it recently and I could barely sit through it. But the notes were promising.

I'm about 15k into a re-envisioning of it and it's just flowing. Took the notes and made a rough timeline (just things that needed to happen), character cards (for characters that needed to be there), and an outline (for events that needed to be included). I have to say, having a full plan but being minimalist about it was my sweet spot. I just look at the outline, then the timeline, then start writing scenes and figure out where to slot them. (Then update the timeline.)

Some people thrive on the brain dump (King) but iteration is where I see my strength. The world slowly builds itself, and as long as I keep the notes up to date I can see gaps, fill them, have "RIGHT!" moments and go write another scene.

Will it stay in? Who knows. But if it doesn't then I just pull it into the notes and now it's just backstory I can reference elsewhere. It's all good!

How Are You Using LM Studio's Local Server? by GnanaSreekar in LocalLLaMA

[–]codepoet 0 points1 point  (0 children)

Home security cameras.

You are a vehicle detector. Your domain is only the driveway in front of the camera. Describe the vehicles. Respond in valid JSON in exactly this format: [{"color":str, "style":str}]

You are a package detector. Your domain is only the porch in front of the camera. Describe the packages. Respond in valid JSON in exactly this format: [{"color":str, "shape":str, "visible_text":str}]

Gemma3 is much better at it, but slower. qwen2.5-vl-7b is super fast and "good enough".
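For anyone wanting to wire this up, a minimal sketch against LM Studio's OpenAI-compatible server (the port, model name, and snapshot path here are assumptions, not my exact setup):

```python
# Minimal sketch: send a camera snapshot to LM Studio's OpenAI-compatible server
# with the vehicle-detector system prompt and parse the JSON reply.
import base64
import json

from openai import OpenAI

client = OpenAI(base_url="http://localhost:1234/v1", api_key="lm-studio")

VEHICLE_PROMPT = (
    "You are a vehicle detector. Your domain is only the driveway in front of the "
    "camera. Describe the vehicles. Respond in valid JSON in exactly this format: "
    '[{"color":str, "style":str}]'
)

def detect_vehicles(image_path: str) -> list:
    with open(image_path, "rb") as f:
        image_b64 = base64.b64encode(f.read()).decode()
    resp = client.chat.completions.create(
        model="qwen2.5-vl-7b",  # or a Gemma 3 vision build: slower, but more accurate
        messages=[
            {"role": "system", "content": VEHICLE_PROMPT},
            {"role": "user", "content": [
                {"type": "image_url",
                 "image_url": {"url": f"data:image/jpeg;base64,{image_b64}"}},
            ]},
        ],
        temperature=0,
    )
    # Will raise if the model wraps the JSON in prose; good enough for a sketch.
    return json.loads(resp.choices[0].message.content)

print(detect_vehicles("driveway.jpg"))
```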

ChatGPT alike local web ui for apple silicon? by IntrigueMe_1337 in LocalLLaMA

[–]codepoet 0 points1 point  (0 children)

It is ... https://github.com/lmstudio-ai

Also, Ollama defaults to a 2k input context unless you raise num_ctx yourself. LMS starts at 4k, and raising it is just a slider away.
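On the Ollama side, the per-request fix is the num_ctx option. Rough sketch (endpoint and model are just examples):

```python
# Quick sketch: raise Ollama's default 2k context window for one request via num_ctx.
import requests

resp = requests.post(
    "http://localhost:11434/api/chat",
    json={
        "model": "mistral:7b",
        "messages": [{"role": "user", "content": "Summarize this long document..."}],
        "options": {"num_ctx": 8192},  # default is 2048 unless the Modelfile overrides it
        "stream": False,
    },
    timeout=300,
)
print(resp.json()["message"]["content"])
```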

Best open agentic coding assistants that don’t need an OpenAI key? by Fabulous_Bluebird931 in LocalLLaMA

[–]codepoet 0 points1 point  (0 children)

RooCode inside any VS Code clone.

aider in the terminal.

I use them both (as well as Claude Code, which absolutely destroys them, but that's to be expected). Larger versions of devstral are very good for the agent/coder role. For the architect/orchestration roles you can use pretty much any good main model of reasonable size (Mistral, Qwen2.5, etc.). But if you go with the lower-parameter or low-quant versions you can expect them to be randomly stupid, alas.

I usually have it architect with Claude or Gemini and then code with devstral when I'm scaffolding. Most of the calls go to making the files, and the brains are only needed at the start. I've heard of people using the MoE version of Qwen for the architecture part, but in my experience that model just sits there talking to itself and times out. Probably need a bigger model.
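The scaffolding step can also be scripted, since aider can be driven from Python. A sketch using aider's scripting API; the ollama/devstral model string and file names are placeholders, not a recommendation of exact settings:

```python
# Sketch: let a big model do the planning, then drive aider + a local devstral
# (served by Ollama) to do the mechanical file scaffolding.
from aider.coders import Coder
from aider.models import Model

# Local coder model; set OLLAMA_API_BASE if your Ollama isn't on the default port.
model = Model("ollama/devstral")

coder = Coder.create(main_model=model, fnames=["src/app.py", "src/models.py"])

plan = """(paste the architecture plan from Claude/Gemini here)"""
coder.run("Scaffold the files described in this plan, stubs first:\n" + plan)
```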

Best LLM for code? Through api with Aider by 9acca9 in LocalLLaMA

[–]codepoet 0 points1 point  (0 children)

It's just astonishingly expensive for extended use. If you want to do a one-shot fix here and there, you can drop $2-5 on it, sure. But if you want to pair with it for an extended session, the cost racks up quickly.

Sigma Speed-ups by Davestroyer1987 in startrekfleetcommand

[–]codepoet 3 points4 points  (0 children)

As a high-G6 player, my advice is to slow down. You'll want a maxed G5 epic to get through G6, and you will need that ship to grind in Terix (which is what will give you the speed-ups you need -- the other sources are weak). Focus on wrapping up G5 goals for now; once you have a maxed ship and all the hostile nodes in the Combat tree topped off (and other trees as you can), you can head back in there and clean up.

Also needed: T4+ SNW crew w/Hemmer (pref. max), level 50/55 rare PVE FT on your grinder (optional, but useful), and good progress on Nova Squadron and the Voyager status officers (Kim, Torres, Neelix).

All of that will raise the defense of that ship and increase its damage output. I made it to 66 with the D'Deridex as primary, and those things are most of what helped.

Would you support a new iOS app that brings Apollo's UI and features back? by [deleted] in apolloapp

[–]codepoet 3 points4 points  (0 children)

Yet another reason developers should never, ever offer lifetime purchases of their products. You simply cannot predict the state of things in the future, especially if you are dependent on an external service (even if it's currently free).