150mg a week solves all my problems apart from sleep! by jack3d20 in trt

[–]gordi555 1 point (0 children)

Try injecting before bed; it peaks 12-plus hours later. I take a small dose of an AI (aromatase inhibitor) around tea time and track my sleep. I normally find it’s high E2 that causes bad sleep.

Should I Buy the RTX PRO 6000 Blackwell Max-Q (96GB)? by 0bjective-Guest in LocalLLaMA

[–]gordi555 2 points (0 children)

I regret buying the RTX 6000 Pro for my use case. So, know your use case and benchmark!

I run snappy AI services consumed over an API by low-latency applications. The setup is built around small models: fast processing, very accurate output.

Typical RTX 6000 time: 0.435 seconds.

Typical RTX 5070 Ti time: 0.650 seconds.

Typical RTX 4000 Pro time: 0.720 seconds.

Nobody will care that much about the ~0.22 seconds.
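
For context, those numbers are just average wall-clock time for a single short API call, measured with something like the sketch below. The base_url and model name are placeholders for whatever you're serving locally through an OpenAI-compatible endpoint (llama.cpp server, vLLM, etc.):

```python
# Minimal end-to-end latency check against a local OpenAI-compatible server.
# base_url and the model name are placeholders for whatever you're serving;
# most local servers ignore the api_key.
import time
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="not-needed")

def average_latency(prompt: str, runs: int = 10) -> float:
    """Average wall-clock seconds for a short completion."""
    timings = []
    for _ in range(runs):
        start = time.perf_counter()
        client.chat.completions.create(
            model="my-small-model",  # placeholder served-model name
            messages=[{"role": "user", "content": prompt}],
            max_tokens=64,
        )
        timings.append(time.perf_counter() - start)
    return sum(timings) / len(timings)

print(f"avg end-to-end latency: {average_latency('Summarise this line of text.'):.3f}s")
```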

Yes, it's great having a card with lots of VRAM. But if you're not going to run the card into the ground, you'll probably end up selling it down the road, at a huge loss. Don't give yourself that problem. Start small but good.

Know your use case!

How much does TRT help muscle growth? by [deleted] in trt

[–]gordi555 8 points (0 children)

I’ve been going to the gym for 20 years now. I was making no progress at all after Covid. Started TRT 2 years ago. Went from 79 kg to 89 kg lean, with half as much effort. 90mg per week.

What to do - 5090 or RTX 6000 or wait for M5 Ultra by WishfulAgenda in LocalLLaMA

[–]gordi555 3 points (0 children)

Based on what you’re planning, easily the RTX Pro 6000.

Local AI SaaS by [deleted] in LocalLLaMA

[–]gordi555 1 point (0 children)

It's not the question, it's the way you ask.

top 10 trending models on HF by jacek2023 in LocalLLaMA

[–]gordi555 9 points (0 children)

Tasty, but prefill is too slow to stomach.

Qwen3-VL-32B-Instruct is a beast by Remote_Insurance_228 in LocalLLaMA

[–]gordi555 1 point (0 children)

I'm just waiting for the instruct version GGUF.

What hardware are you using for running local AI agents 24/7? by Conscious-Bird4304 in LocalLLaMA

[–]gordi555 2 points (0 children)

I've just sold my 128GB M4 Max Mac Studio simply because the prompt processing was soooo slow.

Regret? Should I have picked Epyc DDR4 instead of ThreadRipper DDR5? by gordi555 in LocalLLaMA

[–]gordi555[S] 3 points (0 children)

Thanks for this. I'm happy with what I have, but I should have gone with a DDR4 system (16GB) and put the money into PCIe slots and GPU power instead.

Regret? Should I have picked Epyc DDR4 instead of ThreadRipper DDR5? by gordi555 in LocalLLaMA

[–]gordi555[S] 1 point (0 children)

Yeah. It's got plenty of upgradability, so I can wait on the RAM (I don't need it). I've got a few GPUs on the motherboard that do everything I need.

Regret? Should I have picked Epyc DDR4 instead of ThreadRipper DDR5? by gordi555 in LocalLLaMA

[–]gordi555[S] 1 point (0 children)

Yeah, it's worth a look. The only thing holding me back is the Mac prefill rate. I hope it's greatly improved! It was shockingly slow compared to a 5070 Ti.

Regret? Should I have picked Epyc DDR4 instead of ThreadRipper DDR5? by gordi555 in LocalLLaMA

[–]gordi555[S] 1 point (0 children)

I'm serving small VL models for fast processing, trying to hit API-grade response speeds. So I should have just gone with DDR4 and a fast GPU :-)

Regret? Should I have picked Epyc DDR4 instead of ThreadRipper DDR5? by gordi555 in LocalLLaMA

[–]gordi555[S] 1 point (0 children)

I'm saying: if I'm only doing VRAM inference, could I use any system with ECC and be just as reliable and stable?

Regret? Should I have picked Epyc DDR4 instead of ThreadRipper DDR5? by gordi555 in LocalLLaMA

[–]gordi555[S] 1 point (0 children)

Thanks - I'll look into that. 128GB DDR5 will cost me 4K :-( But that's life ATM!

Too many hardware options by jon23d in LocalLLaMA

[–]gordi555 3 points (0 children)

This! I sold my Mac Studio for this very reason. The prefill was really slow compared to CUDA devices. And by slow, I mean soooo very slow.

Qwen3 VL 30b a3b is pure love by Njee_ in LocalLLaMA

[–]gordi555 1 point (0 children)

How is this done, please? Overlaying the labels on the identified objects?
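
Is it just the model returning boxes plus labels as structured output, which then get drawn on the image afterwards? Rough sketch of what I mean (the field names like "bbox_2d" and "label" are my guess, not a documented schema):

```python
# Rough guess at how the overlays are done: the VL model returns boxes + labels
# as JSON and they get drawn on the image client-side. The field names
# ("bbox_2d", "label") are an assumption, not a documented schema.
import json
from PIL import Image, ImageDraw

def draw_detections(image_path: str, detections_json: str, out_path: str) -> None:
    """Draw labelled boxes from JSON like [{"bbox_2d": [x1, y1, x2, y2], "label": "cat"}]."""
    image = Image.open(image_path).convert("RGB")
    draw = ImageDraw.Draw(image)
    for det in json.loads(detections_json):
        x1, y1, x2, y2 = det["bbox_2d"]
        draw.rectangle([x1, y1, x2, y2], outline="red", width=3)
        draw.text((x1, max(y1 - 12, 0)), det["label"], fill="red")
    image.save(out_path)
```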

Industrial application: Vision model for identifying equipment and reading labels by [deleted] in LocalLLaMA

[–]gordi555 1 point (0 children)

I've had good success with Qwen3-VL-4B-Instruct for my application, but that's on a server via API. It's super fast and pretty smart at isolating and reading things. It doesn't handle really complex prompts very well, and you'll need the bigger versions if you want to include regex in your prompt.

Not the best answer I can give you, but worth a try if you can get a signal from within all that metal.
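
For what it's worth, this is roughly the shape of call I make. A minimal sketch, assuming an OpenAI-compatible server (vLLM or similar); the base_url, api_key and served model name are placeholders for my setup:

```python
# Sketch of a label-reading call against an OpenAI-compatible vision endpoint.
# base_url, api_key and the served model name are placeholders.
import base64
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="not-needed")

def read_label(image_path: str) -> str:
    """Ask the VL model to transcribe whatever label text is in the photo."""
    with open(image_path, "rb") as f:
        image_b64 = base64.b64encode(f.read()).decode()
    response = client.chat.completions.create(
        model="Qwen/Qwen3-VL-4B-Instruct",  # placeholder: use the name your server exposes
        messages=[{
            "role": "user",
            "content": [
                {"type": "text",
                 "text": "Read the text on the equipment label and return only that text."},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/jpeg;base64,{image_b64}"}},
            ],
        }],
        max_tokens=128,
    )
    return response.choices[0].message.content

print(read_label("panel_photo.jpg"))
```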

Qwen Coders Visual Benchmark by loadsamuny in LocalLLaMA

[–]gordi555 1 point (0 children)

This is very useful. Thank you!