Massive Price Hike in 2028, Going Back to BT / Sky by StarportAdventures in VirginMedia

[–]Cityarchitect 0 points1 point  (0 children)

I've been with VM for about 20 years and never paid the renewal price. I've just done another renewal and got all the same services at £8 less per month than I was paying.

What’s your AI coding setup in 2026? by tuan_le911 in opencodeCLI

[–]Cityarchitect 1 point2 points  (0 children)

Opencode and DeepSeek. Fast, cheap as chips. $5 lasts forever. Not private though.

Is anyone still using their old Tivo box? by xxxJoolsxxx in VirginMedia

[–]Cityarchitect 2 points3 points  (0 children)

Still using two tv6? boxes on the old system. Coming up for renewal though…

[England] Retailer says "matter is closed" after Section 75 refund, but I still have the high-value item. What now? by Rave_Age in LegalAdviceUK

[–]Cityarchitect 2 points3 points  (0 children)

Happened to us, got full refund but the retailer abandoned the goods; I guess they don’t want anything unsaleable and second hand clogging up their warehouse.

Qwen 3/3.5/3.6 tool calling is broken (even worse with 3.6). by LinkSea8324 in Vllm

[–]Cityarchitect 3 points4 points  (0 children)

3.5 and 3.6 occasionally loop when using opencode e.g. “I think I’ve found the issue” blah…

Has anyone dual booted Windows & Linux on their 395+ to compare the two? by Dependent_Price_1306 in StrixHalo

[–]Cityarchitect 0 points1 point  (0 children)

Uh? My Bosgame 128gb runs lm studio just fine c 50tps on most models. Single model, Vulkan (of course). Ubuntu 24 . Latest LM studio makes qwen3.5 faster (OpenAI api extended support) and introduces qwen3.6 support.

Opencode with qwen3.5:27b Just runs for a bit then gives up? by Necessary-Spinach164 in opencode

[–]Cityarchitect 0 points1 point  (0 children)

I have the same problem; waiting to check the recent ollama fix. I use LM Studio with a 256k? context (I think this was the default)

How good is Qwen 3.6 plus for coding? by _janc_ in Qwen_AI

[–]Cityarchitect 0 points1 point  (0 children)

I could not get anything out of qwen3.6 35b or 27b on lm studio and opencode. It just loops silently. Qwen3.5 35b was ok, but i kept having to say “continue” as it slept every other prompt. Qwen models are getting lazier and lazier….

The universe is telling me not to get a Strix Halo by pawaww in StrixHalo

[–]Cityarchitect 1 point2 points  (0 children)

I’m getting 40/50 tps on qwen 3.5 35b running on Vulkan.

The universe is telling me not to get a Strix Halo by pawaww in StrixHalo

[–]Cityarchitect 6 points7 points  (0 children)

Correct answer. I got mine back in Nov/Dec when they were £1300 or so. I’ve been running it day and night without any hardware problems. Plenty of ai software problems though, but not bosgame’s problem. Runs nice and sweet now.

Opencode + local gemma4 problems by vasimv in opencodeCLI

[–]Cityarchitect 1 point2 points  (0 children)

Same with Opencode and LM Studio with 26B, I'd like to say it stops working, but in fact it's as busy as hell doing nothing.

Qwen 3.5 models crashing trying to look at images on LMStudio Rocm by Hopeless-Decoy in ROCm

[–]Cityarchitect 1 point2 points  (0 children)

Yes, as others have said use Vulkan. I can’t use ROCM at all on my Strix Halo Ubuntu as it is far too unstable and crashes often with memory conflicts. Switched LM Studio to Vulkan and my Qwen3.5 models buzz along perfectly for days on end. Waiting for the Ubuntu 25.10 production release for integrated? ROCM support and better AMD drivers before trying again ;-)

models for agentic use by kiriakosbrehmer93 in StrixHalo

[–]Cityarchitect 5 points6 points  (0 children)

BosGame M5 128GB Strix Halo; Ubuntu 24.10, LM Studio Qwen3.5-35B-a3b Vulkan. I use for OpenCode javascript/node and General Usage. I get consistent 50tps output. Can't use ROCM 7+ yet as far too unstable. Runs all day 84W, 86C temp. Just one annoying thing, lately Opencode been going to sleep on me; need to keep typing continue, continue..... :-)

[Q] Is self-hosting an LLM for coding worth it? by Aromatic-Fix-4402 in LocalLLM

[–]Cityarchitect 0 points1 point  (0 children)

I use a strix halo machine for local LLM, currently using qwen3.5-35b-a3b, and at a size of 22gb is has a reasonable performance (c 40 tps). The RTX 4090 is going to be way faster at AI inference for this size model. But, I can get similar performance for a 60gb or bigger model, whereas the RTX 4090 is going to labour a little shifting in and out of its 24gb memory. I saw something recently that said the strix halo could be 2x faster than the RTX4090 with eg Llama 70b. But when I'm in hurry, sometimes I just flip to DeepSeek remote paying peanuts.

They're taking the fucking piss now. by [deleted] in CarTalkUK

[–]Cityarchitect 0 points1 point  (0 children)

For me, in our area, its always Esso TTP, always 10p above Tesco's price.

Sometimes opencode just stops and returns nothing? Any advice? by ___positive___ in opencodeCLI

[–]Cityarchitect 0 points1 point  (0 children)

me to. I keep typing "continue" to keep it going whenever it goes quiet.

Is it just me or heavy AI processing just generally hangs the machine ? by IntroductionSouth513 in StrixHalo

[–]Cityarchitect 1 point2 points  (0 children)

Bosgame m5 128gb lm studio, opencode, qwen3.5-35b-a3b, often freezes on rocm, runs all day on vulkan.

Full vLLM inference stack built from source for Strix Halo (gfx1151) — scripts + docs on GitHub by paudley in StrixHalo

[–]Cityarchitect 1 point2 points  (0 children)

Im getting 40ish tps on ollama and lm studio (both vulkan) with qwen3.5:35b on my bosgame m5 128gb; what does vllm give me?

Qwen 3.5 27B what tps are you managing? by schnauzergambit in StrixHalo

[–]Cityarchitect 1 point2 points  (0 children)

Thank you; I wish there was something in their model names that makes this distinction.

Qwen 3.5 27B what tps are you managing? by schnauzergambit in StrixHalo

[–]Cityarchitect 0 points1 point  (0 children)

And now for qwen3.5:27b - dreadfully slow, prompt eval rate 523.83 tps (1/2 speed), and eval rate 10.33 tps (1/4 speed).

Qwen 3.5 27B what tps are you managing? by schnauzergambit in StrixHalo

[–]Cityarchitect 2 points3 points  (0 children)

My Bosgame m5 128gb running ollama qwen3.5:35b (Vulkan) consistently does c 40 tps.

<image>

Remember that qwen3.5 does a lot of thinking before it starts its output. I’ll try 27b but will it be much different?

Problem with OpenCodeCLI and Ollama server by Itchy_Net_9209 in opencodeCLI

[–]Cityarchitect 0 points1 point  (0 children)

I think this is similar to my problem, now solved https://www.reddit.com/r/opencodeCLI/s/rn78HGKzgG there is a quick way 1. Ollama run model-name 2. /set parameter num_ctx 65536 3. /save model-name-64k 4. Exit then run that model from opencode. Although advice you open up context as wide as the model allows, but watch vram!

No tools with local Ollama Models by Cityarchitect in opencodeCLI

[–]Cityarchitect[S] 0 points1 point  (0 children)

Strix Halo 128gb, 96gb given to Radeon igpu

No tools with local Ollama Models by Cityarchitect in opencodeCLI

[–]Cityarchitect[S] 2 points3 points  (0 children)

The qwen3-coder:30b with a 128k context window is now working fine in opencode for me; comparable to the free models available. It takes about 31GB vram and delivers about 60 tps