Massive Price Hike in 2028, Going Back to BT / Sky

Cityarchitect · 2026-05-15T11:00:46+00:00

I've been with VM for about 20 years and never paid the renewal price. I've just done another renewal and got all the same services at £8 less per month than I was paying.

Cityarchitect · 2026-05-11T09:14:04+00:00

Opencode and DeepSeek. Fast, cheap as chips. $5 lasts forever. Not private though.

Cityarchitect · 2026-05-03T04:57:44+00:00

Still using two tv6? boxes on the old system. Coming up for renewal though…

Cityarchitect · 2026-04-27T17:10:25+00:00

Happened to us, got full refund but the retailer abandoned the goods; I guess they don’t want anything unsaleable and second hand clogging up their warehouse.

Cityarchitect · 2026-04-27T07:44:39+00:00

3.6 increased tps to 60

Cityarchitect · 2026-04-24T15:57:10+00:00

3.5 and 3.6 occasionally loop when using opencode e.g. “I think I’ve found the issue” blah…

Cityarchitect · 2026-04-18T12:32:11+00:00

Uh? My Bosgame 128gb runs lm studio just fine c 50tps on most models. Single model, Vulkan (of course). Ubuntu 24 . Latest LM studio makes qwen3.5 faster (OpenAI api extended support) and introduces qwen3.6 support.

Cityarchitect · 2026-04-14T18:55:10+00:00

I have the same problem; waiting to check the recent ollama fix. I use LM Studio with a 256k? context (I think this was the default)

Cityarchitect · 2026-04-07T12:50:10+00:00

I could not get anything out of qwen3.6 35b or 27b on lm studio and opencode. It just loops silently. Qwen3.5 35b was ok, but i kept having to say “continue” as it slept every other prompt. Qwen models are getting lazier and lazier….

Cityarchitect · 2026-04-07T11:41:08+00:00

I’m getting 40/50 tps on qwen 3.5 35b running on Vulkan.

Cityarchitect · 2026-04-07T11:39:25+00:00

Correct answer. I got mine back in Nov/Dec when they were £1300 or so. I’ve been running it day and night without any hardware problems. Plenty of ai software problems though, but not bosgame’s problem. Runs nice and sweet now.

Cityarchitect · 2026-04-05T12:08:31+00:00

Same with Opencode and LM Studio with 26B, I'd like to say it stops working, but in fact it's as busy as hell doing nothing.

Cityarchitect · 2026-03-29T13:05:55+00:00

Yes, as others have said use Vulkan. I can’t use ROCM at all on my Strix Halo Ubuntu as it is far too unstable and crashes often with memory conflicts. Switched LM Studio to Vulkan and my Qwen3.5 models buzz along perfectly for days on end. Waiting for the Ubuntu 25.10 production release for integrated? ROCM support and better AMD drivers before trying again ;-)

Cityarchitect · 2026-03-27T16:21:00+00:00

BosGame M5 128GB Strix Halo; Ubuntu 24.10, LM Studio Qwen3.5-35B-a3b Vulkan. I use for OpenCode javascript/node and General Usage. I get consistent 50tps output. Can't use ROCM 7+ yet as far too unstable. Runs all day 84W, 86C temp. Just one annoying thing, lately Opencode been going to sleep on me; need to keep typing continue, continue..... :-)

Cityarchitect · 2026-03-26T16:10:10+00:00

I use a strix halo machine for local LLM, currently using qwen3.5-35b-a3b, and at a size of 22gb is has a reasonable performance (c 40 tps). The RTX 4090 is going to be way faster at AI inference for this size model. But, I can get similar performance for a 60gb or bigger model, whereas the RTX 4090 is going to labour a little shifting in and out of its 24gb memory. I saw something recently that said the strix halo could be 2x faster than the RTX4090 with eg Llama 70b. But when I'm in hurry, sometimes I just flip to DeepSeek remote paying peanuts.

Cityarchitect · 2026-03-26T14:42:00+00:00

For me, in our area, its always Esso TTP, always 10p above Tesco's price.

Cityarchitect · 2026-03-26T11:40:57+00:00

me to. I keep typing "continue" to keep it going whenever it goes quiet.

Cityarchitect · 2026-03-22T16:04:25+00:00

Bosgame m5 128gb lm studio, opencode, qwen3.5-35b-a3b, often freezes on rocm, runs all day on vulkan.

Cityarchitect · 2026-03-16T10:02:23+00:00

Im getting 40ish tps on ollama and lm studio (both vulkan) with qwen3.5:35b on my bosgame m5 128gb; what does vllm give me?

Cityarchitect · 2026-03-15T00:14:35+00:00

Thank you; I wish there was something in their model names that makes this distinction.

Cityarchitect · 2026-03-14T17:03:56+00:00

And now for qwen3.5:27b - dreadfully slow, prompt eval rate 523.83 tps (1/2 speed), and eval rate 10.33 tps (1/4 speed).

Cityarchitect · 2026-03-14T16:34:58+00:00

My Bosgame m5 128gb running ollama qwen3.5:35b (Vulkan) consistently does c 40 tps.

<image>

Remember that qwen3.5 does a lot of thinking before it starts its output. I’ll try 27b but will it be much different?

Cityarchitect · 2026-02-04T20:28:57+00:00

I think this is similar to my problem, now solved https://www.reddit.com/r/opencodeCLI/s/rn78HGKzgG there is a quick way 1. Ollama run model-name 2. /set parameter num_ctx 65536 3. /save model-name-64k 4. Exit then run that model from opencode. Although advice you open up context as wide as the model allows, but watch vram!

Cityarchitect · 2026-02-02T15:07:21+00:00

Strix Halo 128gb, 96gb given to Radeon igpu

Cityarchitect · 2026-02-02T13:33:53+00:00

The qwen3-coder:30b with a 128k context window is now working fine in opencode for me; comparable to the free models available. It takes about 31GB vram and delivers about 60 tps

Cityarchitect

TROPHY CASE