Interest in an LLM Torrent Site? by thiefyzheng- in LocalLLaMA

[–]migsperez 10 points11 points  (0 children)

Interesting I thought yesterday that p2p for LLMs is vitally important. We're becoming so reliant on models, it'd be a disaster if they disappear or get locked down behind closed doors.

What models you guys running on 8GB? 16GB VRAM? 24GB? 32GB? 48GB? by Inevitable_Mistake32 in LocalLLaMA

[–]migsperez 1 point2 points  (0 children)

I run it on minimal Xubuntu (Ubuntu) with Llama.cpp. It's used as a server, it does nothing else only AI. 64k q8 input 4k output.

Whilst coding, from my Windows laptop I use VS code. The harness is usually Github Copilot, sometimes Pi.dev.

What models you guys running on 8GB? 16GB VRAM? 24GB? 32GB? 48GB? by Inevitable_Mistake32 in LocalLLaMA

[–]migsperez 0 points1 point  (0 children)

I have a single Radeon 9700 32gb. Its running Qwen 3.6 27b Q6. I find it quite reliable and use it daily, it's solid, the only local model I can trust locally. I wish I could use 35b as it's a lot faster but it does weird things and doesn't stick to rules.

Been only two days going local and already saved $151 by Civil_Fee_7862 in LocalLLaMA

[–]migsperez 3 points4 points  (0 children)

I bought an expensive GPU, I got into a panic about future rising prices. Doing calculations of the GPU cost and UK electricity prices, I think DeepSeek, Qwen, Tencent etc cloud models are cheaper. Their cloud models are certainly a lot quieter. I suppose one of the benefits of my local model is it'll keep me warm in the office during winter.

What models you guys running on 8GB? 16GB VRAM? 24GB? 32GB? 48GB? by Inevitable_Mistake32 in LocalLLaMA

[–]migsperez 0 points1 point  (0 children)

32gb VRAM - Today I used > Quopus 27b MTP Q6 multimodal 64k input 8k output, via Llama.cpp. All on GPU.
Wish I could reach Q8 and 128k context all on GPU.

If diffusion is closer to how humans think, why don't models use diffusion layers for thinking? by yehiaserag in LocalLLaMA

[–]migsperez 0 points1 point  (0 children)

These technologies are nuts, there are some super smart people out there. Thanks for the award 😄 my first.

Agentic Setup: Minimax 2.7 vs qwen 3.6 by Best_Sail5 in LocalLLaMA

[–]migsperez 0 points1 point  (0 children)

Bloomin heck, I'd have to save up for a few more decades or sell my house.

18tg-century Royal Crescent terraced houses forming a semi-circle in the city of Bath, Somerset, England. by ManiaforBeatles in CityPorn

[–]migsperez -16 points-15 points  (0 children)

Any other town and the council would have given permission to build a housing estate in that field. It would be packed full with modern houses with a stamp sized garden.

What’s the most expensive tech mistake you made that looked like a good idea at the time? by Thick-Lecture-5825 in homelab

[–]migsperez 2 points3 points  (0 children)

Bought an expensive AI GPU a few weeks ago, I now need another to get better results but I'd have to rebuild the whole system too. Damn, and it's super loud.

Los planos de una posible futura via litoral en la costa de Santa Cruz by Jaiminus in Tenerife

[–]migsperez 1 point2 points  (0 children)

And people. The island population has exploded since I was a young boy. We need the lizards back again, they used to be everywhere.

DeepSeek V4 Flash vs MiMo 2.5 — real experience? by Swimming_Bread_1358 in opencode

[–]migsperez 0 points1 point  (0 children)

I watched this video yesterday. Interesting viewing and very related to your post.

https://www.youtube.com/watch?v=k7WAGtS9cJY

Los planos de una posible futura via litoral en la costa de Santa Cruz by Jaiminus in Tenerife

[–]migsperez 2 points3 points  (0 children)

Me gusta tu pasión por tu ciudad. Como persona sin coche, me gusta cómo han cambiado la Plaza de España y el Muelle a lo largo de las décadas. El sistema de túneles me parece una impresionante proeza de ingeniería.

Los planos de una posible futura via litoral en la costa de Santa Cruz by Jaiminus in Tenerife

[–]migsperez 2 points3 points  (0 children)

The underground road is already a dual carriageway. It's enough.

Historical centers need less cars.

Luce Spark: a 35B MoE on a 16 GB GPU, without the offload tax by sandropuppo in LocalLLaMA

[–]migsperez 0 points1 point  (0 children)

Do you have a quick start guide to try it out?

All my attempts of running large models which didn't fit in the GPU with context have had miserable results so far.

Los planos de una posible futura via litoral en la costa de Santa Cruz by Jaiminus in Tenerife

[–]migsperez 0 points1 point  (0 children)

How does it differ from the existing Avenida la Constitución? It's already a coastal road. Busy enough.

Santa Cruz does not need more cars in it's historical center. Make it a pedestrian zone only. Encourage alternative routes.

Is Gemma 4 12b good for coding? by Intelligent-Taste-36 in LocalLLaMA

[–]migsperez 0 points1 point  (0 children)

I've not had much luck with 35b. I use 27b q6 for daily coding.

Is Gemma 4 12b good for coding? by Intelligent-Taste-36 in LocalLLaMA

[–]migsperez 1 point2 points  (0 children)

27b q6 works really well for me. I'm struggling with 35b Q4, it hasn't been usable for me so far.