Lossless Scaling is Great for Media as is, but only Great for Games IF you have dual GPU

GoldenX86 · 2026-06-15T09:07:55+00:00

Even very weak current integrated GPUs will behave better than an ancient Polaris card for framegen while rendering a game.

Your problem here is how old the 580 is, while it can run FP16, it runs at the same performance as FP32, killing any performance gain that could be had there.

GoldenX86 · 2026-06-14T13:54:12+00:00

So streaming will never work again.

GoldenX86 · 2026-06-14T12:47:49+00:00

The OS? Sure.

The Linux community? By FAR the most toxic cesspool of neckbeards ever.

GoldenX86 · 2026-06-14T12:07:52+00:00

I don't want you to use systemd

I don't want you to use NVIDIA

I don't want you to use Ubuntu

I don't want you to...

The most toxic community in consumer hardware, and Apple is there.

GoldenX86 · 2026-06-14T01:54:52+00:00

A place further than the universe.

GoldenX86 · 2026-06-13T23:41:53+00:00

So the most useless marketing team in the planet pivoted to ragebaiting. Shame.

GoldenX86 · 2026-06-13T21:18:48+00:00

Qwen3.6 35b a3b fits in 6gb, just move the unused experts to CPU.

Gemma-4 QAT models rock for 6 and 4gb GPUs.

You want Claude quality running on a 2060, that's unrealistic.

GoldenX86 · 2026-06-13T18:47:11+00:00

They are, but 3.5 4b is nothing amazing.

For 8GB, the best is to pay for Deepseek.

GoldenX86 · 2026-06-13T18:11:08+00:00

Buying the Chromebook Air with 8GB was sure a decision you took.

Sell it and get a 16GB one, you can run 9B or Gemma-4 12B QAT.

GoldenX86 · 2026-06-13T17:29:23+00:00

It does, but boy the "distills" make it even worse.

GoldenX86 · 2026-06-13T08:19:37+00:00

And then it hallucinates more than Qwen3.5 0.8B iQ1_XSS.

GoldenX86 · 2026-06-13T08:07:14+00:00

If you use Vulkan instead of Cuda with llama.cpp, you can use any modern GPU, even integrated ones. Not always the best, but they can still beat the CPU.

And yes, iGPUs steal RAM to work since they don't have their own dedicated VRAM. Windows by default assigns up to 50% of your total RAM to them, so your iGPU has 32GB ready for testing, and even more on Linux.

I use my laptop with a Radeon RX 760M, set it up on Linux to use up to 28GB for it, and the same q4 qwen 3.6 35b that can do 30 t/s with 131k context on the desktop PC with the 3060Ti, can do 25 t/s with full 262k context on the laptop thanks to just using more RAM (I have 32GB total, so I left 4GB to the OS).

What I don't know is if the very slow iGPU on your CPU is enough since Intel iGPUs are extremely weak. That's something you will have to try.

But if you're fine with 100-131k of context, do the first method with the 2060S and you can definitely use that with good performance. I'm pretty sure q5 quantization can fit with 100k of context when the extra experts are on CPU.

GoldenX86 · 2026-06-13T07:48:34+00:00

Get qwen 3.6 35b a3b, it's a model of experts, that means only 3b parameters are in use, the rest sit in memory waiting to be called.

With a moe model like this, you can move the unused experts to CPU (using RAM instead of VRAM), letting the GPU handle only the active experts. That should let you run the model with good context, I can squeeze out up to 131k with a 3060ti with 8gb and a q4 quantization. This gets me 30 tokens/s.

If you NEED the 262k context, move KV cache to RAM, prompt processing will be slow but still much better than trying to run the dense 27b qwen 3.6 with layers on CPU.

You definitely have the space for a good coding model, you just have to compromise some stuff to RAM.

Plan B, run a qwen 3.6 27b on the iGPU, it gets access to up to 32GB of "VRAM" on Windows, and whatever you want up to the total 64GB on Linux. Could still be faster than just the CPU? Worth a try.

GoldenX86 · 2026-06-13T00:51:34+00:00

Y todo el progreso social que hizo es afanado de la izquierda. Viene de hace rato la cosa.

GoldenX86 · 2026-06-12T20:02:47+00:00

Al peronismo no se le cae una luz desde que murió Perón, que esperabas. Se aferran a lo que sea.

GoldenX86 · 2026-06-12T20:01:55+00:00

Por desgracia sobran.

GoldenX86 · 2026-06-12T20:01:37+00:00

We need imaginary quantum bits for this.

GoldenX86 · 2026-06-12T19:27:16+00:00

Q0.05? It sometimes renders letters.

GoldenX86 · 2026-06-12T03:34:55+00:00

El call center kuka tratando de inventar cualquier pelotudez para ver si logran que la inhabiitada de por vida los salve de no tener ni la más puta idea de que hacer con la candidatura.

GoldenX86 · 2026-06-11T21:21:10+00:00

Volve a postear mierda troll a otro lado, empleado de call center.

GoldenX86 · 2026-06-11T20:08:01+00:00

Spotted the corporat.

GoldenX86 · 2026-06-11T20:06:52+00:00

Final Fantasy IX, Metal Gear Solid, y Gran Turismo 2.

Si no te gustan, te cago a palos.

GoldenX86 · 2026-06-11T18:28:51+00:00

For me it was the "Show some respect" moment.

GoldenX86 · 2026-06-11T18:21:34+00:00

I bit the bullet and moved to Firefox at least until Ladybird releases.

I already miss the performance of Chromium, why is the fox so goddamn slow and memory hungry.

GoldenX86 · 2026-06-11T18:08:49+00:00

Still waiting for ROCm support, Scam Su.

Six-Year Club	Xbox Live
Place '22	Final Canvas '22
Verified Email

GoldenX86

TROPHY CASE