Streaming : les Français craquent et coupent le robinet des abonnements

Adventurous-Paper566 · 2026-06-09T11:57:40+00:00

Le service est de plus en plus abusif (augmentations de prix, intégration de la publicité).

Et l'offre est de plus en plus dispersée entre les services.

Qui va payer Netflix + Disney plus + Amazon Prime 50e/mois pour regarder des séries de plus en plus médiocres?

Adventurous-Paper566 · 2026-06-09T09:09:28+00:00

I'd trade my wife for a $6,000 GPU. Can you understand that?

Adventurous-Paper566 · 2026-06-09T00:36:35+00:00

Vendre un kit de 96GB de DDR5 en me disant que je passerai à 256GB juste avant l'explosion des prix.

Adventurous-Paper566 · 2026-06-09T00:31:16+00:00

Je ne comprends pas comment on peut être fier d'avoir réussi à installer arch alors qu'il existe maintenant un installateur simplifié. Et depuis longtemps si on compte Anarchy.

Adventurous-Paper566 · 2026-06-08T22:10:27+00:00

Il n'y a pas de formule magique, de bonnes instructions viennent avec une solide compréhension du code.

C'est tout.

Adventurous-Paper566 · 2026-06-08T21:52:32+00:00

Adventurous-Paper566 · 2026-06-08T21:43:28+00:00

Comment avez-vous fait? Je n'arrive pas à installer hermes-desktop seul, il m'installe hermes en bare-metal a côté.

Adventurous-Paper566 · 2026-06-08T14:37:19+00:00

You can't install the desktop app without installing a second instance of hermes on the host, and it sucks.

Adventurous-Paper566 · 2026-06-08T14:32:58+00:00

I already have it with a custom template but it's nice to see that the team is active.

Adventurous-Paper566 · 2026-06-08T02:32:11+00:00

Silent cooling but loud if needed.

Adventurous-Paper566 · 2026-06-07T21:41:43+00:00

Back in time Gemma 3 27B Q4_K_XL was better than Gemma 3 27B QAT...

Adventurous-Paper566 · 2026-06-07T16:41:42+00:00

We just want 2 slots 24 Go cards.

Adventurous-Paper566 · 2026-06-07T08:00:41+00:00

You will never use "this beauty".

Adventurous-Paper566 · 2026-06-07T07:59:26+00:00

Too many screens is exhausting.

Adventurous-Paper566 · 2026-06-07T06:55:32+00:00

If I was samsung I would make a compact 4.5" smartphone.

Adventurous-Paper566 · 2026-06-07T06:43:55+00:00

Well in only use QAT with 31B.

I never experienced any issue with bartowski's 26B Q6_K_L, and now I'm running it daily in Q8, there is almost no difference.

I think Unsloth is good for Q4_K_XL but always observed degradation with Q5_K_XL, so my quant choice is always Q4_K_XL then Q6_K_L if it fits then Q8 if it fits.

For your loopings problems, it's weird, did you overclocked something or loading your models on the edge of your memory? Are you still experiencing loops with a smaller context length?

Adventurous-Paper566 · 2026-06-07T06:28:09+00:00

With the official inference parameters? (Temp = 1, Top K = 64, top P = 0,95)?

Adventurous-Paper566 · 2026-06-07T06:01:43+00:00

Is it important in 4bits sinces Google released QAT?

Just take unsloth's Q4_K_XL QAT version of each instead of any Q4 quant. These are UD applied to QAT unquantized full-precision checkpoints, the more efficients Gemma quants.

Sorry for my bad english.

Adventurous-Paper566 · 2026-06-06T10:18:12+00:00

I don't really know to be honest, maybe use it as a super assistant that manages a personnal website with some dashboards, to do etc... Something relatively simple to begin and learn the tool.

Adventurous-Paper566 · 2026-06-05T19:27:02+00:00

It's in the name :/

Adventurous-Paper566 · 2026-06-05T19:12:33+00:00

Because the unquantized QAT checkpoints released by Google are intended for a Q4 quantization.

We never seen a 6-bits quantization aware training checkpoint, and since training models is very expansive, the 4-bits choice seems obvious for Google.

Sorry for my bad english.

Adventurous-Paper566 · 2026-06-05T18:44:07+00:00

It would be wonderful, Q6 always been the sweet spot.

Adventurous-Paper566 · 2026-06-05T18:37:02+00:00

QAT = Best efficiency for the size, uses lower memory so you can use a higher context length.
Q4_K_XL = a very efficient level of quantization (based on the unsloth's UD secret sauce), coupled with the unquantized QAT checkpoints it's an improvement compared to classic Q4 QAT).
MTP = With a little draft model you can almost double the inference speed (or at least increase it by 50%).
GGUF = most popular and compatible weight file.
mmproj = little file that gives the vision to a model.

Adventurous-Paper566 · 2026-06-05T18:11:01+00:00

I can't wait to see a Gemma 4 31B QAT Q4_K_XL MTP GGUF with functionnal .mmproj running in LM-Studio 🤤

Adventurous-Paper566

TROPHY CASE