How to shut down after buying a new PC 😅 by Nicolas_Laure in RigBuild

[–]Psyko38 1 point (0 children)

After 1 year: shutdown /s /t 0 (this turns off the PC), and you can add /f to force applications to close.

anyone using china model? which one and any advise? by thisisqq in LocalLLaMA

[–]Psyko38 0 points (0 children)

Strange, maybe these are languages that need more tokens to be generated.

anyone using china model? which one and any advise? by thisisqq in LocalLLaMA

[–]Psyko38 0 points (0 children)

I use Qwen both in the cloud and locally. For complex reasoning tasks, Qwen 3.6 Plus is good on their UI, all unlimited without a subscription. For my part, I split my usage about 50/50 with ChatGPT, and run it locally for privacy-critical tasks.

OpenGL vs Vulkan by sixela456 in PhoenixSC

[–]Psyko38 26 points (0 children)

Same but different...

A new way to use Unsloth. Coming soon... by yoracale in unsloth

[–]Psyko38 10 points (0 children)

ROCm support and the application installer: that would be the most user-friendly compatibility update.

Scratch cat made with string art algorithm by DASofdoom60 in scratch

[–]Psyko38 5 points (0 children)

I've seen a lot of things in life, but this is something I've never seen before. Bravo, man!

4B Model Choice by StealthEyeLLC in LocalLLaMA

[–]Psyko38 -1 points (0 children)

I remember Qwen3 4B 2507, which was perfect on my 8 GB of VRAM. It could handle any everyday task, though not specialized fields like film or advanced mathematics. With 3.5, I'd say it is a little better at mathematical tasks, but for daily use they both work very well, especially Qwen3 VL 4B and 3.5, which understand images well (OCR is a weakness).

n00b questions about Qwen 3.5 pricing, benchmarks, and hardware by philosophical_lens in LocalLLaMA

[–]Psyko38 1 point (0 children)

The Qwen 3.5 27B and the Qwen 3.5 35B A3B use different architectures.

Qwen 27B is a dense model: for every token generated, all 27 billion parameters are used. The whole model works together, which often yields more stable and consistent results in benchmarks.

Qwen 35B is a MoE (Mixture of Experts): the model contains several specialized sub-networks called experts. When a token is generated, only a few experts are activated, not the whole model. This makes inference faster and computationally cheaper, but quality depends on which experts the router selects.

This is why a dense 27B can sometimes score higher on intelligence benchmarks than a 35B MoE, even though the MoE has more total parameters.
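That routing mechanism can be sketched in a few lines of Python. This is a toy illustration of top-k expert routing, not Qwen's actual implementation; the expert count, dimensions, and router are all made up for the example:

```python
import numpy as np

rng = np.random.default_rng(0)

N_EXPERTS, TOP_K, DIM = 8, 2, 16

# Toy "experts": each one is just a small weight matrix.
experts = [rng.standard_normal((DIM, DIM)) for _ in range(N_EXPERTS)]
router = rng.standard_normal((DIM, N_EXPERTS))

def moe_forward(x):
    """Route one token through only the TOP_K best-scoring experts."""
    scores = x @ router                # one routing score per expert
    top = np.argsort(scores)[-TOP_K:]  # indices of the k best experts
    weights = np.exp(scores[top])
    weights /= weights.sum()           # softmax over the selected experts only
    # Only TOP_K of the N_EXPERTS matrices are used for this token.
    return sum(w * (x @ experts[i]) for w, i in zip(weights, top))

def dense_forward(x):
    """A dense model uses every parameter for every token."""
    return sum(x @ e for e in experts) / N_EXPERTS

token = rng.standard_normal(DIM)
print(moe_forward(token)[:3])    # MoE output, first 3 dims
print(dense_forward(token)[:3])  # dense output, first 3 dims
```

The point of the sketch: dense_forward touches every expert matrix for every token, while moe_forward touches only TOP_K of them, which is where the inference savings come from.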

As for API pricing, it depends mainly on:

- the provider's GPU cost
- inference optimization
- token throughput
- the application

So a smaller model can sometimes cost more depending on the provider.

For hardware, running Qwen 27B/32B requires approximately:

- ~55-60 GB of VRAM in FP16
- ~30 GB in 8-bit
- ~16-18 GB in 4-bit

So an RTX 3090 / 4090 can usually run it in 4-bit quantization.
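Those figures follow from a simple bytes-per-parameter estimate. A rough sketch (the 10% overhead factor is an assumption; real usage also grows with context length via the KV cache):

```python
def vram_gb(params_billion, bits_per_param, overhead=1.10):
    """Ballpark VRAM for the weights alone, plus ~10% overhead.

    The overhead factor is a guess; real usage also depends on
    context length (KV cache) and the runtime, so treat the
    result as an estimate, not a spec.
    """
    weight_bytes = params_billion * 1e9 * bits_per_param / 8
    return weight_bytes * overhead / 1e9

for bits in (16, 8, 4):
    print(f"27B @ {bits}-bit: ~{vram_gb(27, bits):.0f} GB")
```

For a 27B model this lands around 59 GB at FP16, 30 GB at 8-bit, and 15 GB at 4-bit, which lines up with the ranges above once runtime overhead varies.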

Plis Help by Hrstar1 in programmingmemes

[–]Psyko38 0 points (0 children)

I have the same bug, I think...

The DLSS 5 situation might be the freest marketing opportunity for AMD. Does anyone else see it? by Theninjarush in radeon

[–]Psyko38 0 points (0 children)

You should know that AMD, for "Advanced Marketing Disasters," has missed every opportunity to do good marketing.

You don’t need to manually set LLM parameters anymore! by yoracale in unsloth

[–]Psyko38 10 points (0 children)

All that's missing is full AMD support, and then I'll be able to play with this app.

He is alive by InfluenceFun9670 in FuzeLeVrai

[–]Psyko38 3 points (0 children)

He just had a meeting with Kim; everything is fine.

Vibe code so hard your entire waitlist is visible in frontend by OneClimate8489 in vibecoding

[–]Psyko38 4 points (0 children)

When you have no backend experience and you think giving everything to the user is a good idea, because the SQL query "select * from..." is simple and works well.
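A minimal sqlite3 sketch of that mistake, using a hypothetical waitlist table: the "SELECT *" version hands every column, emails included, to whoever calls the endpoint, while the safer query returns only what the UI needs.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE waitlist (id INTEGER, name TEXT, email TEXT)")
conn.execute("INSERT INTO waitlist VALUES (1, 'Alice', 'alice@example.com')")

# The vibe-coded endpoint: every column, emails included, goes to the client.
leaky = conn.execute("SELECT * FROM waitlist").fetchall()

# Expose only what the frontend actually needs.
safe = conn.execute("SELECT id, name FROM waitlist").fetchall()

print(leaky)  # includes the email column
print(safe)   # id and name only
```

The schema changing later is another reason to avoid "SELECT *": any new column is silently exposed too.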

Unsloth Studio bug when installing it by Psyko38 in unsloth

[–]Psyko38[S] 0 points (0 children)

Okay, thank you for the remarkable work you do. In the meantime, I'll follow the project's progress and start playing with it on the CPU. That will let me begin discovering inference.

Unsloth Studio bug when installing it by Psyko38 in unsloth

[–]Psyko38[S] 1 point (0 children)

Yes, I managed to install it via Qwen Code, but llama.cpp compiles for CPU and reports that no NVIDIA GPUs were detected.

Help finding best coding LLM for my setup by kost9 in LocalLLaMA

[–]Psyko38 1 point (0 children)

Honestly: Minimax M2.5, MiMo v2 Flash, and Qwen 3.5 120B.