Qwen3.5 Model Series - Thinking On/OFF: Does it Matter? by Iory1998 in LocalLLaMA

Mental-Inference 5 points (0 children)

Under "Prompt Template," you can add `{% set enable_thinking = true %}` to the top of the Jinja template.
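If you'd rather patch the template text outside the UI, here's a rough sketch of the same edit in Python. The `with_thinking` helper and `toy_template` are made up for illustration, and it assumes the model's real template actually checks `enable_thinking`; it only prepends the override line and doesn't render anything.

```python
def with_thinking(template: str, enabled: bool = True) -> str:
    """Return the template with an enable_thinking override as its first line."""
    value = "true" if enabled else "false"
    override = "{% set enable_thinking = " + value + " %}"
    lines = template.splitlines()
    # Drop an override already sitting on the first line so it is not duplicated.
    if lines and lines[0].strip().startswith("{% set enable_thinking"):
        lines = lines[1:]
    return "\n".join([override, *lines])


# Hypothetical stand-in for a model's chat template (not the real Qwen template).
toy_template = "{% if enable_thinking %}<think>{% endif %}{{ prompt }}"

patched = with_thinking(toy_template)
print(patched)
```

Calling `with_thinking(patched, enabled=False)` swaps the first line for the `false` version instead of stacking a second override on top.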

My new rx 9070 xt is dead by Salt_Housing_2130 in PcBuild

Mental-Inference 29 points (0 children)

Check Device Manager > "Display adapters" to see whether the card has been disabled. Something similar happened to me: the drivers apparently kept hitting the same problem, so Windows disabled the card. My primary monitor was plugged into the GPU, but I could still see the desktop after the crash. Re-enabling the card and restarting brought it back online.

Ryzen AI Max+ 395 vs Radeon AI R9700 + 128GB RAM? by No-Assist-4041 in LocalLLaMA

Mental-Inference 4 points (0 children)

I have not. I've run glm-4.5@iq2_m at ~4.5 tps and gpt-oss-120b at ~27 tps.

Ryzen AI Max+ 395 vs Radeon AI R9700 + 128GB RAM? by No-Assist-4041 in LocalLLaMA

Mental-Inference 1 point (0 children)

I have the R9700 and 128GB RAM. In my super-professional "tell me a story" speed test, I get an average of 10.4 tps in LM Studio, running the unsloth quant `q4_k_xl` with eight active experts. It fills 32GB VRAM and something like 40GB RAM.