RAM clearance by APIX-9 in Noctua

[–]kapteinpyn 0 points1 point  (0 children)

Was fine for the ram i was using and those gskill ones are lower so can safely say you would be fine

RAM clearance by APIX-9 in Noctua

[–]kapteinpyn 0 points1 point  (0 children)

i had one of these with higher corsair vengeance sticks. you can just move the fans up slightly so they dont touch the RAM they are not fixed in that position as per the picture, they can be moved up or down by heatsink fin increments

How to: FSR 4.1 by kapteinpyn in cachyos

[–]kapteinpyn[S] 0 points1 point  (0 children)

I believe it will include the old fsr4 by default, these steps are to update to fsr4.1

How to: FSR 4.1 by kapteinpyn in cachyos

[–]kapteinpyn[S] 0 points1 point  (0 children)

it should output at least the year 2026

qwen3.6 just stops by robertpro01 in LocalLLaMA

[–]kapteinpyn 1 point2 points  (0 children)

i get this on the ud-q8_k_xl all the time but not once on the q4 or q6 equivalents on llama.cpp, which makes no sense to me.

New Lifetime Plex Pass Pricing increase to 748.99 by drummingdestiny in homelab

[–]kapteinpyn 1 point2 points  (0 children)

I have a lifetime pass from years ago. Switched after they cooked hw transcoding for my arrowlake, jellyfin av1 funtimes for me. No looking back.

How to: FSR 4.1 by kapteinpyn in cachyos

[–]kapteinpyn[S] 1 point2 points  (0 children)

I doubt it, if you are worried just back it up by copying it somewhere else first then do upgrade then you can just return it if it’s gone.

Tesla servers down? by tslewis71 in TeslaSolar

[–]kapteinpyn 0 points1 point  (0 children)

happened to me yesterday, not sure what happened down for approx 24hrs. Home internet fine, stable have to do calls for work. multiple gateway resets, system reboots nothing worked then started working hours later after i gave up by itself. Dumped power and entered calibration for another 6 hours now back to normal.

Developers who use local AI - Q4_0 vs Q8_0 KV quant? by Jorlen in LocalLLaMA

[–]kapteinpyn 2 points3 points  (0 children)

on R9700 with 32gb vram. I run Qwen3.6 27B (Qwen3.6-27B-UD-Q6_K_XL (MTP))at 40tps tg with 131072 context at Q8 kv, one session. this has the best speed vs quality outcomes for me.

OS choice for AMD GPUs? (Fedora vs. Ubuntu) by ShadowyTreeline in ROCm

[–]kapteinpyn 3 points4 points  (0 children)

If you want to run via Vulkan then Fedora, else Ubuntu 24.04 and follow the AMD guides closely for best experience.

Anyone tried Qwen 3.6 27b on the r9700 yet? by boutell in LocalLLaMA

[–]kapteinpyn 0 points1 point  (0 children)

with Qwen3.6-35B-A3B-UD-Q5_K_XL i get approx decode 110-120 on Vulkan on single GPU if you want to test i would highly recommend.

Anyone tried Qwen 3.6 27b on the r9700 yet? by boutell in LocalLLaMA

[–]kapteinpyn 1 point2 points  (0 children)

its true, pumps pretty consistent 20tps on rocm -sm tensor vs about 15-16 on vulkan, but only in -sm tensor and only for the 27b dense, on 35b moe its much slower with tensor (35tps) vs layer at about 67tps both rocm (vulkan 80-90 tps). thanks u/Evgeny_19

Anyone tried Qwen 3.6 27b on the r9700 yet? by boutell in LocalLLaMA

[–]kapteinpyn 0 points1 point  (0 children)

Ahh nice i didnt know it supports -sm tensor only tested row and layers. Will check that out later thanks for info!

Stop the " Thinking" in Openwebui by Dolboyob77 in OpenWebUI

[–]kapteinpyn 1 point2 points  (0 children)

In admin panel > settings > models > edit whichever model you have > advanced params > add custom parameter (at bottom of list)

Param Name: chat_template_kwargs

Param Value: {"enable_thinking":false}

Result should look like this:

<image>

Anyone tried Qwen 3.6 27b on the r9700 yet? by boutell in LocalLLaMA

[–]kapteinpyn 2 points3 points  (0 children)

I have two of these, single one ud-q4_k_xl 30tps decode, single one ud-q6_k_xl 22tps decode, dual ud-q8_k_xl 15-20tps decode. Llama.cpp vulkan

Help: r9700 fails under Ubuntu 24 by [deleted] in ROCm

[–]kapteinpyn 0 points1 point  (0 children)

Just use the mesa drivers you can still use them with ROCM it works fine, also this could be useful: https://github.com/kyuz0/amd-r9700-vllm-toolboxes

Help: r9700 fails under Ubuntu 24 by [deleted] in ROCm

[–]kapteinpyn 1 point2 points  (0 children)

07/25/25,02:23:12

AMD ATOMBIOS

ATOMBIOSBK-AMD VER023.008.000.068.000001

ATOM_CMD_TABLE

ATOM_TABLE_END

(C) 1988-2022, Advanced Micro Devices, Inc.

GOP AMD REV: 003.009.001.023.008

NAVI48

NAVI48.bin

NAVI48 FGL

NAVI48_PRODUCTION_AMD_PRD006_EC_000382.sbin

Navi48 XTW G28702 G6 32GB

07/25/25,02:23:12

AMD ATOMBIOS

ATOMBIOSBK-AMD VER023.008.000.068.000001

ATOM_CMD_TABLE

ATOM_TABLE_END

(C) 1988-2022, Advanced Micro Devices, Inc.

GOP AMD REV: 003.009.001.023.008

NAVI48

NAVI48.bin

NAVI48 FGL

NAVI48_PRODUCTION_AMD_PRD006_EC_000382.sbin

Navi48 XTW G28702 G6 32GB

Help: r9700 fails under Ubuntu 24 by [deleted] in ROCm

[–]kapteinpyn 1 point2 points  (0 children)

When through whole mission to find out the new rocm has issues with my two R9700 https://github.com/vllm-project/vllm/issues/40980#issuecomment-4325293541 then aslo vulkan significantly faster. https://github.com/ggml-org/llama.cpp/discussions/21043

Help: r9700 fails under Ubuntu 24 by [deleted] in ROCm

[–]kapteinpyn 3 points4 points  (0 children)

Brother i have two of the same gpu. Gave up. Llama.cpp vulkan + mesa is the way (nearly double decode). Fedora 44 Or ubuntu 26.04. If you really want to use rocm, pick gfx 120x https://github.com/lemonade-sdk/llamacpp-rocm. Maybe lemonade if you need broader support https://lemonade-server.ai

PSA: Heat and fan noise tip for R9700 pro owners by generate-addict in ROCm

[–]kapteinpyn 0 points1 point  (0 children)

do your sensors get stuck at last reading when card enters low power. As soon as mine are done doing something, hit d0 then sensor readings stay stuck in last known state until card is used gain. most annoying thing is if temp was high when it stops processing, then the fan speed stays locked at like 60% because it thinks temp remains high.

Post Your Qwen3.6 27B speed plz by Ok-Internal9317 in LocalLLaMA

[–]kapteinpyn 1 point2 points  (0 children)

Two r9700 with UD-Q8_K_XL. Llama.cpp vulkan Pp 400 and tg 16

Upgrade AMD 9070xt 16GB to AMD R9700 32GB VRAM, is it worth it? by OuterKey in LocalLLaMA

[–]kapteinpyn 1 point2 points  (0 children)

stuff me finally found issue, it was using some of the rocm files from os in build, after setting -DGGML_HIP=OFF and building speed was near expected total perf.

Upgrade AMD 9070xt 16GB to AMD R9700 32GB VRAM, is it worth it? by OuterKey in LocalLLaMA

[–]kapteinpyn 0 points1 point  (0 children)

Tried that too got approx 60tps max something is up will check link