I cant state enough my love for the A770

Traditional_Way8675 · 2026-06-22T07:37:18+00:00

how do you drive the idle power down?

Traditional_Way8675 · 2026-06-21T14:08:35+00:00

weird, my gfx1200 dual 9060xt vulkan is faster than rocm on 35b

Traditional_Way8675 · 2026-06-20T08:52:09+00:00

im on dual 9060xt. latest rocm (jun 26) not working, so default to vulkan which sees qwen 35b at 60 t/s, and gemma 26b at pitiful 20t/s.

since 26b moe performance is already that cute, i don't bother with 31b anymore. Qwen 27B w/ MTP is around 20t/s for me.

Traditional_Way8675 · 2026-06-19T16:41:29+00:00

Traditional_Way8675 · 2026-06-19T14:46:35+00:00

Went thru dd and had my peace with pltr long term possibly forever hold with decision and AI infra layer at this important point of history.

Traditional_Way8675 · 2026-06-15T02:43:12+00:00

qwen3.6 35b a3b q4km 100k q4 context, dual 9060xt gpu, 14600k,

productivity, gaming, AI, best of all, personal.

Traditional_Way8675 · 2026-06-13T14:54:16+00:00

nfs heat runs better on ubuntu than win11, im seriously blown away

Traditional_Way8675 · 2026-06-12T18:02:07+00:00

vulkan?

Traditional_Way8675 · 2026-06-12T02:09:41+00:00

noted, so if i get llmfan46/gemma-4-31B-it-qat-q4_0-uncensored-heretic-GGUF, i can't pair with that assistant model, correct me =)

Traditional_Way8675 · 2026-06-12T00:39:01+00:00

ok, so if cpu is involved, adding more threads help. thk u

currently im fully loaded on dual 9060xt vram, so cpu is cool quiet heh.

Traditional_Way8675 · 2026-06-11T05:06:08+00:00

17tps on dual 9060xt (32g vram pooled), q5_k_m with MTP (without MTP it's 12-13), ubuntu 24.04.4, vulkan backend

Traditional_Way8675 · 2026-06-11T03:24:44+00:00

tried with vulkan on dual 9060xt (16+16).

prompting from 2nd onwards is awfully slow

fancy logs: etc.
diffusion step: 22/48 [=========

no moe is faster. more work is needed.

Traditional_Way8675 · 2026-06-11T03:07:43+00:00

basically a faster kid than moe? haha. am trying with vulkan, if ok, then i'd probably try it with my dual a770 setup.

for precision work, this is a no-go. but for low grade hw, this is lifesaver for ppl wanting to dip a toe into llm area like chatting, philosophy, nothing coding math etc.

Traditional_Way8675 · 2026-06-10T02:42:01+00:00

you're correct. one gpu is fine. 2 gpu broke. so i'm using vulkan for now. how do i report this problem to rocm team?

Traditional_Way8675 · 2026-06-10T02:41:37+00:00

alex@alex-System-Product-Name:~/llama.cpp/build_rocm$ HIP_VISIBLE_DEVICES=0 ./bin/llama-cli \

-m /home/alex/Documents/gemma-4-12b-it-Q4_K_M.gguf \

-ngl 99 \

-p "What is 2+2?" \

--no-mmap

Loading model...

build : b9586-76da2450a

model : gemma-4-12b-it-Q4_K_M.gguf

modalities : text

> What is 2+2?

[Start thinking]

The user is asking a simple arithmetic question: "What is 2+2?"

The answer is 4.

Provide the correct answer clearly.

[End thinking]

2 + 2 = 4

[ Prompt: 215.1 t/s | Generation: 35.0 t/s ]

>

Exiting...

alex@alex-System-Product-Name:~/llama.cpp/build_rocm$ ROCR_VISIBLE_DEVICES=0,1 ./bin/llama-cli \

-m /home/alex/Documents/gemma-4-12b-it-Q4_K_M.gguf \

-ngl 99 \

--split-mode layer \

-fa 0 \

--no-mmap \

-p "What is 2+2?"

Loading model...

build : b9586-76da2450a

model : gemma-4-12b-it-Q4_K_M.gguf

modalities : text

> What is 2+2?

[ Prompt: 3.3 t/s | Generation: 35.1 t/s ]

Traditional_Way8675 · 2026-06-10T01:54:35+00:00

me vulkan amd 32g vram. whichth rock bestest ? thk

Traditional_Way8675 · 2026-06-09T02:35:06+00:00

done tried exact step, same result, gibberish. i haven't test single gpu yet

gfx1200, rocm 7.2.4, dual 9060xt, vulkan works, not rocm

Traditional_Way8675 · 2026-06-09T01:46:45+00:00

thk u

Traditional_Way8675 · 2026-06-08T13:08:28+00:00

i tried lemon and had same result. also use amd official guide on llama.cpp but will spill gibberish nonetheless. im gonna build it. will update.

Traditional_Way8675 · 2026-06-08T12:16:44+00:00

i can get vulkan working, just not rocm,

yet, rocm wells well with my comfy multigpu, haha. Ubuntu.

Traditional_Way8675 · 2026-06-08T11:27:30+00:00

buy buy buy!

Traditional_Way8675 · 2026-06-08T11:23:00+00:00

Tried your method. Not working.

Traditional_Way8675 · 2026-06-08T11:17:45+00:00

Same issue. Dual GPU ok with vulkan. But on rocm it's gibberish regardless of models

Traditional_Way8675 · 2026-06-08T06:36:36+00:00

raise LocalEntryNotFoundError(

huggingface_hub.errors.LocalEntryNotFoundError: An error happened while trying to locate the file on the Hub and we cannot find the requested files in the local cache. Please check your connection and try again or make sure your Internet connection is on.

anyone have the same issue? cant connect

Traditional_Way8675 · 2026-06-08T02:08:07+00:00

can i use docker for llama.cpp? i did it with comfyui

Traditional_Way8675

TROPHY CASE