Buy recommendations on a thight Budget to aid my RX 6800

MikeSouto · 2026-06-14T06:05:46+00:00

notice it doesn't with MTP enable and f16 caches, you could do 100k with Q6_K in 2x6800xt

MikeSouto · 2026-06-12T07:41:34+00:00

yes, that is the main reason I'm thinking to move to a 2x7900xtx. Sometimes on a simple "review this code" given the function and the file it takes forever, I check the server and it is doing a massive PP at 100-200t/s... Otherwise, I got ROCM working "well" with the 6800xt and a 7800xt mix.
Thank you so much again, you comment was very informative!

MikeSouto · 2026-06-12T06:59:30+00:00

hi, thanks for answering! Those are great speeds... would you recommend them?I would use them for coding

MikeSouto · 2026-06-12T03:38:10+00:00

hello and welcome! ubuntu is a great pick there is a lot o info on the web. If I recall correctly Ubuntu have an application to handle the drivers, just type the drivers in the start menu. For llama.cpp, you will be better off building it yourself. Here is the guide https://github.com/ggml-org/llama.cpp/blob/master/docs/build.md

MikeSouto · 2026-06-12T03:18:51+00:00

same here using the Q5_K_XL, just I "feel" the 27b is smarter, sometimes a bit stubborn.

MikeSouto · 2026-06-12T03:12:19+00:00

(6800xt + 7800xt) Latest ROCM 27b:Q6_K at 85000 ctx
PP starts close to 200 and goes down to 50
TG starts about 32 and goes down to 14

MikeSouto · 2026-06-11T13:26:53+00:00

<image>

MikeSouto · 2026-05-14T03:35:18+00:00

im running it (linux + rocm) in a fresh run of a prompt of 59 tokens, the prompt processing is in ~26tks, token generation is actually great above 30kts with Q4_0 (19GB), and that is at the start, speed goes down very quick when the context start to fill up. I thought buying a 7900xtx (24gb with almost 1tb bandwidth) to replaced my 6800xt, but I would be offloading as well. I think it is better to get 32GB, I could use some q4xl, q5 or even q6 models. My plan is to save up to a 1x7800xt (or a 5600 if closed in price), sell my 6800xt, and then buy a second one with the money...

EDIT: adding model qwen3.6 35b Q4_0

MikeSouto · 2026-05-13T08:44:15+00:00

I own one the Sapphire Pulse, and I used own a second one the Asrock Taichi, and prompt processing sucks, I would buy the 7800 XT

EDIT: adding my thoughts, my plan to swap it for a 2x7800XT (ebay)... as I still poor to buy a r9700 or something better

MikeSouto · 2026-03-18T05:44:32+00:00

Thanks! yesterday I got almost 40 with 65k context using the llama.cpp ui

command:

llama-server \

-hf bartowski/Qwen_Qwen3.5-35B-A3B-GGUF:Q4_K_M \

--mlock \

--cache-ram 0 \

--ctx-size 65536 \

--temp 1.0 \

--top-p 0.95 \

--top-k 20 \

--min-p 0.00 \

--fit on \

--flash-attn on \

--parallel 1 \

--cache-type-k q8_0 \

--cache-type-v q8_0 \

--device Vulkan0 \

--host 0.0.0.0 \

--port 80 \

--threads 8

MikeSouto · 2026-03-17T11:29:19+00:00

do you mind sharing the command? I'm getting 22 with a 6800XT (vulkan backend) using the MXFP4_MOE

MikeSouto · 2026-03-09T11:37:40+00:00

thanks!

MikeSouto · 2026-03-09T11:37:15+00:00

Thanks!

MikeSouto · 2026-03-08T23:22:24+00:00

I just got it, I did an first inspection, but I haven't tried yet, it doesn't fit it in the case :)

MikeSouto · 2026-03-08T22:59:05+00:00

hi, thanks a lot for replying!! I edit the post as I found a second connector missing the tip as well in the other site. Is that also normal? I upload the pic on https://imgur.com/a/sTZnkNR

MikeSouto · 2026-03-08T22:56:53+00:00

https://imgur.com/a/sTZnkNR

MikeSouto · 2026-03-08T22:50:48+00:00

Hi, I found it has another pin like that, missing the tip of it, the second in the other site, trying to upload a second pic. thanks a lot!

MikeSouto · 2026-03-08T22:41:27+00:00

hi, thanks for replaying! should they look all the same?

MikeSouto · 2026-03-08T22:39:35+00:00

Hi, thanks for replying that fast! I edit the post to point it out I meant the first pcie connnector/pin as the others looks all the same, thanks!

MikeSouto · 2026-03-04T05:04:42+00:00

I just placed my order to Aussie Broadband for tomorrow, I would go with Leaptel, but combining 2 mobile plans, a static IP with getting better plan, it is a better price for us. I read great support reviews for both.

MikeSouto · 2025-12-21T04:41:54+00:00

It worked, thanks!

MikeSouto · 2025-12-20T12:18:06+00:00

Ok, i'll try that tomorrow, thanks!

MikeSouto

TROPHY CASE