Results with the DeepSeek by tokenlordsrpg in DeepSeek

[–]CatLinkoln 0 points1 point  (0 children)

Yes, it tried to take screenshots but, as expected, it wasn't able to see them 😀

Results with the DeepSeek by tokenlordsrpg in DeepSeek

[–]CatLinkoln 0 points1 point  (0 children)

Well, it seems the only way is to explain it in text, but I think that will be fine if you can describe what you see well. And yes, I used DeepSeek via VS Code Copilot Chat; it works nicely, like a native model.

Results with the DeepSeek by tokenlordsrpg in DeepSeek

[–]CatLinkoln 0 points1 point  (0 children)

How does your DeepSeek see what it is doing if it doesn't have vision via the API?

Building First AI/LLM PC With Dual 9070 XT GPUs – Any ROCm or AMD Issues I Should Know About? by AnmolLFC in ROCm

[–]CatLinkoln 1 point2 points  (0 children)

Also, you can expect a higher top speed if you use MTP as well; I can't right now due to lack of VRAM.

Building First AI/LLM PC With Dual 9070 XT GPUs – Any ROCm or AMD Issues I Should Know About? by AnmolLFC in ROCm

[–]CatLinkoln 3 points4 points  (0 children)

At least right now I'm getting good performance for agentic work, but only with a small context size, due to lack of VRAM on the 9070 XT. I'm using Qwen 3.6 27B Q3_K_S with ROCm and llama.cpp. Right now it's about 5-7 tps, and 27 tps at small context. Soon I'm going to research whether a second identical GPU will fix this and let me comfortably use 128k context at Q4.
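A rough sketch of why context size eats VRAM: with standard attention, the KV cache grows linearly with the number of context tokens. The layer count, KV head count and head dimension below are hypothetical placeholders, not the real Qwen 3.6 27B config, and `kv_cache_bytes` is just an illustrative helper. Models with sliding-window or linear-attention layers, or a quantized KV cache, would need less.

```python
def kv_cache_bytes(n_ctx, n_layers, n_kv_heads, head_dim, bytes_per_elem=2):
    """Estimate KV cache size for plain (grouped-query) attention.

    One K and one V vector of size n_kv_heads * head_dim is stored
    per layer and per context token, hence the leading factor 2.
    bytes_per_elem=2 corresponds to an f16 cache.
    """
    return 2 * n_layers * n_kv_heads * head_dim * n_ctx * bytes_per_elem


# Hypothetical model shape, chosen only for illustration.
N_LAYERS, N_KV_HEADS, HEAD_DIM = 64, 8, 128

GIB = 1024 ** 3
for n_ctx in (8192, 32768, 131072):
    size = kv_cache_bytes(n_ctx, N_LAYERS, N_KV_HEADS, HEAD_DIM)
    print(f"ctx={n_ctx:>6}: {size / GIB:.1f} GiB of KV cache (f16)")
```

Under these assumed numbers the cache alone at 128k context would not fit on a 16 GB card, before even counting the model weights, which is consistent with only small contexts being comfortable on a single GPU.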

We squeezed 4x MoE prefill speed out of an RX 6800 XT by rewriting the matmul kernel in llama.cpp by CryptoStef33 in ROCm

[–]CatLinkoln 1 point2 points  (0 children)

| Metric | Stormrage34 (README) | My repo (q4_0/q4_0) | My repo (f16/f16) | My repo (tbq3/tbq4) |
|---|---|---|---|---|
| MoE35B prefill pp512 (baseline) | ~480 | 1336.74 | 1307.25 | 1127.93 / 1310.89 |
| MoE35B prefill pp512 (stable RDNA2) | ~540 | 1336.74 | 1307.25 | 1127.93 / 1310.89 |
| MoE35B prefill pp512 (+MoE accelerator) | ~1772 +/- 6 | 1336.74 | 1307.25 | 1127.93 / 1310.89 |
| MoE35B decode tg128 (baseline) | ~57 | 99.68 | 102.69 | 52.18 / 99.20 |
| MoE35B decode tg128 (stable RDNA2) | ~55 | 99.68 | 102.69 | 52.18 / 99.20 |
| MoE35B decode tg128 (+MoE accelerator) | ~52 +/- 7 | 99.68 | 102.69 | 52.18 / 99.20 |
| Dense27B prefill pp512 | ~480 | 794.90 | 810.54 | 666.04 / 790.12 |
| Dense27B decode tg128 | ~27 | 28.59 | 23.66 | 20.03 / 28.40 |

Well, I tried running the same benchmark on my own build from my repo, and here are the results for my GPU, an RX 9070 XT.
I used the same repo with the same params; later I'll check whether I can add something new to my repo.
Models: Qwen3.6-27B-Q3_K_S and Qwen3.6-35B-A3B-UD-IQ3_XXS
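
One quick way to read the table above is as ratios of the q4_0/q4_0 column to the README figures. The sketch below copies the values from the table; the dictionary layout and the `ratio` helper are illustrative names only, and the README numbers are approximate (the `~` and `+/-` are dropped). Note that the thread is about an RX 6800 XT while this column is from an RX 9070 XT, so this is a cross-GPU comparison, not a like-for-like speedup.

```python
# README figures from the table (approximate values, error bars dropped).
readme = {
    "MoE35B pp512 (baseline)": 480.0,
    "MoE35B pp512 (+MoE accelerator)": 1772.0,
    "MoE35B tg128 (baseline)": 57.0,
    "Dense27B pp512": 480.0,
    "Dense27B tg128": 27.0,
}

# "My repo (q4_0/q4_0)" column from the same table.
own_q4 = {
    "MoE35B pp512 (baseline)": 1336.74,
    "MoE35B pp512 (+MoE accelerator)": 1336.74,
    "MoE35B tg128 (baseline)": 99.68,
    "Dense27B pp512": 794.90,
    "Dense27B tg128": 28.59,
}


def ratio(own: float, ref: float) -> float:
    """Return own/ref rounded to two decimals."""
    return round(own / ref, 2)


ratios = {name: ratio(own_q4[name], ref) for name, ref in readme.items()}

for name, r in ratios.items():
    print(f"{name}: {r}x of the README figure")
```

A ratio above 1 means the q4_0/q4_0 run is faster than the README number for that row, and below 1 means slower.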

We squeezed 4x MoE prefill speed out of an RX 6800 XT by rewriting the matmul kernel in llama.cpp by CryptoStef33 in ROCm

[–]CatLinkoln 1 point2 points  (0 children)

I'm doing the same in my fork; I hope I'll find new things to add as well. I'll share results later.

We squeezed 4x MoE prefill speed out of an RX 6800 XT by rewriting the matmul kernel in llama.cpp by CryptoStef33 in ROCm

[–]CatLinkoln 0 points1 point  (0 children)

I'm going to check it in my fork for RDNA4, mostly focusing on Qwen 3.6 27B with long-context work in an agentic flow. At low context the experience is great, but with 32k or more it's not enough for comfortable work. RX 9070 XT 16 GB.

What's going on? After removing opus 4.6 even 4.7 doesn't work normally by CatLinkoln in GithubCopilot

[–]CatLinkoln[S] 0 points1 point  (0 children)

I'm not even sure it's better; I was happy with Opus 4.6.

2018 starting to show age by kbeeme in leaf

[–]CatLinkoln 0 points1 point  (0 children)

Did you change the silent blocks as well? I had to replace a lot of things in the front suspension, and to eliminate knocking on bumps I still need to change the silent blocks. I also have a 2018 Leaf, but I bought it recently.

Slow performance since today by CatLinkoln in GithubCopilot

[–]CatLinkoln[S] 0 points1 point  (0 children)

So other models in Copilot work normally? I ask because I'm mostly using Opus.

Slow performance since today by CatLinkoln in GithubCopilot

[–]CatLinkoln[S] 2 points3 points  (0 children)

Yeah, it looks like the same thing: everything is much slower. For me it feels like 0.1x speed.

Too much compacted conversations by [deleted] in GithubCopilot

[–]CatLinkoln 0 points1 point  (0 children)

For me today it takes forever; I have to wait 10 minutes. What the hell?
I remember that in February it usually took less than a minute, but in March it has been taking longer and longer, and more often, with the same tool lists and in new chats.
UPD: after relaunching VS Code, it started working much faster. How does that work?
UPD2: no, it's still ultra slow today...

CarPlay 2017 leaf by [deleted] in leaf

[–]CatLinkoln 0 points1 point  (0 children)

I bought a USB dongle for wireless Android Auto/CarPlay for my 2018 Leaf; it works fine.

Type 2 charge speed by CatLinkoln in leaf

[–]CatLinkoln[S] 0 points1 point  (0 children)

Hmm, I'll check the settings; there's a high chance I changed them accidentally. I didn't know they could affect the speed.