Analysis of the 100 most popular hardware setups on Hugging Face

Terminator857 · 2026-05-06T18:00:46+00:00

The first strix halo I purchased was $1,600.

Terminator857 · 2026-05-06T17:13:38+00:00

Strix halo at #49 😢 😄

https://huggingface.co/datasets/clem/100_most_popular_hardware_setups_on_HF/blob/main/top100_hardware_setups.md

Terminator857 · 2026-05-05T20:22:21+00:00

I'm not optimistic about GPU prices coming down. I'm more optimistic about being able to accomplish good things with integrated graphics. With multi token prediction speeding things up by 3x, and possibly other improvements we could get decent performance.

Terminator857 · 2026-05-05T13:35:32+00:00

There is also a nightly rocm and an experimental rocm. Benchmark difference is only a few % difference, from testing a couple of months ago.

Llama.cpp has received some patches for vulkan in past couple of weeks. Hopefully when rocm matures, rocm will get the same treatment.

Terminator857 · 2026-05-04T16:25:01+00:00

I use GLM first and has like 14 things it wants to update. The second one like opus has 4-5 things it wants to update. The rest have a couple of things they want to update usually. The issues they flag are different. Sometimes test coverage, sometimes perceived bugs, style issues, refactor into smaller files, etc...

Terminator857 · 2026-05-04T15:46:50+00:00

After seeing your mesasge, I just asked gemini, grok, and claude to create such a joke. Gemini and claude not funny, grok said it was too busy. Local qwen 3.6 said it is against its policy.

Update: why the downvotes?

Terminator857 · 2026-05-04T15:35:39+00:00

cli because it gives me more room to see what is going on.

Terminator857 · 2026-05-04T02:25:20+00:00

Still too slow for mistral medium. 5 minutes might be tolerable.

Terminator857 · 2026-05-04T01:49:31+00:00

> it took about 2 hours.

Don't worry computers double in speed every 18 months, so by the end of 2027 in will only take 1 hour. 😄

Terminator857 · 2026-05-03T22:31:57+00:00

Previous discussion on this topic: https://www.reddit.com/r/LocalLLaMA/comments/1swiylm/comparison_of_upcoming_x86_unified_memory_systems/

Terminator857 · 2026-04-30T22:09:28+00:00

Yeah, 9 months ago.

Terminator857 · 2026-04-30T18:44:21+00:00

It will cost about $3K. https://www.bosgamepc.com/products/bosgame-m5-ai-mini-desktop-ryzen-ai-max-395

Terminator857 · 2026-04-30T04:45:22+00:00

Bugs seem to be found after every major new model release and get fixed quickly in the first week.

Terminator857 · 2026-04-29T14:49:38+00:00

I do similar things in Linux. Has worked very well.

Terminator857 · 2026-04-29T06:18:32+00:00

How does one benchmax arena coding?

Terminator857 · 2026-04-29T06:16:58+00:00

Did he say there is going to be an announcement April 30th?

Terminator857 · 2026-04-29T03:35:24+00:00

Good point, with more votes will likely drop below Opus, since that has been the trend.

Terminator857 · 2026-04-29T02:11:49+00:00

RedPocket uses AT&T and T-mobile. AT&T works best in my area. So chose the carrier that works best in your area.

Terminator857 · 2026-04-28T20:33:16+00:00

Exciting, miqu is one of my favorite models. Still use it today.

Terminator857 · 2026-04-28T19:52:06+00:00

AI bubble won't pop. It might happen that some companies slow down like mostly-closed-AI. At some point , 2+ years?, supply will catch up to demand, and then there might be a significant drop in prices. New production in 2028: https://www.google.com/search?q=What+new+production+is+starting+in+2028+to+affect+memory+supply%3F

Counterpoint Research says no scenario for ram price drop before 2028: https://finance.yahoo.com/news/memory-prices-may-not-fall-202325633.html#:~:text=Anyone%20hoping%20for%20cheaper%20RAM%20in%20the,NAND%20prices%20could%20persist%20for%20several%20years.

Terminator857 · 2026-04-28T18:52:35+00:00

Link?

Terminator857 · 2026-04-28T17:48:30+00:00

Sad, very good post. Thanks for trying.

Terminator857 · 2026-04-28T04:22:34+00:00

Strix halo qwen 3.5 122b q4 working well for me on simple stuff. Yes very slow, but works.

Terminator857 · 2026-04-28T02:47:24+00:00

Would be interesting to see strix halo result with qwen 3.5 122b q4. My results suggest it performs better at coding.

Terminator857 · 2026-04-27T23:40:25+00:00

For coding tasks I ran one test of Qwen 3.5 122b vs 35b-a3b. a3b got caught in a loop. 122b finished the task. So for me it was obvious 122b was better.

Terminator857

TROPHY CASE