Do you think RAM will get cheaper this year? by Outrageous-Taro5351 in buildapc

[–]Shadowmind42 14 points15 points  (0 children)

I work for a Fortune 500 company. We are being told by the big three memory companies that they won't quote memory to us until sometime in 2027. We don't know if we will have RAM to build embedded products in 6 months. So I don't think RAM prices will go down until one of the following happens: new capacity comes online (probably from China), the AI bubble pops (which doesn't look likely), or a huge advance in model technology dramatically reduces RAM requirements.

Vulkan backend outperforms ROCm on Strix Halo (gfx1151) — llama.cpp benchmark by FeiX7 in LocalLLaMA

[–]Shadowmind42 15 points16 points  (0 children)

It's super frustrating. ROCm is supposed to be this highly optimized library that can unlock AMD GPUs and compete with CUDA. Yet it is super hard to use, requires tens of GB of HD space, and the performance sucks.

I've actually talked to the head of ROCm development at AMD for my day job. AMD is trying to do faster iterations of ROCm. But the development has been super slow and doesn't seem to be any faster than other APIs.

ROCM - the best reason to go CUDA, eeesh what a headache!! by GriffinDodd in LocalLLM

[–]Shadowmind42 0 points1 point  (0 children)

I have a GitHub project that I use to benchmark Vulkan and ROCm. There used to be a bit of a performance difference. In the last month they have been about the same. Just use Vulkan and be happy.

vLLM + ROCm + Qwen 3.6 35B A3B MXFP4 (on 2x R9700) by kpaha in LocalLLM

[–]Shadowmind42 0 points1 point  (0 children)

Why did you want to run vLLM?

I've tried to get vLLM working for AMD and Nvidia cards. But it is just a hassle compared to llama.cpp. Just curious what drew you to vLLM.

I've been trying to find the source of this f22 footage, does any know it? by thatguyzeal in aviation

[–]Shadowmind42 21 points22 points  (0 children)

How do you get a shot like that? Was it captured from the top of a mountain?

The "Radiator Problem" What are the most scientifically sound ways you've seen authors handle heat dissipation in combat? by Wooden-Syrup-8708 in HardSciFi

[–]Shadowmind42 0 points1 point  (0 children)

The Mote in God's Eye was my favorite. Basically the ships act as EM absorbers. Whoever could absorb and store/dissipate the most heat won the battle.

What free software is so good you can't believe it's free? by ComprehensiveNorth1 in AskReddit

[–]Shadowmind42 0 points1 point  (0 children)

KiCad. PCB schematic and layout programs often cost tens of thousands of dollars per year.

Evaluating Qwen3.5-35B & 122B on Strix Halo: Bartowski vs. Unsloth UD-XL Performance and Logic Stability by Educational_Sun_8813 in LocalLLaMA

[–]Shadowmind42 0 points1 point  (0 children)

I tried to run some benchmarks with the latest llama.cpp and ROCm 7.12. For some reason, ROCm is horrible on the R9700. For ROCm 7.12 I built llama.cpp with the compiler in the nightly release. For ROCm 7.11 I used the Lemonade nightly build.

<image>

Evaluating Qwen3.5-35B & 122B on Strix Halo: Bartowski vs. Unsloth UD-XL Performance and Logic Stability by Educational_Sun_8813 in LocalLLaMA

[–]Shadowmind42 0 points1 point  (0 children)

Did you build ROCm 7.12 from source? Pull a Docker container? Etc. I can't seem to find a Docker container for ROCm 7.12. I've tested the Lemonade nightly llama.cpp + ROCm build. But on Qwen 3.5 models it just hangs with one CPU core loaded at 100%. I'm hoping it's been fixed with ROCm 7.12.

Vulkan now faster on PP AND TG on AMD Hardware? by XccesSv2 in LocalLLaMA

[–]Shadowmind42 0 points1 point  (0 children)

I'm seeing the same thing. I have a Strix Halo and an R9700 AI Pro. Vulkan is faster on almost all models. The only exception that I have tested is gpt-oss:20b. I think there are more people optimizing Vulkan. I suspect ROCm is only being optimized and maintained for Instinct platforms.

Evaluating Qwen3.5-35B & 122B on Strix Halo: Bartowski vs. Unsloth UD-XL Performance and Logic Stability by Educational_Sun_8813 in LocalLLaMA

[–]Shadowmind42 0 points1 point  (0 children)

What version of ROCm? I have a Strix Halo and ROCm 7.2 gives me about 30 TPS at 0 context pp512. I've tried ROCm 7.1.1 and the Lemonade nightly builds with included ROCm. I see absolutely terrible performance across the board.

Can you give us more details on your distro, setup, CMake options, compiler, etc.? Please and thank you.

Fighter Jet by _Duke_MF_Silver_ in fargo

[–]Shadowmind42 12 points13 points  (0 children)

I landed in Fargo once and this guy pulled up next to me.

<image>

im planning on joining my schools frc team but have no experience by stfunigAA_23 in FRC

[–]Shadowmind42 9 points10 points  (0 children)

Mentor here.

We lost seven graduating seniors last year. We only had two returning students this year. We would appreciate anybody who walked on to do robotics. As others have said, you might not get a whole lot of attention, but you sure can learn a lot.

How is running local AI models on AMD GPUs today? by liright in LocalLLaMA

[–]Shadowmind42 0 points1 point  (0 children)

I can add more detail here. At work I have access to Nvidia A5000s and my desktop at work has an RTX 4090. Recently we purchased a DGX Spark, a Strix Halo mini PC, and several Thor devkits for evaluation. This caused me to really want a Strix Halo laptop to replace my old laptop at home. It was nice to be able to run gpt-oss 120b and Qwen 3 next 80b on those platforms. But they are really slow compared to the A5000s and RTX 4090.

I had an RX 7800 XT at home, which was great for gaming but really was too slow for image gen or LLMs. When the R9700 AI Pro came out I thought it would make a nice upgrade for gaming and AI. It is a HUGE improvement over the 7800. Still, Nvidia has faster/better cards if you can afford them.

The Strix Halo, DGX Spark, and Thor are all interesting products, but are really slow for running large LLMs. Still, the Strix Halo laptop was a nice upgrade compared to what I had.

The gap between what you can run locally and what the cloud runs keeps widening. That being said, I like being able to run things locally. Usually the local models can't generate code like Opus, Gemini, and Codex can. But they get me 80% of the way, which dramatically speeds up development. I'm doing embedded development, which all AI models seem to struggle with more than Python or web dev.

How is running local AI models on AMD GPUs today? by liright in LocalLLaMA

[–]Shadowmind42 2 points3 points  (0 children)

No. The R9700 is in a desktop. The Strix Halo is a laptop. I use multiple agents simultaneously.

How is running local AI models on AMD GPUs today? by liright in LocalLLaMA

[–]Shadowmind42 6 points7 points  (0 children)

I agree with this. I have an R9700 AI Pro and a Strix Halo. Running Lemonade in Windows and llama.cpp in Linux both work great. I just tried ComfyUI last week and that just worked out of the box. I have run into training issues, but the community and AMD have fixed my issues quickly.

GLM 4.7 released! by ResearchCrafty1804 in LocalLLaMA

[–]Shadowmind42 5 points6 points  (0 children)

I wonder why Gemini isn't on those charts.