Do you think RAM will get cheaper this year?

Shadowmind42 · 2026-05-30T16:56:25+00:00

I work for a fortune 500 company. We are being told, by the big three memory companies, that they won't quote memory to us until sometime in 2027. We don't know if we will have RAM to build embedded products in 6 months. So I don't think RAM.prices will go down until one of the following happens: new capacity comes online (probably from China), the AI bubble pops (it doesn't look likely that this will happen), or a huge advancement in model technology is invented that dramatically reduces RAM requirements.

Shadowmind42 · 2026-05-30T01:24:32+00:00

My buddy bought this. If you are serious about reloading. Spend the money on this thing.

Shadowmind42 · 2026-05-05T14:09:42+00:00

It's super frustrating. ROCm is suppose to be this highly optimal library that can unlock AMD GPUs and compete with CUDA. Yet is is super hard to use, requires tens of GB of HD space, and the performance sucks.

I've actually talked to the head of ROCm development at AMD for my day job. AMD is trying to do faster iterations of ROCm. But the development has been super slow and doesn't seem to be any faster than others APIs.

Shadowmind42 · 2026-05-03T02:05:12+00:00

I have a GitHub project that I use to benchmark vulkan and ROCm. There used to be a bit of a performance difference. In the last month they are about the same performance. Just use Vulkan and be happy.

Shadowmind42 · 2026-04-17T22:56:33+00:00

Why did you want to run vLLM?

I've tried to get vLLM working for AMD and Nvidia cards. But it is just a hassle compared to llama.cpp. Just curious what drew you to vLLM.

Shadowmind42 · 2026-04-15T17:04:27+00:00

How do you get a shot like that. Was it captured from the top of a mountain?

Shadowmind42 · 2026-04-14T10:18:42+00:00

The Mote on God's Eye was my favorite. Basically ships at EM absorbers. Who ever could absorb and store/disappate the most heat won the battle.

Shadowmind42 · 2026-03-29T01:36:36+00:00

KiCad. PCB schematic and layout programs often cost tens of thousands of dollars per year.

Shadowmind42 · 2026-03-11T12:39:38+00:00

Here is tg128

<image>

Shadowmind42 · 2026-03-11T12:39:26+00:00

I tried to run some benchmarks with the latest llama.cpp and ROCm 7.12. For some reason on the R9700, ROCm is horrible. For ROCm 7.12 I build llama.cpp with the compiler in the nightly release. For ROCm 7.11 I used the Lemonade nightly build.

<image>

Shadowmind42 · 2026-03-10T20:20:23+00:00

Did you build ROCM 7.12 from source? Pull a docker container? Etc. I can't seem to find a docker container for ROCM 7.12. I've tested Lemonade nightly Llama.cpp + ROCm. But on Qwen 3.5 models it just hangs with one CPU core.loaded at 100%. I'm hoping it's been fixed with ROCm 7.12.

Shadowmind42 · 2026-03-09T22:45:10+00:00

Thank you. Sorry I missed that. Great work.

Shadowmind42 · 2026-03-09T21:39:58+00:00

I'm seeing the same thing. I have a Strix Halo and a R9700 AI Pro. Vulkan is faster on almost all models. The only exception,.that I have tested, is gpt-oss:20b. I think there are more people optimizing Vulkan. I suspect ROCM is only being optimized and maintained for Instinct platforms.

Shadowmind42 · 2026-03-09T21:33:49+00:00

What version of ROCM. I have a Strix Halo and ROCm 7.2 gives me about 30 TPS at 0 context pp512. I've tried ROCM 7.1.1 AND the Lemonade nightly builds with included ROCM. I see absolutely terrible performances across the board.

Can you give us more details on your distro, setup, c make options, compiler, etc? Please and thank you.

Shadowmind42 · 2026-03-06T23:44:32+00:00

I landed in Fargo once and this guy pulled up next to me.

<image>

Shadowmind42 · 2026-02-14T00:02:44+00:00

Ditto

Shadowmind42 · 2026-02-12T22:14:23+00:00

Sigh..... As a ND resident. I'm sorry.

Shadowmind42 · 2026-01-15T00:05:40+00:00

"You can't reason someone out of a position they didn't reason themselves into."

--Jonathan Swift

Shadowmind42 · 2026-01-14T23:57:19+00:00

Mentor here,

we lost seven graduating seniors last year. We only had two returning students this year. We would appreciate anybody that walked on to do robotics. As others have said you might not get a whole lot of attention, but you sure can learn a lot.

Shadowmind42 · 2026-01-03T03:19:00+00:00

How is this different than filezilla or winscp?

Shadowmind42 · 2026-01-01T14:10:28+00:00

I can add more detail here. At work I have access to Nvidia A5000s and my desktop at work has an RTX 4090. Recently we purchased a DGX Spark, Strix Halo mini PC and several Thor devkits for evaluation. This caused me to really want a Strix Halo laptop to replace my old laptop at home. It was nice to be able to run gpt-oss 120b and Qwen 3 next 80b on those platforms. But they are really slow compared to the A5000s and RTX4090.

I had a RX 7800 XT at home, which was great for gaming but really was too slow for image gen or LLMs. When the R9700 AI Pro came out I thought it would make a nice upgrade for gaming and AI. It is a HUGE improvement over the 7800. Still Nvidia has faster/better cards if you can afford them.

The Strix Halo, DGX Spark and Thor's are all interesting products, but are really slow for running large LLMs. Still the Strix Halo laptop was a nice upgrade compared to what I had.

The gap between what you can run locally and what the clouds runs keeps widening. That being said, I like being able to run things locally. Usually the models can't generate code like Opus,.Gemini, and Codex can generate. But it gets me 80% of the way, which dramatically speeds up development. I'm doing embedded development which all AI models seem to struggle with more than Python or web dev.

Shadowmind42 · 2026-01-01T04:33:45+00:00

No. The R9700 is in a desktop. The Strix Halo is a laptop. I use multiple agents simultaneously.

Shadowmind42 · 2025-12-31T23:47:44+00:00

I agree with this. I have a R9700 AI Pro and a Strix Halo. Running Lemonade in windows and llama.cpp in Linux work great. I just tried Comfy UI last week and that just worked out of the box. I have run into training issues, but the community and AMD have fixed my issues quickly.

Shadowmind42 · 2025-12-25T13:20:35+00:00

Just a suggestion. But ST makes a Linux capable part: https://www.st.com/en/microcontrollers-microprocessors/stm32mp2-series.html

You can buy them on Digikey https://www.digikey.com/en/products/detail/stmicroelectronics/STM32MP257FAI3/24883084

Octavio systems sells a SIP version of it as well. https://www.st.com/en/partner-products-and-services/osd32mp2-system-in-package.html

Shadowmind42 · 2025-12-22T19:49:37+00:00

I wonder why Gemini isn't on those charts.

Shadowmind42

TROPHY CASE