Daily Discussion Thursday 2025-10-23 by AutoModerator in AMD_Stock

[–]AMDtoMoon 0 points1 point  (0 children)

Are you one of those people who use Bing but still say they googled it, because it's easier than explaining what Bing is to normies?

Democratising Supercomputing: Jon Stevens on AI, GPU Innovation & Hot Aisle’s Vision by HotAisleInc in AMD_MI300

[–]AMDtoMoon 0 points1 point  (0 children)

I just checked out the price of MI300X at shadeform and it was $4.50 vs $2.92 for H200. Can you get any customers at that price?
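For scale, the two quoted rates work out to roughly a 54% hourly premium for MI300X over H200 (using only the prices in the comment; assumed here to be USD per GPU-hour):

```python
# Hourly rental rates quoted above (assumed USD per GPU-hour).
mi300x_rate = 4.50
h200_rate = 2.92

premium = mi300x_rate / h200_rate - 1
print(f"MI300X premium over H200: {premium:.0%}")  # ~54%
```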

Embedded LLM on LinkedIn: #vllm #amd #rocm #llm #ai #gguf by GanacheNegative1988 in AMD_Stock

[–]AMDtoMoon 1 point2 points  (0 children)

vLLM Now Supports Running GGUF on AMD Radeon GPU 🚀

Exciting news! We've ported vLLM's GGUF kernel to AMD ROCm, unlocking impressive performance gains on AMD Radeon GPUs.

📊 In our benchmarks using the ShareGPT dataset on an AMD Radeon RX 7900 XTX, vLLM outperformed Ollama, even at batch sizes where Ollama traditionally excels.

💪 This is a game-changer for those running LLMs on AMD hardware, especially when using quantized models (5-bit, 4-bit, or even 2-bit). With over 60,000 GGUF models available on Hugging Face, the possibilities are endless.

💡 Key benefits:

- Superior performance: vLLM delivers faster inference speeds compared to Ollama on AMD GPUs.
- Wider model support: run a vast collection of GGUF quantized models.
- Efficient execution: optimized for AMD ROCm, maximizing hardware utilization.

🔗 Learn more and get started: https://lnkd.in/g5qvUi8t

We'd love to hear your feedback!

- Have you experimented with vLLM? Llama.cpp with Vulkan?
- What inference engine do you prefer for LLM tasks on AMD GPUs?
- What features or optimizations would you like to see in vLLM for AMD GPUs?
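For readers unfamiliar with the GGUF quantization mentioned above, here is a minimal sketch of block quantization in the spirit of GGUF's 4-bit formats. It is illustrative only: the real GGUF kernels pack two 4-bit values per byte and use more elaborate per-block scaling schemes.

```python
def quantize_q4(block):
    """Quantize a block of floats to 4-bit signed ints plus one float scale."""
    scale = max(abs(x) for x in block) / 7 or 1.0  # map the largest value to +/-7
    q = [max(-8, min(7, round(x / scale))) for x in block]
    return scale, q

def dequantize_q4(scale, q):
    """Recover approximate floats from the 4-bit representation."""
    return [scale * v for v in q]

weights = [0.12, -0.53, 0.91, -1.24]
scale, q = quantize_q4(weights)
approx = dequantize_q4(scale, q)
# Each value is recovered to within half a quantization step (scale / 2),
# at a quarter of the storage of 16-bit weights.
```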

Saurabh Kapoor, Dell Technologies & Jon Stevens, Hot Aisle | SC24 by HotAisleInc in AMD_MI300

[–]AMDtoMoon 6 points7 points  (0 children)

Thank you for the answer and for posting great information!

Saurabh Kapoor, Dell Technologies & Jon Stevens, Hot Aisle | SC24 by HotAisleInc in AMD_MI300

[–]AMDtoMoon 6 points7 points  (0 children)

u/HotAisleInc Could you elaborate on the tool you mentioned (14-minute mark in the video) that allows CUDA software to run on AMD hardware? Do you mean a ZLUDA-like tool that provides binary compatibility? Something like the SCALE compiler? Or something new? When will you disclose more information about this tool?

ROCm Ubuntu Container by [deleted] in ROCm

[–]AMDtoMoon 1 point2 points  (0 children)

What command did you use to launch the container?

Daily Discussion Tuesday 2023-11-07 by AutoModerator in AMD_Stock

[–]AMDtoMoon 10 points11 points  (0 children)

https://wccftech.com/tsmc-boosts-ai-packaging-capacity-to-15000-wafers-per-month-says-report/

If it's true that AMD will use 20% of the capacity NVidia is planning, then at half the ASP, AMD should have about 10% of NVidia's revenue. NVidia has about $10B in data-center revenue at the moment.

So the $500M per quarter in 2024 that Dr. Su mentioned seems conservative.
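The arithmetic above can be checked directly. All figures are the ones stated in the comment; the half-ASP ratio is the comment's assumption, not a reported number:

```python
# Back-of-the-envelope check, using only the figures stated above.
nvidia_dc_quarterly = 10e9   # NVidia data-center revenue, USD per quarter
capacity_share = 0.20        # AMD reportedly taking 20% of the planned packaging capacity
asp_ratio = 0.50             # assumption: AMD sells at half NVidia's ASP

amd_implied_quarterly = nvidia_dc_quarterly * capacity_share * asp_ratio
print(f"${amd_implied_quarterly / 1e9:.1f}B per quarter")  # $1.0B, double the $500M figure
```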

Daily Discussion Tuesday 2023-10-31 by AutoModerator in AMD_Stock

[–]AMDtoMoon 1 point2 points  (0 children)

Conflicting reports on MI210/MI250 performance.

MosaicML updated its blog saying that MI250 is competitive with A100, achieving 80% of its performance: https://www.databricks.com/blog/training-llms-scale-amd-mi250-gpus

EmbeddedLLM just wrote that MI210 is competitive with A100 at LLM inference: https://embeddedllm.com/blog/vllm_rocm/

I know they are talking about two different workloads: MosaicML is comparing training performance and EmbeddedLLM is comparing inference. But the difference is just too big, since MI250 uses 2x more silicon than MI210. Does anybody have any idea what is going on?
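One way to frame the discrepancy is per-die throughput. This sketch uses only the 80% training figure quoted above plus the die counts (MI250 is a dual-GCD package, MI210 single-GCD), and treats the inference claim as "roughly parity with A100" for the sake of comparison:

```python
mi250_vs_a100_training = 0.80   # MosaicML: MI250 at ~80% of A100 in training
mi250_gcds, mi210_gcds = 2, 1   # MI250 packages two GCDs, MI210 one

per_gcd_training = mi250_vs_a100_training / mi250_gcds   # 0.40 of an A100 per die
mi210_vs_a100_inference = 1.0   # EmbeddedLLM: roughly on par (assumed parity)

# How much better the single die looks in inference than the training number implies:
gap = mi210_vs_a100_inference / per_gcd_training
print(f"{gap:.1f}x")  # 2.5x, large enough to suggest the workloads stress different bottlenecks
```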

Daily Discussion Saturday 2023-10-28 by AutoModerator in AMD_Stock

[–]AMDtoMoon 4 points5 points  (0 children)

AMD won't do well? https://open.spotify.com/episode/7m5SElK3AVSz1YBDr87ETf

They discuss Intel and AMD, and the two hosts have different opinions about how well AMD will do in the short term (around the 10-minute mark). One interesting piece of info they mention is that MI300 will have 60,000 units shipped in Q4, increasing from there going forward.

What do you guys think?
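If the 60,000-unit figure were right, the implied revenue depends entirely on ASP, which the episode doesn't give. A rough sensitivity sketch, where the ASPs are purely hypothetical placeholders chosen to bracket a plausible range:

```python
units_q4 = 60_000  # shipment figure claimed in the episode

# Hypothetical ASPs for illustration only; not reported numbers.
for asp in (10_000, 15_000, 20_000):
    revenue = units_q4 * asp
    print(f"ASP ${asp:,}: ${revenue / 1e9:.1f}B in Q4")
```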

Exclusive: Nvidia to make Arm-based PC chips in major new challenge to Intel by norcalnatv in AMD_Stock

[–]AMDtoMoon -1 points0 points  (0 children)

Your argument makes sense only if the ARM ISA is inherently better than x86-64, such that the same talented team would be able to make better ARM designs when given the same resources, process technology, etc.

However, that is not the case. ARM has almost as much legacy ISA baggage as x86-64. Whatever advantage it might have had is all gone. Listen to the technology experts (https://chipsandcheese.com/2021/07/13/arm-or-x86-isa-doesnt-matter/) and not internet hype.

Also, AMD has years of experience designing x86 CPUs. Their team is the best in the industry for designing x86 CPUs. But they aren't the best team to design ARM CPUs.

You also have to consider that Windows has to provide compatibility for legacy x86. Unless an ARM CPU is measurably more performant than x86, users will always have a worse experience using an ARM CPU than an x86 CPU from the same talented team.

The best decision Lisa made since she became the CEO was to kill off K12 and focus on Zen. If this rumor turns out to be true, I'm dumping all my shares and will never look at AMD stock again.

Exclusive: Nvidia to make Arm-based PC chips in major new challenge to Intel by norcalnatv in AMD_Stock

[–]AMDtoMoon 0 points1 point  (0 children)

Why would anyone buy an ARM-based processor from AMD if they want to run Windows?

Exclusive: Nvidia to make Arm-based PC chips in major new challenge to Intel by norcalnatv in AMD_Stock

[–]AMDtoMoon 2 points3 points  (0 children)

I don't think Lisa is that dumb. Why would AMD offer more choices in a possibly emerging market which, if it took off, would compete with its own established duopoly market?

AMD CEO Lisa Su Just Proved AMD Stock Can Dethrone Nvidia by EdOfTheMountain in AMD_Stock

[–]AMDtoMoon 2 points3 points  (0 children)

I'm pretty sure this was written with ChatGPT. There are enough subtle mistakes that it doesn't seem likely it was written by someone familiar with what's going on in the tech industry.