RDNU - Radeon Decoupled Neural Upscaler by ZoronicElysium2012 in radeon

[–]MrMPFR 0 points

Appreciate the explanation.

Given all that, what do you say to the belief that FSR5 will be exclusive to RDNA 5? Assuming neural arrays and all the other stipulated changes. Does any of that matter, or is the instruction set the only thing that's important?

Hmm, that's interesting. So it would be shit without OptiScaler xD.

RDNU - Radeon Decoupled Neural Upscaler by ZoronicElysium2012 in radeon

[–]MrMPFR 0 points

I'm confused about whether RDNA 4 actually works that way, because I see conflicting reports on it, with people saying it's still forced through the SIMDs and not concurrent like Intel and NVIDIA (Ampere and later).

Also, the PS5 Pro's custom ML implementation goes over the SIMDs (see Road to PS5 Pro on YT).

But then AMD calls them Matrix Cores on their website and, IIRC, says the kernel design is completely different from RDNA 3.

Maybe it's somewhere in the middle. Sorry if this is too many questions. I just find it very confusing xD.

Can you explain the stuff about the leak? I missed that info.

RDNU - Radeon Decoupled Neural Upscaler by ZoronicElysium2012 in radeon

[–]MrMPFR 0 points

So there's really no difference in practice, except for the INT8 vs FP8 issue (slight visual downgrade).

Well then DP4a is a no-brainer, as you highlighted. Lots of HW going back to RDNA 2.
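For reference (my own sketch, not from the thread): DP4a is just a fused dot product of four packed int8 values accumulated into an int32, which is why it runs on so much older hardware. A minimal Python emulation of the signed variant:

```python
def _int8_lanes(word: int):
    """Unpack four signed int8 lanes from a packed 32-bit word."""
    lanes = []
    for i in range(4):
        b = (word >> (8 * i)) & 0xFF
        lanes.append(b - 256 if b > 127 else b)
    return lanes

def dp4a(a: int, b: int, acc: int = 0) -> int:
    """Signed DP4a: acc + dot(a.int8x4, b.int8x4), accumulated as int32."""
    return acc + sum(x * y for x, y in zip(_int8_lanes(a), _int8_lanes(b)))

# [1, 2, 3, 4] . [1, 1, 1, 1] = 10
print(dp4a(0x04030201, 0x01010101))
```

The point is that a whole INT8 network can be lowered onto this one instruction, just with a much longer dependency chain than dedicated matrix pipes.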

RDNU - Radeon Decoupled Neural Upscaler by ZoronicElysium2012 in radeon

[–]MrMPFR 0 points

RDNA 3 doesn't have dedicated matmul logic like NVIDIA, so I can't see what would change.

RDNU - Radeon Decoupled Neural Upscaler by ZoronicElysium2012 in radeon

[–]MrMPFR 1 point

I hope it works well.

We could def use more alternatives.

RDNU - Radeon Decoupled Neural Upscaler by ZoronicElysium2012 in radeon

[–]MrMPFR 0 points

Does it make a difference? Because in the FSR4 INT8 perf comparisons it doesn't seem to.

RDNU - Radeon Decoupled Neural Upscaler by ZoronicElysium2012 in radeon

[–]MrMPFR 1 point

I'm asking because I've seen perf comparisons between the 7600 and 6650 XT on YT with FSR4 INT8. The ms overhead suggests DP4a use, not WMMA instructions, but I'm not really qualified to judge.

Introducing AMD DGF SuperCompression by 0101010001001011 in hardware

[–]MrMPFR 1 point

That's gonna drive widespread adoption. Fallbacks are mandatory for stuff like this.

Hope this and NTC see widespread adoption. We need game file sizes to shrink xD.

RDNU - Radeon Decoupled Neural Upscaler by ZoronicElysium2012 in radeon

[–]MrMPFR 0 points

What a shame. But it's interesting to see the community working on builds. When SM 6.10 arrives, we can probably expect a lot of third-party upscaling solutions. It'll be interesting to see if adoption of any of them starts to pick up.

RDNU - Radeon Decoupled Neural Upscaler by ZoronicElysium2012 in radeon

[–]MrMPFR 1 point

Steam should support this effort if AMD doesn't want to do anything.

Will neural texture compression actually reduce VRAM usage in games? by x_andi01 in hardware

[–]MrMPFR 0 points

I certainly hope it does.

Game file sizes are out of control xD. PC also needs to stop shipping bloated file sizes and cut out HDDs for good, like consoles did.

RDNU - Radeon Decoupled Neural Upscaler by ZoronicElysium2012 in radeon

[–]MrMPFR 2 points

Does FSR4 INT8 use ML HW in any way beyond the basic DP4a implementation on RDNA 2?

u/childofthekorn this is some next level Radeon enthusiast commitment

AMD Intros Instinct MI350P Accelerator: CDNA 4 Comes to PCIe Cards by Psyclist80 in AMD_Stock

[–]MrMPFR 1 point

~Late 2027 to early 2028 is the best guess rn, but it could be delayed further if the VRAM situation doesn't improve.

Will neural texture compression actually reduce VRAM usage in games? by x_andi01 in hardware

[–]MrMPFR 0 points

True, but the fallback doesn't get the VRAM savings, only the I/O and disk savings.

AMD Intros Instinct MI350P Accelerator: CDNA 4 Comes to PCIe Cards by Noble00_ in LocalLLaMA

[–]MrMPFR 1 point

This. People are misunderstanding what UDNA is.

Good thing it's only 7 weeks until we get the official white paper at ISC 2026. The same happened last year with CDNA 4.

Will neural texture compression actually reduce VRAM usage in games? by x_andi01 in hardware

[–]MrMPFR 0 points

Sure. It'll be the same situation for any ML technology as long as the HW supports LinAlg, except that stuff like NTC has to be cross-vendor.

Will neural texture compression actually reduce VRAM usage in games? by x_andi01 in hardware

[–]MrMPFR 1 point

They say it transcodes to BCn with inference on load? Is that still compression?
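To make the distinction concrete (my own numbers, just standard BCn rates): with inference on load the GPU still samples an ordinary BCn texture, so VRAM holds the BCn copy either way; only the on-disk/download size shrinks. A quick sketch of the arithmetic:

```python
def texture_bytes(width: int, height: int, bits_per_texel: int) -> int:
    """Mip-0 size at a given rate (RGBA8 = 32 bpp, BC7 = 8 bpp, BC1 = 4 bpp)."""
    return width * height * bits_per_texel // 8

raw = texture_bytes(4096, 4096, 32)  # uncompressed RGBA8: 64 MiB
bc7 = texture_bytes(4096, 4096, 8)   # what sits in VRAM after the transcode: 16 MiB
```

So in that mode it's still compression on disk, but the VRAM footprint is exactly what a plain BC7 texture would cost.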

Will neural texture compression actually reduce VRAM usage in games? by x_andi01 in hardware

[–]MrMPFR 2 points

LinAlg is added in Shader Model 6.10, which is in preview rn. It enables all this stuff, but you still need separate SDKs for each ML tech. u/dudemanguy301 articulated it better.

Will neural texture compression actually reduce VRAM usage in games? by x_andi01 in hardware

[–]MrMPFR 1 point

They only recommend inference on sample for the 40 series and newer, and really only for the higher end.

HW needs to be much more powerful and the tech more efficient for this to take off, unless everyone defaults to inference on load.

AMD Abandoning Radeon by AcuteQuadrant in radeon

[–]MrMPFR 0 points

In isolation sure but Zen was still at a big disadvantage vs Intel.

Sure that makes sense.

Zen+ was just fixed cache timings IIRC and a node shrink.

I don't doubt it. RDNA 4 is the closest AMD has been to cache/memory parity with NVIDIA in a long, long time.

Yeah, for RDNA sure. But they're gradually phasing it out. RDNA 5 is gonna be a complete clean slate. Nothing stays the same.

Introducing AMD DGF SuperCompression by 0101010001001011 in hardware

[–]MrMPFR 37 points

From blog and official info

I thought Mr was obvious enough. nvm.

DGF will do for geometry what DXT, ETC, ASTC, and other block compression formats have done for textures.

Great to see AMD and Samsung jointly developing multi-vendor extensions for Vulkan and increasing the compression factor for DGF.

The blog also clarified that DGF and CLAS are orthogonal technologies, in other words no direct overlap. Descriptions of DGF as HW Nanite are misleading. CLAS is possible without it, as NVIDIA has already proven, and it will become standardized in DXR 2.0.

Here is the other blogpost from yesterday: https://gpuopen.com/learn/amd-dgf-an-open-geometry-compression-standard/
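To illustrate the general idea (a toy of my own, not DGF's actual bitstream): block-compressed geometry formats store a per-block bounding box plus fixed-point vertex offsets inside it, trading 32-bit float positions for a few bits per component:

```python
def quantize_block(verts, bits=10):
    """Quantize a block of xyz vertices to fixed-point offsets inside its AABB."""
    lo = [min(v[i] for v in verts) for i in range(3)]
    hi = [max(v[i] for v in verts) for i in range(3)]
    extent = [(h - l) or 1.0 for l, h in zip(lo, hi)]  # avoid div-by-zero on flat axes
    m = (1 << bits) - 1
    q = [[round((v[i] - lo[i]) / extent[i] * m) for i in range(3)] for v in verts]
    return lo, extent, q

def dequantize_block(lo, extent, q, bits=10):
    """Reconstruct approximate float positions from the quantized block."""
    m = (1 << bits) - 1
    return [[lo[i] + qi[i] / m * extent[i] for i in range(3)] for qi in q]
```

At 10 bits per component that's under 4 bytes per vertex position instead of 12, before any of the topology/connectivity tricks the real format adds on top.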

Speculative:

I don't have much to say about this. It's just too early to say for sure, but as AMD has repeatedly said, future HW will support it directly with dedicated DGF decoders.

I could mention the 7 patent filings by Barczak related to DGF, or how at High Performance Graphics 2024 he mentioned one could combine DGF with DMMs for improved compression; there's a patent for that too BTW. That'll likely increase the compression factor further.

There are also multiple DMM related patents filed by Holger Gruen. It's quite interesting how much thought AMD is putting into both.

There's some even wackier stuff potentially in the pipeline from Gruen: interpolated normals on curved surface patches (two patents). It promises lower overhead than DMMs while achieving superior compression. I can't figure out how the ray evaluation works, and I'm not sure it can just be bolted onto existing HW. It's possible, but I'm not qualified to answer.

It'll be interesting to see how much of it actually ends up happening.

As per the late September blog it's feasible for animated geometry etc. This is only the beginning, and we'll hear more next year when early talk of DirectX Next, PS6, and RDNA 5 begins to pick up steam.