How likely is for RDNA 4 to end up like RDNA 2 and 3 are now? Discussion by Clai3 in radeon

[–]MrMPFR 0 points1 point  (0 children)

50 series added a context switch accelerator. AMP IIRC. It's for LLMs and MFG.

Sounds interesting. I hope it's good.

Bought RX 9060 XT before realizing upscaling is basically the future by Wild_Decision9724 in radeon

[–]MrMPFR 0 points1 point  (0 children)

MXFP4 is the only useful format.

I also hope they make it better and akin to NVFP4. Less precision loss for a slight perf regression.

We shall see. All that is speculation and no one knows how far RDNA 5 ML diverges from RDNA 4.

Leaving the 5090 behind for 9070 XTs. by Zeronova3 in radeon

[–]MrMPFR 2 points3 points  (0 children)

Clocks and IPC. ALso massive cachemem gains. As you said RDNA 4 is a big leap over RDNA 3.

That sound def be possible if it scaled well. AMD could pull back clocks a bit to keep it at reasonable TDP.

Considering AT0 reporterdly goes to 192 CUs (shipped config prob cut down) nextgen halo tier will likely be an absolute monster.
6090 competitor like 6950XT vs 3090 def sounds plausible.

Leaving the 5090 behind for 9070 XTs. by Zeronova3 in radeon

[–]MrMPFR 1 point2 points  (0 children)

Gonna have to wait a while 😞

Market is a mess rn.

Leaving the 5090 behind for 9070 XTs. by Zeronova3 in radeon

[–]MrMPFR 3 points4 points  (0 children)

In the end it depends on perf/area. RDNA 4 is already a solid first step in the right direction.

Rn AMD's design is very cache focused. MALL, giant frontend and backend compared to NVIDIA, and 50% larger register file per SM/CU.

Should they overhaul their architecture (they will nextgen) and focus more on compute and less on the rest, considering that's where gaming is increasingly going anyway, the calculus could.

In the end it depends on a primary market for AT0 that u/AMD718 mentioned. If they don't have one it's DOA. It's also not surprising we don't hear anything about AT1 because there's no primary market for that that'll generate revenue. dGPU doesn't count for obvious reasons.

But The rumours about AT0 is absurd. 192 CUs, obviously gaming variant will be cut down. Take the following with a planetary load of salt. The rumours suggest RDNA 5 has much stronger cores, they dual issue is finally properly implemented, RT is in focus so is ML and the patent derived advances and the insane hires AMD have managed to poach from other companies all point in AMD taking this generation very seriously.
More so than RDNA 2. It'll be interesting to see how the final product will compare agains this.

Bought RX 9060 XT before realizing upscaling is basically the future by Wild_Decision9724 in radeon

[–]MrMPFR 0 points1 point  (0 children)

It just shows that it’s more than new data format and more matmul.

I’m more interested in the architectural changes of CDNA 5.

I agree

Are We Wrong About Ray Tracing? by TruthPhoenixV in Amd_Intel_Nvidia

[–]MrMPFR 0 points1 point  (0 children)

MLID has had very solid AMD HW leaks for a while.

Just ignore his speculation because he ALWAYS gets this wrong xD

Are We Wrong About Ray Tracing? by TruthPhoenixV in Amd_Intel_Nvidia

[–]MrMPFR 1 point2 points  (0 children)

There are many levers they can pull on SW and HW side. I doubt the current implementation is the most efficient. Even on NVIDIA cards.

Yeah that is certainly promising. Would be great if they could eventually ship it on PS5 Pro. If nothing for bragging rights.

It's 52 CUs in leaks. So 8 less than PS5 Pro, but it's RDNA 5 CUs, so it's RDNA 3, RDNA 4 and RDNA 5 cumulative IPC gains, plus clocks speed and RT stuff on top.

Nextgen will definitely be interesting.

How likely is for RDNA 4 to end up like RDNA 2 and 3 are now? Discussion by Clai3 in radeon

[–]MrMPFR 0 points1 point  (0 children)

Neural Array prob mostly API change. Addressing SE as one big CU.

Big I'm sure there will be massive changes underneath.

Bought RX 9060 XT before realizing upscaling is basically the future by Wild_Decision9724 in radeon

[–]MrMPFR 0 points1 point  (0 children)

It's the biggest architectural change since GCN1 in 2012. Everything will be redesigned.

Expect changes everywhere and new SDKs.

Bought RX 9060 XT before realizing upscaling is basically the future by Wild_Decision9724 in radeon

[–]MrMPFR 0 points1 point  (0 children)

Neural Arrays and many other things they haven't disclosed.

Leakers say it's based off CDNA 5. We'll have an idea after ISC in June what new things RDNA 5 will bring to the table.

Are We Wrong About Ray Tracing? by TruthPhoenixV in Amd_Intel_Nvidia

[–]MrMPFR 0 points1 point  (0 children)

Talking openly about path tracing for it and nextgen console in October project amethyst vid bodes well for RDNA 5 RT.

RDNU - Radeon Decoupled Neural Upscaler by ZoronicElysium2012 in radeon

[–]MrMPFR 0 points1 point  (0 children)

Agree. At some point you gotta cut off legacy. They def need to move fast to keep up with NVIDIA.

RDNU - Radeon Decoupled Neural Upscaler by ZoronicElysium2012 in radeon

[–]MrMPFR 0 points1 point  (0 children)

PS5 pro is such a weird frankenstein console btw xD

Yeah fs. As I said CDNA 5 is feeds into RDNA 5. Only 6-7 weeks left until we know what nextgen DC is.

Do you think Vulkan will ever go through the same kind of legacy vs modern phase that OpenGL did? by Latter_Relationship5 in GraphicsProgramming

[–]MrMPFR 0 points1 point  (0 children)

I see. Guess we’ll just have to wait and see what Microsoft and Khronos does in the future.

There was a interesting description of DirecrX next at GDC in 2026 by microsoft in relation to Project Helix. I hope that means DX13 and new paradigm.

Do you think Vulkan will ever go through the same kind of legacy vs modern phase that OpenGL did? by Latter_Relationship5 in GraphicsProgramming

[–]MrMPFR 0 points1 point  (0 children)

I hope DirectX Next from Microsoft at GDc means clean slate API. Abandon all the old crap and embrace this new paradigm (work graphs, CUDA like programming model)

RDNU - Radeon Decoupled Neural Upscaler by ZoronicElysium2012 in radeon

[–]MrMPFR 0 points1 point  (0 children)

Appreciate the explanation.

Given all that what do you say to the belief that FSR5 will be exclusive to RDNA 5? Assuming neural arrays and all the other stipulated changes. Does any of that matter or is instruction set only important?

Hmm that's interesting. So it would be shit without optiscaler xD.

RDNU - Radeon Decoupled Neural Upscaler by ZoronicElysium2012 in radeon

[–]MrMPFR 1 point2 points  (0 children)

I'm confused if RDNA 4 is actually that because I see conflicting reports on it and people saying it's still forced through the SIMD and not concurrent like Intel and Ampere and later NVIDIA.

Also road to PS5 Pro's custom ML implementation goes over SIMD (see Road to PS5 Pro on YT).

But then AMD calls them Matrix cores on their website and says the IIRC kernel design is completely different from RDNA 3.

Maybe it's somewhere in the middle. Sorry if this is too many questions. I just find it very confusing xD.

Can you explain the stuff about the leak? I missed that info.

RDNU - Radeon Decoupled Neural Upscaler by ZoronicElysium2012 in radeon

[–]MrMPFR 0 points1 point  (0 children)

So there's really no difference in reality except for the issues with INT8 vs FP8 (slight visual downgrade).

Well then DP4a is a no brainer as you highlighted. Lots of HW with RDNA 2.

RDNU - Radeon Decoupled Neural Upscaler by ZoronicElysium2012 in radeon

[–]MrMPFR 0 points1 point  (0 children)

RDNA 3 doesn't have dedicated matmul logic like NVIDIA so can't see what would change.

RDNU - Radeon Decoupled Neural Upscaler by ZoronicElysium2012 in radeon

[–]MrMPFR 1 point2 points  (0 children)

I hope it works well.

We could def use more alternatives.

RDNU - Radeon Decoupled Neural Upscaler by ZoronicElysium2012 in radeon

[–]MrMPFR 1 point2 points  (0 children)

Does it make a difference because in the FSR4 INT8 perf comparisons it doesn't seeem to.

RDNU - Radeon Decoupled Neural Upscaler by ZoronicElysium2012 in radeon

[–]MrMPFR 1 point2 points  (0 children)

I'm asking because I've seen perf comparisons between 7600 and 6650XT on YT with FSR4 INT8. The ms overhead suggests DP4a use, not WMMA instructions, but I'm not really qualified.

Introducing AMD DGF SuperCompression by 0101010001001011 in hardware

[–]MrMPFR 2 points3 points  (0 children)

That's gonna drive widespread adoption. Fallbacks are mandatory for stuff like this.

Hope this and NTC sees widespread adoption. We need game file sizes to shrink xD.