Support NeoGAF

Wolzard · Aug 27, 2025

GPU	Shader Arrays (SA)	Shader Engines (SE)	Compute Units (CU)	Memory Controller (UMC)
AT0 GPU	8	16 (2 per SA)	96 (6 per SE)	16 (32-bit?)
AT0 GPU	4	8 (2 per SA)	40 (5 per SE)	6 (32-bit?)
AT3 GPU	2	4 (2 per SA)	24 (6 per SE)	8 (16-bit?)
AT4 GPU	1	2 (2 per SA)	12 (6 per SE)	4 (16-bit?)

AT0 featuring 96 Compute Units, organized into 8 Shader Arrays. Each Shader Array would contain 16 Shader Engines, with every Shader Engine including 6 Compute Units.

AT2 GPU is said to feature 40 Compute Units, with each Shader Engine containing 5 CUs. Its UMC block (memory controller) is estimated at six, suggesting a 192-bit memory bus, while AT0 is expected to use a 512-bit bus.

AT3, meanwhile, is rumored to have 24 Compute Units. Interestingly, it would have more UMCs than AT2 which may come from AT3 and AT4 being designed with LPDDR5X memory, requiring more controllers. The AT4 could have 12 CUs, with each memory controller on these chips believed to be 16-bit wide.

https://videocardz.com/newz/amd-rdn...-gpu-with-512-bit-memory-bus-96-compute-units

Tobimacoss · Aug 27, 2025

What happened to the 152 CU AT0?

Trogdor1123 · Aug 27, 2025

Could someone explain what this means?

Mr Moose · Aug 27, 2025

Is the AT3 a laptop or handheld or something?

Skifi28 · Aug 27, 2025

Trogdor1123 said:
Could someone explain what this means?

Value for the shareholders.

FireFly · Aug 27, 2025

Trogdor1123 said:
Could someone explain what this means?

Probably closer to 4090 than 5090 performance.

TheRedRiders · Aug 27, 2025

AT2 is very similar to what we're hearing about the PS6

Tobimacoss · Aug 27, 2025

Mr Moose said:
Is the AT3 a laptop or handheld or something?

Yes, AT3 and AT4 are Medusa Point, 8-24 CUs and Medusa Point Halo is 48 CUs. They're for Laptops and Handhelds. And the Halo is for cheap gaming PCs or Mini PCs.

The Mad Draklor · Aug 27, 2025

Mr Moose said:
Is the AT3 a laptop or handheld or something?

AT4 looks to be for thin laptop/handheld as that has 12CUs.

Loxus · Aug 27, 2025

Kepler called it a "schizo post".
I don't know what he means by that.

And Videocardz has it mixed up.
It's 2 Shader Arrays per Shader Engine and 6 CUs per Shader Array.

TheThreadsThatBindUs · Aug 27, 2025

FireFly said:
Probably closer to 4090 than 5090 performance.

Without clockspeed and architectural info. it's not possible to derive overall performance from this.

FireFly · Aug 27, 2025

TheThreadsThatBindUs said:
Without clockspeed and architectural info. it's not possible to derive overall performance from this.

Ok, to put it another way: An RDNA 4 class card with these specifications would be expected to deliver around 4090 level performance. So that's the baseline.

TheThreadsThatBindUs · Aug 27, 2025

FireFly said:
Ok, to put it another way: An RDNA 4 class card with these specifications would be expected to deliver around 4090 level performance. So that's the baseline.

Again, without clockspeed you can't even say that.

BlazeEcho97 · Aug 27, 2025

Finally,a real high end competitor of nvda rtx 90 interesting it will compete more to 60 series than 50?

Sanepar · Aug 27, 2025

It is really weird to have a 96 cu gpu and nothing and then a 40 cu gpu. They should have a 84, 72, 64 gpus at least.

Loxus · Aug 27, 2025

Tobimacoss said:
What happened to the 152 CU AT0?

After reading through AnandTech forum RDNA 5 / UDNA (CDNA Next) speculation thread.

I saw a post that says, "1 RDNA 4 WGP == 1 RDNA 5 CU"

So it seems that the CU in the diagram represents a WGP but in RDNA5 the WGP (the 2 CUs) are probably now one CU that can act as 1 CU or 2 CUs.

That's how I see it now being the case based on AT2. RDNA always pair the CUs in a WGP, AT2 is uneven with 5 CUs.

I wonder if K KeplerL2 can share some more on RDNA5 CUs.

Black_Stride · Aug 27, 2025

Sanepar said:
It is really weird to have a 96 cu gpu and nothing and then a 40 cu gpu. They should have a 84, 72, 64 gpus at least.

Flagship to prove they can.

Mainstream to actually make money.
9070 is what 56 CU?
9060XT 32CU.

cinnamonandgravy · Aug 27, 2025

512bus is sexy

AMD abandoning HBM on its consumer side due to latency?
was cool seeing a 4096bus

FireFly · Aug 27, 2025

TheThreadsThatBindUs said:
Again, without clockspeed you can't even say that.

Kepler said he doesn't expect Magnus to be clocked below 3 GHz, and that is a 68 CU part that is likely power constrained. In addition, RDNA 2 and RDNA 3 saw a clock speed variance of ~10% or less between the high and low end parts, and we would expect roughly the same clock speed boost for going from N4P to N3P. So even in the worst case, we would expect clock speeds to be around where the 9070 XT already is, assuming the product is not heavily power constrained.

Now you're probably going to say that the architectural changes with RDNA 5 may make the chip clock lower, which is possible but I don't recall ever happening to any significant degree on AMD cards. The rumoured IPC boost is only 5%-10%, which would be wiped out by significantly slower clocks.

kiphalfton · Aug 27, 2025

Until AMD actually beats Nvidia, in the same price bracket, I see no point buying their products.

HerjansEagleFeeder · Aug 27, 2025

kiphalfton said:
Until AMD actually beats Nvidia, in the same price bracket, I see no point buying their products.

Most valuable post in the whole thread right here folks. We can all pack it up

Sanepar · Aug 27, 2025

kiphalfton said:
Until AMD actually beats Nvidia, in the same price bracket, I see no point buying their products.

Well so it already time! 9070 xt performs 5% slow than 5070 ti but costs 20% less.

Insane Metal · Aug 27, 2025

512 bits? Finally?

Jinzo Prime · Aug 27, 2025

Tobimacoss said:
What happened to the 152 CU AT0?

96 x 2 = 192

Jinzo Prime · Aug 27, 2025

Sanepar said:
It is really weird to have a 96 cu gpu and nothing and then a 40 cu gpu. They should have a 84, 72, 64 gpus at least.

The chart goes from AT0 to AT2, do there is a missing AT1.

Edit: If they don't want to do a unique die for a 80 class card, they could just do a cut down of AT0 as well. The CUs are arranged in groups of 12, so a cut down version would presumably have CUs that are a multiple of 12 as well. 72 (12x6) would be perfect, but probably too good.

TheThreadsThatBindUs · Aug 28, 2025

FireFly said:
Kepler said he doesn't expect Magnus to be clocked below 3 GHz, and that is a 68 CU part that is likely power constrained. In addition, RDNA 2 and RDNA 3 saw a clock speed variance of ~10% or less between the high and low end parts, and we would expect roughly the same clock speed boost for going from N4P to N3P. So even in the worst case, we would expect clock speeds to be around where the 9070 XT already is, assuming the product is not heavily power constrained.

Now you're probably going to say that the architectural changes with RDNA 5 may make the chip clock lower, which is possible but I don't recall ever happening to any significant degree on AMD cards. The rumoured IPC boost is only 5%-10%, which would be wiped out by significantly slower clocks.

I just wanted you to be explicit about your assumptions. Your take is much more reasonable when fleshed out the way you have here. Thanks.

Unknown Soldier · Aug 28, 2025

Someone needs to make an AMD to Nvidia CU calculator so I can understand what "96 CU's" means in terms of Nvidia performance

KeplerL2 · Aug 28, 2025

Loxus said:
After reading through AnandTech forum RDNA 5 / UDNA (CDNA Next) speculation thread.

I saw a post that says, "1 RDNA 4 WGP == 1 RDNA 5 CU"

So it seems that the CU in the diagram represents a WGP but in RDNA5 the WGP (the 2 CUs) are probably now one CU that can act as 1 CU or 2 CUs.

That's how I see it now being the case based on AT2. RDNA always pair the CUs in a WGP, AT2 is uneven with 5 CUs.

I wonder if K KeplerL2 can share some more on RDNA5 CUs.

MI400 deprecated CU/WGP distinction, now ony CU mode is supported with WGP-sized structures.

KeplerL2 · Aug 28, 2025

Unknown Soldier said:
Someone needs to make an AMD to Nvidia CU calculator so I can understand what "96 CU's" means in terms of Nvidia performance

It's GB202 tier in terms of silicon

Gameplay Gods Bless · Aug 28, 2025

Trogdor1123 said:
Could someone explain what this means?

It means RDNA5 is going to be fast as fuck at least on the middle end and high end.

Gameplay Gods Bless · Aug 28, 2025

Sanepar said:
It is really weird to have a 96 cu gpu and nothing and then a 40 cu gpu. They should have a 84, 72, 64 gpus at least.

They already have 4 chips, that's double what they have now. They're probably very confident on the 40 cu part being capable of competing with the 6070. The 96cu part will be cut down in 3 ways so that's 3 cards from 1 chip like the 7900xtx.

Gameplay Gods Bless · Aug 28, 2025

cinnamonandgravy said:
512bus is sexy

AMD abandoning HBM on its consumer side due to latency?
was cool seeing a 4096bus

They abandoned it due to cost. Nvidia bested them even with shittier GDDR memory.

Gameplay Gods Bless · Aug 28, 2025

kiphalfton said:
Until AMD actually beats Nvidia, in the same price bracket, I see no point buying their products.

Console players love buying AMD tons of PS4s, Xbox Ones, PS5s, Xbox Series X/S and PS5 Pro that have been bought. No one loves AMD like console players.

Gameplay Gods Bless · Aug 28, 2025

Unknown Soldier said:
Someone needs to make an AMD to Nvidia CU calculator so I can understand what "96 CU's" means in terms of Nvidia performance

An AMD CU is equivalent to an Nvidia SM.

Gameplay Gods Bless · Aug 28, 2025

KeplerL2 said:
MI400 deprecated CU/WGP distinction, now ony CU mode is supported with WGP-sized structures.

Is RDNA5 finally bringing multiple GCD chiplets on the same GPU?

pacman4000 · Aug 28, 2025

KeplerL2 said:
It's GB202 tier in terms of silicon

That's one big motherf...

KeplerL2 · Aug 28, 2025

Gameplay Gods Bless said:
Is RDNA5 finally bringing multiple GCD chiplets on the same GPU?

Nope

Loxus · Aug 28, 2025

KeplerL2 said:
MI400 deprecated CU/WGP distinction, now ony CU mode is supported with WGP-sized structures.

Is this only for CDNA5 or is this also applied to RDNA5.

Just wondering cause MLiD doubled down on his leak.

I would assume both is correct. AT0 full die being 96/192 if we take into account the CU/WGP distinction.

kiphalfton · Aug 28, 2025

Sanepar said:
Well so it already time! 9070 xt performs 5% slow than 5070 ti but costs 20% less.

*In raster

Also, I didn't realize $700 (9070 XT) is 20% less than $750 (5070 Ti).

Jinzo Prime · Aug 28, 2025

Gameplay Gods Bless said:
They already have 4 chips, that's double what they have now. They're probably very confident on the 40 cu part being capable of competing with the 6070. The 96cu part will be cut down in 3 ways so that's 3 cards from 1 chip like the 7900xtx.

I'm interested in knowing what ways the AT0 chip will be cut down. I guessed that they could do a 60 cu card or a 72 cu card, but I have no real idea.

KungFucius · Sep 1, 2025

kiphalfton said:
Until AMD actually beats Nvidia, in the same price bracket, I see no point buying their products.

Huh? They usually do outside of RT. They just haven't had anything worthy of the top price brackets.

YeulEmeralda · Sep 1, 2025

kiphalfton said:
*In raster

Also, I didn't realize $700 (9070 XT) is 20% less than $750 (5070 Ti).

There's a reason why Jensen constantly talks about AI. Raster is irrelevant everything is about upscaling now.

Support NeoGAF

AMD RDNA5 rumors point to AT0 flagship GPU with 512-bit memory bus, 96 Compute Units

Member

Member

Member

Member

Member

Member

Member

Member

Member

Member

Member

Member

Member

Neo Member

Member

Member

do not tempt fate do not contrain Wonder Woman's thighs do not do not

Member

Member

Member

Member

Member

Member

Member

Member

Member

Member

Member

Member

Member

Member

Member

Member

Member

Member

Member

Member

Member

Member

Member

King Snowflake

Linux User

Similar threads