I totally forgot about mesh shading. 3DMark's benchmark suggests insane performance gains, so I wonder if this feature has been implemented in any games already?
It's definitely in the Matrix demo, at least parts of it. But I'm not sure if any commercial games are currently using mesh shaders (or primitive shaders, for that matter). Games are probably still running exclusively on the fixed-function graphics pipeline, since most of them have been cross-gen so far.
AV1 is taking so long. There will be some great new possibilities once it's out.
rtx-30-series-av1-decoding (www.nvidia.com)
"We are working with Twitch on the next generation of game streaming.
AV1 will enable Twitch viewers to watch at up to 1440p 120 FPS at 8mbps" or greatly improved iq but still 60 fps.
Oh, that's gonna be good. This should also hopefully trickle down to lower resolutions and framerates too. (I usually set Twitch streams to a low bitrate unless there are certain moments I'm actually watching more attentively, in which case I might jump the resolution up to the source. Otherwise I treat them like audio podcasts.)
Hopefully Twitch also changes the audio bitrate at lower resolutions; just handle audio and video on two different encode paths, like YouTube does.
Sorry for the noob question, but how does memory bandwidth work? For Nvidia they'll quote a number like 9500 MHz, and for AMD it'll be 2000 MHz. Then they say AMD is 16 Gbps and Nvidia is 19 Gbps, but then they talk about bandwidth being up to 1 TB/s.
Personally I don't even look at the memory controller clocks when it comes to GPUs, just the bus width and the module bandwidth.
If you have, say, a 14 Gbps (gigabits per second) GDDR6 module, like the current-gen systems do, then each data I/O pin on the module can transfer at 14 Gbps, or 1.75 GB (gigabytes) per second (divide any amount in bits by eight to get the amount in bytes; there are eight bits in a byte). Then multiply that by the number of data I/O pins; GDDR6 modules are 32-bit, so they have 32 data I/O pins. That's how you get 56 GB/s of module bandwidth.
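If it helps to see the arithmetic laid out, here's a quick Python sketch of that per-module calculation (just the 14 Gbps / 32-bit example from above, nothing vendor-specific):

```python
# Per-module bandwidth for a 14 Gbps GDDR6 module (the example above).
pin_rate_gbps = 14        # each data I/O pin transfers 14 gigabits per second
data_pins = 32            # GDDR6 modules have a 32-bit data interface

module_bandwidth_gbps = pin_rate_gbps * data_pins   # total gigabits per second
module_bandwidth_gbs = module_bandwidth_gbps / 8    # divide by 8 to get gigabytes per second

print(module_bandwidth_gbs)  # 56.0 GB/s per module
```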
Then look at the bus width; it's also given in bits. The PS5 has a 256-bit GDDR6 memory bus, and since it isn't using clamshell mode, each module runs at its full interface width (32-bit, vs. 16-bit in clamshell configurations). That means you can put eight 32-bit modules on the bus. So multiply the module bandwidth by the bus width divided by the module width (in this case 256/32 = 8) and you get 448 GB/s, the GDDR6 bandwidth of the PS5.
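Same thing for the whole bus, as a sketch (numbers are the PS5 example above: 256-bit bus, 14 Gbps modules, no clamshell):

```python
# Total bandwidth for a 256-bit GDDR6 bus populated with 14 Gbps modules.
bus_width_bits = 256
module_width_bits = 32          # full-width modules, i.e. no clamshell mode
module_bandwidth_gbs = 56       # from the previous step (14 Gbps * 32 pins / 8)

modules_on_bus = bus_width_bits // module_width_bits       # 256 / 32 = 8 modules
total_bandwidth_gbs = module_bandwidth_gbs * modules_on_bus

print(modules_on_bus, total_bandwidth_gbs)  # 8 modules, 448 GB/s
```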
You can use that same method to figure out Series X, Series S, and pretty much any other modern GPU. HBM designs are different: they have a lot more data I/O lanes (128 vs. 32, so they are 128-bit memory devices instead of 32-bit), and they are designed for stacking via TSVs (through-silicon vias), typically in 4-Hi, 8-Hi, 12-Hi and (supposedly, for HBM3) 16-Hi stacks. The stack height tells you how many dies are in the stack; multiply the capacity per die by the number of dies to get the total capacity per stack. You can also multiply the per-die data I/O width by the number of dies in the stack to work out the bus width of the stack.
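As a rough sketch of that per-stack math, following the per-die rule of thumb above (the 8-Hi height and the 2 GB-per-die capacity here are just placeholder numbers to show the method, not a specific HBM product):

```python
# HBM stack capacity and bus width, using the per-die rule of thumb above.
io_bits_per_die = 128       # HBM dies expose 128 data I/O bits each
dies_in_stack = 8           # an "8-Hi" stack (placeholder height)
capacity_per_die_gb = 2     # assumed example capacity per die

stack_bus_width_bits = io_bits_per_die * dies_in_stack    # 1024-bit stack interface
stack_capacity_gb = capacity_per_die_gb * dies_in_stack   # 16 GB for the whole stack

print(stack_bus_width_bits, stack_capacity_gb)
```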
For figuring per-module bandwidth you use the same method as for GDDR memories. DDR system RAM is a bit different; you want to use the module speed, usually expressed in MHz (some also express it in MT/s, or megatransfers per second). For example, DDR4-3200 runs at 3200 MT/s; multiply 3200 by 64 (the number of data I/O bits for DDR memories), then divide that amount by 8 (to convert bits to bytes) and you get 25,600 MB/s, or 25.6 GB/s. Some people split the 3200 in half since the memory clock is technically 1600 MHz and it's doubled due to the way DDR works, but if you already know that you can skip that step.
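And the DDR4-3200 example from above, same idea:

```python
# Bandwidth of a single DDR4-3200 channel.
transfer_rate_mts = 3200    # 3200 MT/s (the memory clock is 1600 MHz, doubled by DDR)
bus_width_bits = 64         # standard DDR data bus width per channel

bandwidth_mbs = transfer_rate_mts * bus_width_bits / 8   # 25,600 MB/s
print(bandwidth_mbs / 1000)                              # ~25.6 GB/s
```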
Since all the parts of this are Apple's, and granted it's ARM-based, can Intel and AMD do the same thing on x86 to knock out NVIDIA? I've got a strong feeling it's headed this way.....
They're already trying to do that xD. Take a look at AMD's MI100 designs; that's an indication of where RDNA 4 and especially RDNA 5 will go design-wise. Intel already has Ponte Vecchio and their own multi-chip designs going.
What I'm more interested in is whether (or more likely when) AMD, Intel and Nvidia move away from GDDR for mainstream GPUs and start using HBM. And I'm especially curious whether any of them design future GPUs around HBM-PIM technologies, because that would probably represent another paradigm shift IMHO (and 10th-gen consoles would benefit a ton from it as well).