Support NeoGAF

Quantum253 · Mar 18, 2024

Nvidia and the future of AI

Blackwell Innovations to Fuel Accelerated Computing and Generative AI
Blackwell's six revolutionary technologies, which together enable AI training and real-time LLM inference for models scaling up to 10 trillion parameters, include:

World's Most Powerful Chip — Packed with 208 billion transistors, Blackwell-architecture GPUs are manufactured using a custom-built 4NP TSMC process with two-reticle limit GPU dies connected by 10 TB/second chip-to-chip link into a single, unified GPU.
Second-Generation Transformer Engine — Fueled by new micro-tensor scaling support and NVIDIA's advanced dynamic range management algorithms integrated into NVIDIA TensorRT™-LLM and NeMo Megatron frameworks, Blackwell will support double the compute and model sizes with new 4-bit floating point AI inference capabilities.
Fifth-Generation NVLink — To accelerate performance for multitrillion-parameter and mixture-of-experts AI models, the latest iteration of NVIDIA NVLink® delivers groundbreaking 1.8TB/s bidirectional throughput per GPU, ensuring seamless high-speed communication among up to 576 GPUs for the most complex LLMs.
RAS Engine — Blackwell-powered GPUs include a dedicated engine for reliability, availability and serviceability. Additionally, the Blackwell architecture adds capabilities at the chip level to utilize AI-based preventative maintenance to run diagnostics and forecast reliability issues. This maximizes system uptime and improves resiliency for massive-scale AI deployments to run uninterrupted for weeks or even months at a time and to reduce operating costs.
Secure AI — Advanced confidential computing capabilities protect AI models and customer data without compromising performance, with support for new native interface encryption protocols, which are critical for privacy-sensitive industries like healthcare and financial services.
Decompression Engine — A dedicated decompression engine supports the latest formats, accelerating database queries to deliver the highest performance in data analytics and data science. In the coming years, data processing, on which companies spend tens of billions of dollars annually, will be increasingly GPU-accelerated.

https://nvidianews.nvidia.com/news/nvidia-blackwell-platform-arrives-to-power-a-new-era-of-computing
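
For a sense of scale, here is a rough back-of-the-envelope sketch of how much memory the weights alone would take for the "10 trillion parameters" figure quoted above, at the precisions mentioned in this thread. It assumes every weight is stored at the stated bit width with no scaling metadata, activations, or KV cache. The function name and the table are made up for illustration and are not taken from NVIDIA's materials.

```python
# Bits per weight for each format named in the thread (FP4 is the new one in the quote).
BITS_PER_WEIGHT = {"FP16": 16, "FP8": 8, "FP6": 6, "FP4": 4}


def weight_terabytes(num_params: int, fmt: str) -> float:
    """Storage for the weights alone, in decimal terabytes (1 TB = 1e12 bytes).

    Assumption: no per-block scale factors or other overhead are counted.
    """
    return num_params * BITS_PER_WEIGHT[fmt] / 8 / 1e12


params = 10 * 10**12  # the "10 trillion parameters" upper bound from the quote
for fmt in BITS_PER_WEIGHT:
    print(f"{fmt}: {weight_terabytes(params, fmt):.1f} TB")
```

Under these assumptions, halving the bit width halves the weight footprint, which is roughly what the "double the ... model sizes" claim for 4-bit inference amounts to.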

S0ULZB0URNE · Mar 18, 2024

Blackwell

OverHeat · Mar 18, 2024

ChorizoPicozo · Mar 18, 2024

Will Jensen be high on crypto-AI crack while giving his speech?

DenchDeckard · Mar 18, 2024

Skifi28 · Mar 18, 2024

The entire keynote will be AI generated on a 5090.

Audiophile · Mar 18, 2024

I for one welcome our chiplet future!

S0ULZB0URNE · Mar 18, 2024

Whip out the 5090 and I'll whip out my......

Credit card

Topher · Mar 18, 2024

Mister Wolf · Mar 18, 2024

OverHeat · Mar 18, 2024

S0ULZB0URNE said:
Whip out the 5090 and I'll whip out my......

Credit card

Hell yeah credit card in one hand dick in the other lol

Kuranghi · Mar 18, 2024

I do love a Jensen presentation regardless of content. He's such a silly billy.

S0ULZB0URNE · Mar 18, 2024

OverHeat said:
Hell yeah credit card in one hand dick in the other lol

Kuranghi · Mar 18, 2024

Should I watch the presentation now or after I take hallucinogens tonight?

OverHeat said:
Hell yeah credit card in one hand dick in the other lol

Absolutely stunning rendered and upscaled cocks all around you. That's the dream.

Skifi28 · Mar 18, 2024

Kuranghi said:
Should I watch the presentation now or after I take hallucinogens tonight?

Absolutely stunning rendered and upscaled cocks all around you. That's the dream.

Cocks with DLSS4 are better than real cocks.

StreetsofBeige · Mar 18, 2024

I'm watching it now on the side. Even though I don't understand what he's saying, you can at least tell from his background that he understands the tech. So he's a grassroots techie who became CEO and can talk about this shit 24/7.

On the other hand, look how many numbnut execs in the world get hired, and you can tell when they speak they don't even know their own product lines.

Audiophile · Mar 18, 2024

Fun fact: Jensen & Lisa Su are first cousins once removed.

James Sawyer Ford · Mar 18, 2024

How many hope and dream unicorn buzzwords can they repeat to juice the stock price even further?

StreetsofBeige · Mar 18, 2024

Audiophile said:
Fun fact: Jensen & Lisa Su are first cousins once removed.

Fun fact 2: I never knew this until I checked his bio a few years ago. I thought Jensen was a totally made-up name, but it's really just a modernized spelling of his real name, Jen-Hsun.

FalconPunch · Mar 18, 2024

What's with Jensen comparing unlike things? FP16 to FP8 to FP6? Wtf?
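
One way to see why cross-precision comparisons are apples to oranges is to split a headline speedup into the part that comes from dropping to a narrower number format and the part that is left over. The sketch below assumes peak throughput scales inversely with bit width, which is a simplification. The throughput numbers in the usage line are hypothetical placeholders, not figures from the keynote.

```python
def like_for_like_speedup(old_tput: float, old_bits: int,
                          new_tput: float, new_bits: int) -> tuple[float, float]:
    """Return (headline_speedup, precision_adjusted_speedup).

    Assumption: halving the bit width doubles peak throughput on its own,
    so that factor is divided out of the headline number.
    """
    headline = new_tput / old_tput
    precision_factor = old_bits / new_bits  # e.g. 8-bit -> 4-bit gives 2.0
    return headline, headline / precision_factor


# Hypothetical example: old chip quoted at 4 units in 8-bit, new chip at 20 units in 4-bit.
headline, adjusted = like_for_like_speedup(4.0, 8, 20.0, 4)
print(f"headline: {headline:.1f}x, same-precision estimate: {adjusted:.1f}x")
```

With those placeholder inputs, half of the headline ratio comes from the format change alone.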

Little Chicken · Mar 18, 2024

I thought he was having a stroke on stage just then, but it was just a series of bad jokes not landing.

StreetsofBeige · Mar 18, 2024

Little Chicken said:
I thought he was having a stroke on stage just then, but it was just a series of bad jokes not landing.

I don't know how legit or BS all his gabbing and slides are (as James Sawyer Ford said above, a lot of buzzwords), but I'm enjoying his stage show. He's pretty engaging, but so far his jokes are lousy. Great tech storyteller. Not great comedy storyteller.

roosnam1980 · Mar 18, 2024

Little Chicken said:
I thought he was having a stroke on stage just then, but it was just a series of bad jokes not landing.

dude is cringe

iHaunter · Mar 18, 2024

AI ain't gonna do shit. Need 30+ years for it to do what he's claiming. Just trying to sit at the top of the AI meme train.

Quantum253 · Mar 18, 2024

It's going to be interesting to see GPUs with cores and how game streaming starts to change as AI-driven backend computing brings transfer rates down to manageable levels.

Coulomb_Barrier · Mar 18, 2024

Again, this is not actually him standing on stage, nor is it human. It's an AI-generated image of Jensen; you can tell by the hair.

LordOfChaos · Mar 18, 2024

Jen-Hsun is quite a funny presenter, got a lot better at it over the years. There's one CEO I don't have to worry about going on ketamine trips and ranting about trans kids on twitter.

GHG · Mar 18, 2024

Coulomb_Barrier said:
Again, this is not actually him standing on stage, nor is it human. It's an AI-generated image of Jensen; you can tell by the hair.

He's standing up on stage in front of a live audience.

Quantum253 · Mar 18, 2024

The cat theory reminds me of how to generate images (however dense/big you want) with limited data needed. Or the need to fully render environments, worlds, etc. it could be exponentially increased.

Darkkahn · Mar 18, 2024

YES, FINALLY! EARTH 2!

Coulomb_Barrier · Mar 18, 2024

GHG said:
He's standing up on stage in front of a live audience.

The crowd is AI generated too, with cuts of real video

ResetEraVetVIP · Mar 18, 2024

Coulomb_Barrier said:
The crowd is AI generated too, with cuts of real video

Lmao

willothedog · Mar 18, 2024

lobot master race incoming

Bernoulli · Mar 18, 2024

explains why chinese have good and cheap cars

Bernoulli · Mar 18, 2024

Their software is well marketed but AMD is going to bring more powerful hardware again at a fraction of the cost

Quantum253 · Mar 18, 2024

A matter of time before kids get their own BDS robots to hang out with

ThisIsMyDog · Mar 18, 2024

BattleScar · Mar 18, 2024

Massive waste of time this.

Darkkahn · Mar 18, 2024

Is there an AI that predicts if I can keep my job for the next 5 years?

FalconPunch · Mar 18, 2024

Jensen promised a lot of things, but what's the delivery timescale?

Quantum253 · Mar 18, 2024

It's going to be interesting to see how Blackwell is integrated into chipsets, hardware, and software development cycles.

cinnamonandgravy · Mar 18, 2024

Bernoulli said:
Their software is well marketed but AMD is going to bring more powerful hardware again at a fraction of the cost

A.Romero · Mar 18, 2024

What he showed today is pretty crazy. Really. The part that impressed me the most was Blackwell's spine having an equivalent of all of the Internet's aggregate bandwidth. It's crazy.

Quantum253 · Mar 18, 2024

BattleScar said:
Massive waste of time this.

I was hoping for some innovation in graphics and how ML models can be used in upscaling/generational development

HRK69 · Mar 18, 2024

I hate the word "workflow" now.

OverHeat · Mar 18, 2024

Bernoulli said:
Their software is well marketed but AMD is going to bring more powerful hardware again at a fraction of the cost

Quantum253 · Mar 18, 2024

A.Romero said:
What he showed today is pretty crazy. Really. The part that impressed me the most was Blackwell's spine having an equivalent of all of the Internet's aggregate bandwidth. It's crazy.

Absolutely. From a business perspective, this is going to be massive. Everything will be run through models and have some type of AI-driven aspect. From the outside, there didn't seem to be much there, but everything we love in the games industry will be transformed by this tech, and it will govern the next consoles, GPUs, game development, etc.

Bernoulli · Mar 18, 2024

OverHeat said:

AMD's next-gen Instinct MI400X will take the AI GPU battle directly to NVIDIA and its next-gen Blackwell B100 AI GPU, where we should expect major upgrades in AI performance from both new AI accelerators. HBM3e memory offers a 50% increase in speeds over HBM3, with up to 10TB/sec of memory bandwidth per system and 5TB/sec of memory bandwidth per chip, with memory capacities of up to 141GB HBM3e memory per GPU.

However, AMD's upcoming Instinct MI300 refresh will be a refreshed fighter against H200 and B100 from NVIDIA. Kepler says: "there will (be) an MI300 refresh with HBM3e. Also B100 is expensive as, so MI300 still has an advantage in cost".

Read more: https://www.tweaktown.com/news/9642...ed-in-2025-mi300-refresh-the-works/index.html

Quantum253 · Mar 18, 2024

Bernoulli said:
AMD's next-gen Instinct MI400X will take the AI GPU battle directly to NVIDIA and its next-gen Blackwell B100 AI GPU, where we should expect major upgrades in AI performance from both new AI accelerators. HBM3e memory offers a 50% increase in speeds over HBM3, with up to 10TB/sec of memory bandwidth per system and 5TB/sec of memory bandwidth per chip, with memory capacities of up to 141GB HBM3e memory per GPU.

However, AMD's upcoming Instinct MI300 refresh will be a refreshed fighter against H200 and B100 from NVIDIA. Kepler says: "there will (be) an MI300 refresh with HBM3e. Also B100 is expensive as, so MI300 still has an advantage in cost".

Read more: https://www.tweaktown.com/news/9642...ed-in-2025-mi300-refresh-the-works/index.html

That's pretty impressive as well. The next 5-10 years are going to be crazy. NVIDIA is about to do for AI what Microsoft did for the operating system. AMD is running like Apple, so we're about to have the Apple/Microsoft showdown again, but this time with NVIDIA and AMD. The fact that Blackwell joins two dies into a single, unified GPU over a 10 TB/second chip-to-chip link is crazy to me.

Feel Like I'm On 42 · Mar 18, 2024

AMD should be embarrassed by how far Nvidia has advanced beyond them in technology.


NVIDIA Transformative Moment in AI - Live
