NVIDIA Launches Blackwell-Powered DGX SuperPOD for Generative AI Supercomputing at Trillion-Parameter Scale

NVIDIA today announced its next-generation AI supercomputer — the NVIDIA DGX SuperPOD powered by NVIDIA GB200 Grace Blackwell Superchips — for processing trillion-parameter models with constant uptime for superscale generative AI training and inference workloads.

Featuring a new, highly efficient, liquid-cooled rack-scale architecture, the new DGX SuperPOD is built with NVIDIA DGX™ GB200 systems and provides 11.5 exaflops of AI supercomputing at FP4 precision and 240 terabytes of fast memory — scaling to more with additional racks.

Each DGX GB200 system features 36 NVIDIA GB200 Superchips — which include 36 NVIDIA Grace CPUs and 72 NVIDIA Blackwell GPUs — connected as one supercomputer via fifth-generation NVIDIA NVLink®. GB200 Superchips deliver up to a 30x performance increase compared to the NVIDIA H100 Tensor Core GPU for large language model inference workloads.

“NVIDIA DGX AI supercomputers are the factories of the AI industrial revolution,” said Jensen Huang, founder and CEO of NVIDIA. “The new DGX SuperPOD combines the latest advancements in NVIDIA accelerated computing, networking and software to enable every company, industry and country to refine and generate their own AI.”

The Grace Blackwell-powered DGX SuperPOD features eight or more DGX GB200 systems and can scale to tens of thousands of GB200 Superchips connected via NVIDIA Quantum InfiniBand. For a massive shared memory space to power next-generation AI models, customers can deploy a configuration that links the 576 Blackwell GPUs in eight DGX GB200 systems via NVLink.
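The counts quoted above follow from simple multiplication. The short Python sketch below reproduces them; the constant and function names are illustrative only and are not part of any NVIDIA software, and the per-Superchip ratios are inferred from the stated totals of 36 Grace CPUs and 72 Blackwell GPUs across 36 Superchips.

```python
# Figures taken from the announcement; names are hypothetical, for illustration.
SUPERCHIPS_PER_SYSTEM = 36          # GB200 Superchips in one DGX GB200 system
GRACE_CPUS_PER_SUPERCHIP = 1        # inferred: 36 CPUs across 36 Superchips
BLACKWELL_GPUS_PER_SUPERCHIP = 2    # inferred: 72 GPUs across 36 Superchips


def gpu_count(num_systems: int) -> int:
    """Blackwell GPUs in the given number of DGX GB200 systems."""
    return num_systems * SUPERCHIPS_PER_SYSTEM * BLACKWELL_GPUS_PER_SUPERCHIP


def cpu_count(num_systems: int) -> int:
    """Grace CPUs in the given number of DGX GB200 systems."""
    return num_systems * SUPERCHIPS_PER_SYSTEM * GRACE_CPUS_PER_SUPERCHIP


if __name__ == "__main__":
    print(gpu_count(1))  # 72 GPUs in a single system
    print(gpu_count(8))  # 576 GPUs in the eight-system NVLink configuration
    print(cpu_count(8))  # 288 Grace CPUs in the same configuration
```

Changing the argument gives the totals for larger deployments, under the assumption that every additional system has the same 36-Superchip layout.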

New Rack-Scale DGX SuperPOD Architecture for Era of Generative AI
The new DGX SuperPOD with DGX GB200 systems features a unified compute fabric. In addition to fifth-generation NVIDIA NVLink, the fabric includes NVIDIA BlueField®-3 DPUs and will support NVIDIA Quantum-X800 InfiniBand networking, announced separately today. This architecture provides up to 1,800 gigabytes per second of bandwidth to each GPU in the platform.

Additionally, fourth-generation NVIDIA Scalable Hierarchical Aggregation and Reduction Protocol (SHARP)™ technology provides 14.4 teraflops of In-Network Computing, a 4x increase in the next-generation DGX SuperPOD architecture compared to the prior generation.

Turnkey Architecture Pairs With Advanced Software for Unprecedented Uptime
The new DGX SuperPOD is a complete, data-center-scale AI supercomputer that integrates with high-performance storage from NVIDIA-certified partners to meet the demands of generative AI workloads. Each is built, cabled and tested in the factory to dramatically speed deployment at customer data centers.

The Grace Blackwell-powered DGX SuperPOD features intelligent predictive-management capabilities to continuously monitor thousands of data points across hardware and software to predict and intercept sources of downtime and inefficiency — saving time, energy and computing costs.

The software can identify areas of concern and plan for maintenance, flexibly adjust compute resources, and automatically save and resume jobs to prevent downtime, even without system administrators present.

If the software detects that a replacement component is needed, the cluster will activate standby capacity to ensure work finishes in time. Any required hardware replacements can be scheduled to avoid unplanned downtime.

NVIDIA DGX B200 Systems Advance AI Supercomputing for Industries
NVIDIA also unveiled the NVIDIA DGX B200 system, a unified AI supercomputing platform for AI model training, fine-tuning and inference.

DGX B200 is the sixth generation of air-cooled, traditional rack-mounted DGX designs used by industries worldwide. The new Blackwell architecture DGX B200 system includes eight NVIDIA Blackwell GPUs and two 5th Gen Intel® Xeon® processors. Customers can also build DGX SuperPOD using DGX B200 systems to create AI Centers of Excellence that can power the work of large teams of developers running many different jobs.

DGX B200 systems include the FP4 precision feature in the new Blackwell architecture, providing up to 144 petaflops of AI performance, a massive 1.4TB of GPU memory and 64TB/s of memory bandwidth. This delivers 15x faster real-time inference for trillion-parameter models over the previous generation.
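For a rough sense of scale, the system-level figures can be divided across the eight GPUs. The sketch below assumes the totals are split evenly per GPU and that 1 TB equals 1,000 GB; neither assumption is stated in the announcement, and the names are illustrative only.

```python
# System-level figures from the announcement; an even per-GPU split is assumed.
GPUS_PER_DGX_B200 = 8
FP4_PETAFLOPS_TOTAL = 144.0     # "up to" AI performance at FP4
GPU_MEMORY_TB_TOTAL = 1.4       # total GPU memory
MEMORY_BANDWIDTH_TBPS = 64.0    # total memory bandwidth


def per_gpu(total: float, gpus: int = GPUS_PER_DGX_B200) -> float:
    """Divide a system-level total evenly across the GPUs."""
    return total / gpus


if __name__ == "__main__":
    print(per_gpu(FP4_PETAFLOPS_TOTAL))                # 18.0 petaflops per GPU
    print(per_gpu(MEMORY_BANDWIDTH_TBPS))              # 8.0 TB/s per GPU
    print(round(per_gpu(GPU_MEMORY_TB_TOTAL) * 1000))  # 175 GB per GPU
```

Because the source figures are rounded and quoted as upper bounds, the per-GPU values are estimates rather than published specifications.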

DGX B200 systems include advanced networking with eight NVIDIA ConnectX™-7 NICs and two BlueField-3 DPUs. These provide up to 400 gigabits per second bandwidth per connection — delivering fast AI performance with NVIDIA Quantum-2 InfiniBand and NVIDIA Spectrum™-X Ethernet networking platforms.

Software and Expert Support to Scale Production AI
All NVIDIA DGX platforms include NVIDIA AI Enterprise software for enterprise-grade development and deployment. DGX customers can accelerate their work with the pretrained NVIDIA foundation models, frameworks, toolkits and new NVIDIA NIM microservices included in the software platform.

NVIDIA DGX experts and select NVIDIA partners certified to support DGX platforms assist customers throughout every step of deployment, so they can quickly move AI into production. Once systems are operational, DGX experts continue to support customers in optimizing their AI pipelines and infrastructure.

Availability
NVIDIA DGX SuperPOD with DGX GB200 and DGX B200 systems is expected to be available later this year from NVIDIA’s global partners.

For more information, watch a replay of the GTC keynote or visit the NVIDIA booth at GTC, held at the San Jose Convention Center through March 21.

Richard Aizlewood
