NVIDIA launches DGX GH200 AI supercomputer-EEWORLD

Collect

New AI supercomputer connects 256 Grace Hopper superchips into a massive, 1-Exaflop, 144TB GPU for giant models that power generative AI, recommendation systems and data processing

COMPUTEX - May 29, 2023 - NVIDIA today announced the launch of a new large-memory AI supercomputer - the NVIDIA DGX™ supercomputer powered by the NVIDIA® GH200 Grace Hopper superchip and the NVIDIA NVLink® Switch System, designed to help Develop massive, next-generation models for generative AI language applications, recommendation systems, and data analysis workloads.

The large shared memory space of NVIDIA DGX GH200 connects 256 GH200 super chips through NVLink interconnect technology and NVLink Switch System, allowing them to operate as a single GPU. It offers 1 exaflop of performance and 144 TB of shared memory - nearly 500 times larger than the memory of the previous generation NVIDIA DGX A100 launched in 2020.

NVIDIA founder and CEO Jensen Huang said: "Generative AI, large-scale language models and recommendation systems are the digital engines of the modern economy. The DGX GH200 AI supercomputer integrates NVIDIA's most advanced accelerated computing and network technologies to expand the frontier of AI. "

NVIDIA NVLink technology scales AI at scale

The GH200 superchip uses the NVIDIA NVLink-C2C chip interconnect to integrate the Arm-based NVIDIA Grace™ CPU with the NVIDIA H100 Tensor Core GPU, eliminating the need for traditional CPU-to-GPU PCIe connections. This increases bandwidth between the GPU and CPU by 7x, reduces interconnect power consumption by more than 5x compared to the latest PCIe technology, and provides a 600GB Hopper architecture GPU building block for the DGX GH200 supercomputer.

DGX GH200 is the first supercomputer to pair the Grace Hopper super chip with the NVIDIA NVLink Switch System, a new interconnect that allows all GPUs in the DGX GH200 system to operate together as a whole. The previous generation system could only integrate 8 GPUs into one GPU through NVLink without affecting performance.

The DGX GH200 architecture increases NVLink bandwidth by more than 48 times compared to the previous generation, enabling large-scale AI supercomputer capabilities to be provided through simple programming on a single GPU.

New research tools for AI pioneers

Google Cloud, Meta and Microsoft are among the first companies expected to plug into the DGX GH200 to explore its capabilities for generative AI workloads. NVIDIA also intends to make the DGX GH200 design available as a blueprint to cloud service providers and other hyperscalers so that they can further customize it to their own infrastructure.

"Building advanced generative models requires innovative AI infrastructure," said Mark Lohmeyer, Vice President of Cloud Computing at Google. "The Grace Hopper superchip's new NVLink and shared memory solve key bottlenecks for large-scale AI, and we look forward to its implementation in Google Cloud and our Develop powerful capabilities in generative AI programs.”

Alexis Björlin, Vice President of Infrastructure, AI Systems and Acceleration Platforms at Meta, said: "As AI models become larger and larger, they require powerful infrastructure that can scale to meet growing demand. NVIDIA's Grace Hopper design looks to be able to Let researchers explore new ways to solve their greatest challenges."

Girish Bablani, corporate vice president of Azure Infrastructure at Microsoft, said, “Training large AI models has been a resource- and time-intensive task in the past. The potential of DGX GH200 to process terabyte-level data sets enables developers to operate at a larger scale and Conduct high-level research at a faster pace.”

New NVIDIA Helios supercomputer will advance research and development

NVIDIA is building its own DGX GH200-based AI supercomputer to support the work of the R&D team.

The supercomputer, called NVIDIA Helios, will be equipped with four DGX GH200 systems. Each will be interconnected via an NVIDIA Quantum-2 InfiniBand network to increase data throughput for training large AI models. Helios will contain 1,024 Grace Hopper superchips and is expected to be online by the end of this year.

Fully integrated and built for giant models

The DGX GH200 supercomputer includes NVIDIA software to provide a turnkey, full-stack solution for the largest AI and data analytics workloads. NVIDIA Base Command™ software provides AI workflow management, enterprise-class cluster management and multiple libraries to accelerate compute, storage and network infrastructure, as well as system software optimized for running AI workloads.

It also includes NVIDIA AI Enterprise, the software layer of the NVIDIA AI platform. It provides more than 100 frameworks, pre-trained models, and development tools to simplify the development and deployment of production AI such as generative AI, computer vision, and speech AI.

Availability

The NVIDIA DGX GH200 supercomputer is expected to be available by the end of this year.

Watch Jen-Hsun Huang introduce the NVIDIA DGX GH200 supercomputer during the COMPUTEX 2023 keynote.

Keywords：NVIDIA Reference address：NVIDIA launches DGX GH200 AI supercomputer

Previous article：Gartner releases four trends shaping the future of cloud, data center and edge infrastructure
Next article：Intel joins hands with Danghong Qitian and China Mobile to build a cloud-edge-device converged computing architecture to promote a new e-sports experience in "Thousands of Stores"

Recommended ReadingLatest update time:2024-11-16 09:18

Keysight Technologies Joins AI-RAN Alliance to Advance AI Innovation in Mobile Networks

New alliance focuses on integrating AI innovations into wireless communications to improve performance of wireless access networks Keysight Technologies provides professional measurement technologies for improving spectrum efficiency and optimizing wireless access network performance for artificial i

[Test Measurement]

Keysight Technologies Joins AI-RAN Alliance to Advance AI Innovation in Mobile Networks

Artificial intelligence technology research and development company CoCoPIE received tens of millions of yuan in Series A financing

Recently, CoCoPIE announced that it had received tens of millions of yuan in Series A financing, led by Sequoia China Seed Fund. CoCoPIE was founded in 2020 and focuses on the research and development and commercialization of real-time artificial intelligence technology for mobile devices such as mobile phones, IoT,

[Mobile phone portable]

Canalys: 60% of personal computers will be compatible with AI functions in 2027, and shipments are expected to exceed 175 million units

According to the latest report released by market research agency Canalys, in the current AI wave, personal computers, as the core tool for modern work and leisure for enterprises and consumers, will now face earth-shaking transformation in terms of software and hardware capabilities, thus welcoming the large-scale po

[Home Electronics]

AI ignites edge computing revolution—Advantech's 2024 Embedded Industry Partner Conference is about to start!

In recent years, the number of connected IoT devices has shown a linear growth trend, and the devices themselves are becoming more and more intelligent. The implementation and integration of artificial intelligence and the Internet of Things in practical applications will push human society into the era of "inte

[Industrial Control]

AI ignites edge computing revolution—Advantech's 2024 Embedded Industry Partner Conference is about to start!

The AI industry faces a diversity crisis: Women and people of color are marginalized

The artificial intelligence industry is facing a "diversity crisis," researchers at the AI Now Institute said in a report released today, raising critical questions about the direction of the field. The report found that women and people of color are severely underrepresented. The report pointed out that studies h

[Mobile phone portable]

Chinese scientists use artificial intelligence to detect blinding eye diseases in infants and young children

The team of Zhongshan Ophthalmology Center of Sun Yat-sen University held a press conference on the 30th and announced that it has joined forces with multiple institutions around the world to jointly research and develop a mobile phone intelligent screening system for visual impairment in infants and young children. P

[Medical Electronics]

AI everywhere, leading the future: 2024 Intel Cup Undergraduate Electronic Design Competition Embedded System Special Invitational Competition successfully concluded

The 2024 Intel Cup Undergraduate Electronic Design Competition Embedded System Invitational Competition came to an end, witnessing young people unleashing the infinite possibilities of AI Recently, the award ceremony of the 2024 Intel Cup Undergraduate Electronic Design Competition Embedded System

[Network Communication]

AI everywhere, leading the future: 2024 Intel Cup Undergraduate Electronic Design Competition Embedded System Special Invitational Competition successfully concluded

IBM's new CEO: We are not considering splitting up the company. AI is our main offensive direction

On May 6, IBM's new CEO Arvind Krishna said that the epidemic has brought new digital transformation business opportunities. He will not consider splitting the company and will continue to consider acquisitions. Due to the impact of the epidemic, the Think 2020 conference that opened today was fully changed to an onli

[Internet of Things]

IBM's new CEO: We are not considering splitting up the company. AI is our main offensive direction

Popular Resources
Popular amplifiers