CIO Influence
IT and DevOps

NVIDIA Grace Hopper Systems Roll into GTC: A Rack ‘n’ Roll Experience

NVIDIA Grace Hopper Systems Roll into GTC: A Rack 'n' Roll Experience

The upcoming NVIDIA GTC conference will honor the legacy of software pioneer Grace Hopper. Advanced systems featuring powerful processors, named in tribute to Hopper, will be showcased at the event. These systems are designed to elevate computing capabilities and will be exhibited from March 18-21. Exhibitors will present over 500 servers across 18 racks equipped with NVIDIA GH200 Grace Hopper Superchips. This display, occupying the MGX Pavilion, will be the largest at NVIDIA’s San Jose Convention Center booth.

MGX: Streamlining Market Deployment

NVIDIA MGX is a blueprint for constructing accelerated servers, incorporating various GPUs, CPUs, and data processing units (DPUs) combinations. This framework caters to various applications spanning AI, high-performance computing, and NVIDIA Omniverse.

At GTC, attendees can closely examine MGX models tailored for enterprise, cloud, and telco-edge applications. These models are specifically optimized for generative AI inference, recommenders, and data analytics tasks.

The pavilion will feature accelerated systems housing single and dual GH200 Superchips within 1U and 2U chassis configurations. These systems are interconnected via NVIDIA BlueField-3 DPUs and NVIDIA Quantum-2 400Gb/s InfiniBand networks, facilitated by LinkX cables and transceivers.

Moreover, these systems adhere to industry standards for 19- and 21-inch rack enclosures, with many offering E1.S bays to accommodate nonvolatile storage solutions.

Exploring Grace Hopper Systems

  • ASRock RACK’s MECAI: This system, measuring 450 x 445 x 87mm, specializes in accelerating AI and 5G services, particularly in constrained spaces at the edge of telco networks.
  • ASUS’s ESC NM2N-E1: Designed to fit into racks holding up to 32 GH200 processors, this MGX server supports air- and water-cooled nodes.
  • Foxconn’s MGX Systems: Offerings include a 4U model capable of accommodating up to eight NVIDIA H100 NVL PCIe Tensor Core GPUs, catering to demanding computational tasks.
  • GIGABYTE’s XH23-VG0-MGX: Known for its ample storage capabilities, this system features six 2.5-inch Gen5 NVMe hot-swappable bays and two M.2 slots.
  • Inventec’s MGX Systems: Compatible with 19- and 21-inch racks, these systems utilize three different liquid cooling implementations, enhancing performance and reliability.
  • Lenovo’s MGX Servers: Lenovo offers a diverse range of 1U, 2U, and 4U MGX servers, including models supporting direct liquid cooling, catering to various deployment needs.
  • Pegatron’s AS201-1N0: This air-cooled server integrates a BlueField-3 DPU, enhancing networking capabilities through software-defined, hardware-accelerated networking.
  • QCT’s QuantaGrid D74S-IU: Stackable up to 16 systems into a single QoolRack, each equipped with two GH200 Superchips, providing scalable and efficient computing solutions.
  • Supermicro’s ARS-111GL-NHR: Part of a portfolio offering air- and liquid-cooled GH200 and NVIDIA Grace CPU systems, this server features nine hot-swappable fans for optimal cooling.
  • Wiwynn’s SV7200H: A 1U dual GH200 system supporting a BlueField-3 DPU and a remotely managed liquid-cooling subsystem, ensuring efficient and manageable operations.
  • Wistron’s MGX Servers: These 4U GPU systems specialize in AI inference and mixed workloads, accommodating up to eight accelerators in a single system, catering to diverse computational needs.

These newly introduced servers complement three existing accelerated systems announced at COMPUTEX last May, further expanding the offerings of MGX-powered solutions.

Empowering Performance: Grace Hopper’s Dual-Purpose

System builders are increasingly embracing the hybrid processor for its potent capabilities.

GH200 Superchips seamlessly integrate a high-performance, power-efficient Grace CPU with a robust NVIDIA H100 GPU. Together, they leverage a fast NVIDIA NVLink-C2C interconnect to share hundreds of gigabytes of memory.

This combination results in a formidable processor and memory complex, ideally suited for tackling today’s most challenging tasks, including executing large language models. Equipped with ample memory and processing speed, they excel in linking generative AI models to data sources, enhancing accuracy through retrieval-augmented generation (RAG).

Enhanced Efficiency: GH200 Superchip Performance

The GH200 Superchip offers remarkable efficiency, boasting up to 4 times the performance compared to utilizing the H100 GPU with traditional CPUs. This significant advancement particularly benefits tasks such as generating recommendations for online shopping or media streaming.

During its debut on the MLPerf industry benchmarks in November, GH200 systems demonstrated outstanding performance by completing all data center inference tests. This accomplishment further solidifies the GH200’s position as a leader in computational capabilities, building upon the already impressive performance of H100 GPUs. In every aspect, GH200 systems are elevating computing to unprecedented levels, continuing the legacy of innovation initiated by their namesake on the first mainframe computers over seven decades ago.

FAQs

1. What is NVIDIA GTC, and why is it significant for the tech industry?

NVIDIA GTC, or the GPU Technology Conference, is a global event focusing on the latest advancements in GPU technology and its applications across various industries. It serves as a platform for innovators, researchers, and industry leaders to exchange ideas and showcase cutting-edge developments in artificial intelligence, high-performance computing, and more.

2. Who was Grace Hopper, and why are systems named after her featured at GTC?

Grace Hopper was a pioneering computer scientist and naval officer known for significantly contributing to developing early programming languages and computer technology. NVIDIA honors her legacy by naming their advanced systems, featuring powerful processors, after her. These systems are showcased at GTC to highlight the ongoing advancements in computing technology.

3. What are the key features of the NVIDIA GH200 Grace Hopper Superchips?

The NVIDIA GH200 Grace Hopper Superchips combines a high-performance Grace CPU with a muscular NVIDIA H100 GPU, resulting in a potent hybrid processor. These Superchips boast impressive efficiency and offer up to 4 times the performance compared to using the H100 GPU with traditional CPUs. They also facilitate seamless memory sharing over a fast NVIDIA NVLink-C2C interconnect.

4. How do the GH200 systems contribute to advancing AI and accelerated computing?

GH200 systems significantly enhance efficiency and performance, particularly in AI inference and recommendation generation tasks. Their superior computational capabilities, demonstrated by their successful completion of all data center inference tests on MLPerf benchmarks, make them invaluable tools for pushing the boundaries of AI and accelerated computing.

[To share your insights with us as part of editorial or sponsored content, please write to sghosh@martechseries.com]

Related posts

Pangaia Deploys Board To Transform Sales Reporting and Analytics

CIO Influence News Desk

Cognizant Expands Digital Engineering Capabilities with Hunter Technical Resources Acquisition

CIO Influence Interview with Filip Verloy, Field CTO for the EMEA Region at Noname Security

Sudipto Ghosh