Available Now

Supermicro GB300 NVL72

A rack-scale AI system from Supermicro with 72 Blackwell Ultra GPUs and 36 Grace CPUs in a single liquid-cooled 48U rack, with up to 21 TB of HBM3e GPU memory. Built for large-scale AI training, reasoning, and inference.

Model
SRS-GB300-NVL72
Supermicro SRS-GB300-NVL72
OEM Partner / Tier 1
GPUs
72x
NVIDIA B300 GPUs (Blackwell Ultra)
Memory
~21 TB
Total HBM3e memory
Form Factor
48U
Liquid-cooled rack
Networking
800 GbE
ConnectX-8 SuperNIC
Server Details

Compute

72 Blackwell Ultra GPUs

72 NVIDIA B300 (Blackwell Ultra) GPUs with up to 288 GB HBM3e per GPU and approximately 21 TB total GPU memory. The entire rack operates as a single NVLink domain.
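
The total above follows directly from the per-GPU figure, and the same arithmetic gives a rough first check on whether a model's weights fit in rack memory. The sketch below is a minimal illustration, not a sizing tool: the 1-trillion-parameter model and the bytes-per-parameter values are hypothetical assumptions, and it counts weights only, ignoring KV cache, activations, and optimizer state.

```python
# Figures taken from the spec above ("up to" values).
GPUS_PER_RACK = 72
HBM_PER_GPU_GB = 288


def total_hbm_tb(gpus: int = GPUS_PER_RACK, per_gpu_gb: float = HBM_PER_GPU_GB) -> float:
    """Total GPU memory in decimal terabytes (1 TB = 1000 GB)."""
    return gpus * per_gpu_gb / 1000


def weights_footprint_tb(params_billions: float, bytes_per_param: float) -> float:
    """Memory for model weights only, in decimal terabytes."""
    # 1e9 parameters * bytes per parameter / 1e12 bytes per TB
    return params_billions * bytes_per_param / 1000


if __name__ == "__main__":
    rack_tb = total_hbm_tb()
    print(f"Rack HBM3e: {rack_tb:.2f} TB")
    # Hypothetical 1-trillion-parameter model at assumed precisions.
    for name, bytes_per_param in [("16-bit", 2.0), ("8-bit", 1.0), ("4-bit", 0.5)]:
        need_tb = weights_footprint_tb(1000, bytes_per_param)
        print(f"{name}: {need_tb:.2f} TB of weights, {need_tb / rack_tb:.0%} of rack HBM")
```

Real deployments need headroom beyond the weights, so treat the percentage as a floor on memory use rather than a fit guarantee.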

Processors

36 NVIDIA Grace CPUs

36 Arm-based NVIDIA Grace CPUs tightly coupled with the GPU fabric. Up to 17 TB LPDDR5X system memory. Purpose-built to minimize CPU-GPU bottlenecks in AI workloads.

Interconnect

NVLink 5 at Rack Scale

1.8 TB/s NVLink bandwidth per GPU, 130 TB/s aggregate across the rack. All 72 GPUs communicate as a single compute domain, eliminating multi-node overhead for large models.
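
The aggregate figure is the per-GPU bandwidth multiplied across all 72 GPUs, rounded. A minimal sketch of that arithmetic follows, plus an idealized transfer-time estimate. The function names are illustrative, and the estimate assumes the quoted peak rate with no protocol overhead or contention, so measured times will be longer.

```python
GPUS_PER_RACK = 72
NVLINK_TBPS_PER_GPU = 1.8  # peak per-GPU figure quoted above


def aggregate_nvlink_tbps(gpus: int = GPUS_PER_RACK,
                          per_gpu_tbps: float = NVLINK_TBPS_PER_GPU) -> float:
    """Sum of per-GPU NVLink bandwidth across the rack, in TB/s."""
    return gpus * per_gpu_tbps


def ideal_transfer_ms(payload_gb: float,
                      link_tbps: float = NVLINK_TBPS_PER_GPU) -> float:
    """Lower bound on the time to move payload_gb over one GPU's NVLink.

    GB divided by TB/s yields milliseconds (1 GB / 1 TB/s = 1 ms).
    """
    return payload_gb / link_tbps


if __name__ == "__main__":
    print(f"Aggregate NVLink: {aggregate_nvlink_tbps():.1f} TB/s")
    # Idealized time to stream one GPU's full 288 GB of HBM3e to a peer.
    print(f"288 GB at peak rate: {ideal_transfer_ms(288):.0f} ms")
```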

Cooling

Supermicro Direct Liquid Cooling

Full-rack liquid cooling with an in-rack CDU and redundant pumps, built on Supermicro's direct liquid cooling technology. Liquid cooling is required to handle the thermal load of 72 GPUs at approximately 132 to 140 kW per rack.

GPU / 01
GPU
72x NVIDIA B300 (Blackwell Ultra) GPUs
GPU Count
72 GPUs across 18x 1U compute trays (4 GPUs per tray)
GPU Memory
Up to 288 GB HBM3e per GPU (approximately 21 TB total)
Interconnect
NVLink 5 with NVSwitch; 1.8 TB/s per GPU; 130 TB/s aggregate; 9x NVLink Switch trays
Host / 02
CPU
36x NVIDIA Grace (Arm Neoverse V2), 72 cores per CPU
System Memory
Up to 17 TB LPDDR5X
I/O / 03
Storage
Up to 144x E1.S PCIe 5.0 drive bays (8 per compute tray)
Networking
ConnectX-8 SuperNIC; 800 Gb/s per GPU; Quantum-X800 InfiniBand or Spectrum-X Ethernet
Power & Form / 04
Cooling
Direct liquid cooling (required); in-rack CDU with redundant pumps
Power Supply
8x 1U 33 kW power shelves; 48V DC busbar; approximately 132 to 140 kW per rack
Form Factor
48U
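
For capacity planning, the rack-level totals can be derived from the per-tray and per-unit figures in the table above. The sketch below is a minimal cross-check under stated assumptions: the class and property names are illustrative, and the power headroom ratio is simple arithmetic on the listed shelf ratings. The source does not state the redundancy scheme, so any redundancy reading of that ratio is an inference.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RackSpec:
    # Values copied from the spec table above.
    compute_trays: int = 18
    gpus_per_tray: int = 4
    drive_bays_per_tray: int = 8
    cpus: int = 36
    cores_per_cpu: int = 72
    nic_gbps_per_gpu: int = 800
    power_shelves: int = 8
    shelf_kw: int = 33
    rack_kw_max: int = 140  # upper end of the 132 to 140 kW range

    @property
    def gpus(self) -> int:
        return self.compute_trays * self.gpus_per_tray

    @property
    def drive_bays(self) -> int:
        return self.compute_trays * self.drive_bays_per_tray

    @property
    def cpu_cores(self) -> int:
        return self.cpus * self.cores_per_cpu

    @property
    def network_tbps(self) -> float:
        # Aggregate scale-out bandwidth: Gb/s per GPU summed, in Tb/s.
        return self.gpus * self.nic_gbps_per_gpu / 1000

    @property
    def shelf_capacity_kw(self) -> int:
        return self.power_shelves * self.shelf_kw

    @property
    def power_headroom(self) -> float:
        # Ratio of installed shelf capacity to the maximum listed rack load.
        return self.shelf_capacity_kw / self.rack_kw_max


if __name__ == "__main__":
    rack = RackSpec()
    print(f"GPUs: {rack.gpus}, drive bays: {rack.drive_bays}, CPU cores: {rack.cpu_cores}")
    print(f"Aggregate network: {rack.network_tbps:.1f} Tb/s")
    print(f"Shelf capacity: {rack.shelf_capacity_kw} kW ({rack.power_headroom:.2f}x max load)")
```

Changing a field, for example `RackSpec(rack_kw_max=132)`, recomputes the derived totals for a different planning assumption.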
Platform / NVIDIA GB300 NVL72

NVIDIA GB300 NVL72: Rack-Scale AI for the Reasoning Era

The GB300 NVL72 is NVIDIA's flagship rack-scale AI platform, purpose-built for the shift toward test-time scaling and AI reasoning workloads. NVIDIA states that, compared to Hopper-based systems, the GB300 NVL72 delivers up to a 10x improvement in tokens per second per user and up to a 5x improvement in throughput per megawatt, which multiply to an increase of up to 50x in overall AI factory output.

The Supermicro SRS-GB300-NVL72 is one of the GB300 NVL72 systems available through Arc Compute. Supermicro brings industry-leading direct liquid cooling technology, manufacturing capacity, and end-to-end data center building block solutions to this platform. Our team handles capacity planning, facility assessment, and deployment support for rack-scale and multi-rack installations.

GB300 NVL72 Family
NVIDIA GB300 Grace Blackwell Ultra Superchip
Available Through Arc
Aivres | Supermicro
Why Arc Compute

The team behind your infrastructure

Configured for your workload

Every cluster is designed around your workloads, your facility, and your operational model.

Fast, predictable deployment

Systems are validated, burned in, and shipped with full deployment support.

Deep GPU infrastructure expertise

Hands-on experience designing and deploying high-density GPU infrastructure.

Full lifecycle support

From planning and procurement through deployment, operations, and scaling.

Quote Request

Supermicro GB300 NVL72

A member of the Arc team will reach out to you within one business day. You can also email us directly at sales@arccompute.io.