A rack-scale AI system from Aivres with 72 Blackwell Ultra GPUs and 36 Grace CPUs in a single liquid-cooled 48U rack. Up to 21 TB HBM3e GPU memory. Built for large-scale AI training, reasoning, and inference.


72 NVIDIA B300 (Blackwell Ultra) GPUs with up to 288 GB HBM3e per GPU and approximately 21 TB total GPU memory. The entire rack operates as a single NVLink domain.
36 Arm-based NVIDIA Grace CPUs tightly coupled with the GPU fabric. Up to 17 TB LPDDR5X system memory. Purpose-built to minimize CPU-GPU bottlenecks in AI workloads.
1.8 TB/s NVLink bandwidth per GPU, 130 TB/s aggregate across the rack. All 72 GPUs communicate as a single compute domain, eliminating multi-node overhead for large models.
Direct liquid cooling with in-rack CDU and redundant pumps, designed for sustained high-density operation at 132 to 140 kW per rack. Required for the thermal envelope of this system.
The GB300 NVL72 is NVIDIA's flagship rack-scale AI platform, purpose-built for the shift toward test-time scaling and AI reasoning workloads. Compared to Hopper-based systems, the GB300 NVL72 delivers a 10x improvement in tokens per second per user and a 5x improvement in throughput per megawatt, combining to a 50x increase in overall AI factory output.
The Aivres KRS8500V3 is one of the GB300 NVL72 systems available through Arc Compute. Aivres brings a focused approach to GPU infrastructure at scale, with system designs optimized for AI-native workloads and dense deployment environments. Our team handles capacity planning, facility assessment, and deployment support for rack-scale and multi-rack installations.

Every cluster is designed around your workloads, your facility, and your operational model.
Systems are validated, burned in, and shipped with full deployment support.
Hands-on experience designing and deploying high-density GPU infrastructure.
From planning and procurement through deployment, operations, and scaling.