Name: Aivres KRS8000V4 NVIDIA Vera Rubin NVL72 GPU Server
Brand: Aivres
Availability: PreOrder

Server Details

Aivres KRS8000V4

Compute

72 Rubin Ultra GPUs

72 next-generation NVIDIA Rubin Ultra GPUs in a single rack, operating as one unified compute domain. The platform is expected to deliver significant gains in compute density and performance per watt over the GB300 NVL72.

Processors

36 Vera CPUs

36 NVIDIA Vera CPUs purpose-built for tight integration with the GPU fabric. Vera is the successor to Grace and is designed to further reduce CPU-GPU bottlenecks in AI workloads.

Interconnect

Next-Gen NVLink

Next-generation NVLink interconnect providing high-bandwidth, low-latency connectivity across all 72 GPUs. The full rack is expected to operate as a single compute node.

Cooling

Full Rack Liquid Cooling

Direct liquid cooling designed to sustain high-density operation under the thermal load of 72 next-generation GPUs. Built for continuous AI factory workloads.

GPU / 01

GPU

72x NVIDIA Rubin Ultra GPUs

GPU Count

72 GPUs across 18 compute trays (expected)

GPU Memory

Next-generation HBM (capacity TBD)

Interconnect

Next-generation NVLink with NVSwitch

Host / 02

CPU

36x NVIDIA Vera CPUs (Arm-based)

System Memory

Next-generation LPDDR (capacity TBD)
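
For early capacity planning, the counts listed above can be broken down per tray. The short Python sketch below is illustrative arithmetic only: it assumes GPUs and CPUs are distributed evenly across the 18 compute trays, which this listing marks as expected rather than confirmed, and the helper name `per_tray` is hypothetical.

```python
# Illustrative arithmetic from the listed counts; not an official specification.
GPUS_PER_RACK = 72   # from the GPU Count entry
CPUS_PER_RACK = 36   # from the CPU entry
COMPUTE_TRAYS = 18   # "expected" per the listing, not confirmed


def per_tray(total: int, trays: int) -> int:
    """Return units per tray, assuming an even split (an assumption)."""
    if trays <= 0:
        raise ValueError("tray count must be positive")
    if total % trays != 0:
        raise ValueError(f"{total} units do not divide evenly across {trays} trays")
    return total // trays


gpus_per_tray = per_tray(GPUS_PER_RACK, COMPUTE_TRAYS)
cpus_per_tray = per_tray(CPUS_PER_RACK, COMPUTE_TRAYS)
gpus_per_cpu = GPUS_PER_RACK // CPUS_PER_RACK

print(f"GPUs per tray: {gpus_per_tray}")
print(f"CPUs per tray: {cpus_per_tray}")
print(f"GPUs per CPU:  {gpus_per_cpu}")
```

If the final tray count differs at launch, changing `COMPUTE_TRAYS` updates the breakdown, and the helper raises an error when the split is no longer even.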

I/O / 03

Storage

Configuration dependent (TBD at launch)

Networking

NVIDIA ConnectX-8 or successor (expected); high-bandwidth RDMA

Power & Form / 04

Cooling

Direct liquid cooling (required)

Power Supply

Configuration dependent (TBD at launch)

Form Factor

48U rack (expected)

Platform / NVIDIA Vera Rubin NVL72

NVIDIA Vera Rubin NVL72: The Next Generation of Rack-Scale AI

The NVIDIA Vera Rubin NVL72 is the successor to the GB300 NVL72, combining 72 Rubin Ultra GPUs and 36 Vera CPUs into a single unified rack-scale compute domain. NVIDIA has stated the platform is designed to deliver significant gains in performance per watt and lower cost per token compared to previous generations, particularly for large-scale training and mixture-of-experts models.

The Aivres KRS8000V4 is one of the Vera Rubin NVL72 systems that will be available through Arc Compute. Aivres brings a focused approach to AI-native infrastructure at scale, with system designs optimized for dense GPU deployments and high-utilization environments. Our team is actively supporting early infrastructure planning for organizations building toward Rubin-generation rack-scale AI.

Vera Rubin NVL72 Family