Coming Soon

Aivres Vera Rubin NVL72

Next-generation rack-scale AI system from Aivres with 72 Rubin Ultra GPUs and 36 Vera CPUs in a single liquid-cooled rack. Designed for frontier-scale training, reasoning, and hyperscale inference. Coming soon.

Model
KRS8000V4
Aivres KRS8000V4
OEM Partner / Tier 1
GPUs
72X
NVIDIA Rubin Ultra GPUs
Memory
TBD
Next-gen HBM (TBD)
Form Factor
48U
Liquid-cooled rack
Server Details

Aivres KRS8000V4

Compute

72 Rubin Ultra GPUs

72 next-generation NVIDIA Rubin Ultra GPUs in a single rack, operating as one unified compute domain. Expected to deliver significant gains in compute density and performance per watt over the GB300 NVL72.

Processors

36 Vera CPUs

36 NVIDIA Vera CPUs purpose-built for tight integration with the GPU fabric. The successor to Grace, designed to further minimize CPU-GPU bottlenecks in AI workloads.

Cooling

Next-Gen NVLink

Next-generation NVLink interconnect providing high-bandwidth, low-latency connectivity across all 72 GPUs. The full rack is expected to operate as a single compute node.

Build

Full Rack Liquid Cooling

Direct liquid cooling designed for sustained high-density operation at the thermal demands of 72 next-generation GPUs. Built for continuous AI factory workloads.

GPU / 01
GPU
72x NVIDIA Rubin Ultra GPUs
GPU Count
72 GPUs across expected 18x compute trays
GPU Memory
Next-generation HBM (capacity TBD)
Interconnect
Next-generation NVLink with NVSwitch
Host / 02
CPU
36x NVIDIA Vera CPUs (Arm-based)
System Memory
Next-generation LPDDR (capacity TBD)
I/O / 03
Storage
Configuration dependent (TBD at launch)
Networking
Expected ConnectX-8 or successor; high-bandwidth RDMA
Power & Form / 04
Cooling
Direct liquid cooling (required)
Power Supply
Configuration dependent (TBD at launch)
Form Factor
48U rack (expected)
Platform / NVIDIA Vera Rubin NVL72

NVIDIA Vera Rubin NVL72: The Next Generation of Rack-Scale AI

The NVIDIA Vera Rubin NVL72 is the successor to the GB300 NVL72, combining 72 Rubin Ultra GPUs and 36 Vera CPUs into a single unified rack-scale compute domain. NVIDIA has stated the platform is designed to deliver significant gains in performance per watt and lower cost per token compared to previous generations, particularly for large-scale training and mixture-of-experts models.

The Aivres KRS8000V4 is one of the Vera Rubin NVL72 systems that will be available through Arc Compute. Aivres brings a focused approach to AI-native infrastructure at scale, with system designs optimized for dense GPU deployments and high-utilization environments. Our team is actively supporting early infrastructure planning for organizations building toward Rubin-generation rack-scale AI.

Vera Rubin NVL72 Family
NVIDIA HGX B300 8-GPU baseboard
Available Through Arc
Aivres | Supermicro
Why Arc Compute

The team behind your infrastructure

Configured for your workload

Every cluster is designed around your workloads, your facility, and your operational model.

Fast, predictable deployment

Systems are validated, burned in, and shipped with full deployment support.

Deep GPU Infrastructure expertise

Hands-on experience designing and deploying high-density GPU infrastructure.

Full lifecycle support

From planning and procurement through deployment, operations, and scaling.

Quote Request

Aivres Vera Rubin NVL72

Submission Successful!
A member of the Arc team will reach out to you within one business day.
Something went wrong while submitting. Please try again, or email us directly at sales@arccompute.io.