A GPU Job Scheduler is a tool that manages and schedules the allocation of GPUs in a cluster environment. They enable the efficient utilization of GPU resources by allocating them to the jobs that need them. Schedulers also provide a unified interface for submitting, monitoring, and controlling the execution of GPU jobs in clusters.
Although schedulers can be very useful to Systems Administrators, they have drawbacks when it comes to truly maximizing utilization and performance. These issues are addressed by Arc Compute's ArcHPC.
To increase GPU utilization when multiple jobs need to be trained, many Schedulers rely on multi-tenancy through MIG (Multi-Instance GPU). Let's run through an example to highlight the differences between Schedulers utilizing MIG and Arc Compute utilizing GVM Server's feature SMVGPU (
Simultaneous Multi-Virtual GPU).