System Overview
Introduction
Four nodes within the raad2 infrastructure contain graphics processing units (GPUs) that can be used for general-purpose computing. Logically, these four nodes are treated as a separate cluster with their own login node, raad2-gfx. Physically, however, they are connected to the same infrastructure as raad2 and use an FDR InfiniBand network to reach the same DDN storage system that the non-GPU nodes rely on. Each GPU node is equipped with two NVIDIA V100 GPUs and Intel Xeon Skylake processors. Users who want to accelerate AI, HPC, or data science applications can benefit substantially from this resource. The most commonly used GPU packages are already available on the system.
Job Scheduler
The GPU cluster uses Slurm as its job scheduler.
Workload Manager | Slurm 20.11.7 |
Queue | gpu |
Local SSD Storage | /tmp |
Per User GPU limit | 1 GPU Per Job |
Per User CPU limit | 18 CPUs Per Job |
Per User memory limit | 92GB Per Job |
Default Job Walltime | 1 Hour |
Maximum Job Walltime | 24 Hours |
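
As an illustration of how these limits map onto a job request, the following is a minimal sketch of a batch script that asks for the maximum resources listed above. It assumes that the "gpu" queue is requested as a Slurm partition and that the GPUs are exposed under the generic resource name "gpu"; neither detail is confirmed here, so check the site configuration before relying on it. The job name and the scratch directory naming are hypothetical.

```shell
#!/bin/bash
#SBATCH --job-name=gpu-example     # hypothetical job name
#SBATCH --partition=gpu            # the "gpu" queue (assumed to be a Slurm partition)
#SBATCH --gres=gpu:1               # 1 GPU per job (assumes the GRES is named "gpu")
#SBATCH --cpus-per-task=18         # up to 18 CPUs per job
#SBATCH --mem=92G                  # up to 92GB per job
#SBATCH --time=24:00:00            # up to 24 hours; 1 hour applies if omitted
#SBATCH --output=%x-%j.out         # log file named <job-name>-<job-id>.out

# Create a per-job scratch directory on the node-local SSD (/tmp).
# The fallbacks keep the script runnable outside of a Slurm allocation.
SCRATCH_DIR="$(mktemp -d "/tmp/${USER:-user}-${SLURM_JOB_ID:-nojob}-XXXXXX")"
trap 'rm -rf "$SCRATCH_DIR"' EXIT   # remove the scratch directory when the script exits

echo "Scratch directory: $SCRATCH_DIR"

# Report the GPU visible to the job, if the NVIDIA tools are on the PATH.
if command -v nvidia-smi >/dev/null 2>&1; then
    nvidia-smi || echo "nvidia-smi could not query a GPU"
fi

# Replace this line with the real workload, for example: srun ./my_gpu_program
```

Saved under a name such as gpu-job.sh (a hypothetical filename), the script would be submitted from the raad2-gfx login node with `sbatch gpu-job.sh`, and `squeue -u $USER` would then show its state. Requesting less than the maximum walltime, when the job allows it, generally helps the scheduler start the job sooner.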