To run jobs on Gadi, users should submit to a specific queue on a corresponding node. Which queue and which node you choose to run on will depend on a variety of factors. For example, any job that needs to run on GPUs will need to be submitted to the gpuvolta or dgxa100 queue, as these are the only queues with access to GPUs. Any jobs that require a large amount of memory should be submitted to the hugemem queue to take advantage of the persistent memory there. Note |
---|
If your job can run on the nodes in a normal queue, you should use those queues. The normal queues have more nodes available for your jobs, and will allow users, and jobs that require a specialised queue, to get fair access to those resources. |
The queue structure is split into two main levels of priority, express and normal , which correlates directly to the queue names. Express queues are designs to support work that needs a faster turnaround, but will be charged accordingly at a higher service unit charge. |