6

I'm having trouble with a specific node. Until I resolve it, I don't want any jobs to run on it. How can I temporarily take this node out of the node "pool"?

David B

4 Answers

7

To disable:

qmod -d '*@node_name'

To re-enable:

qmod -e '*@node_name'
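
If you do this often, a small wrapper saves retyping the pattern. This is only a sketch: `qmod_node_cmd` is a made-up helper name, not part of SGE. It just prints the `qmod` line for a node with the `*@` pattern quoted, so the shell cannot expand the `*` against files in the current directory. Nothing is executed, so you can check the line before running it.

```shell
#!/bin/sh
# Hypothetical helper (not an SGE command): print the qmod command that
# disables or enables every queue instance on one node.
qmod_node_cmd() {
  case "$1" in
    disable) flag="-d" ;;
    enable)  flag="-e" ;;
    *) echo "usage: qmod_node_cmd disable|enable <node>" >&2; return 2 ;;
  esac
  [ -n "$2" ] || { echo "missing node name" >&2; return 2; }
  # Single quotes in the output keep the shell from globbing the '*'.
  printf "qmod %s '*@%s'\n" "$flag" "$2"
}
```

For example, `qmod_node_cmd disable node01` prints `qmod -d '*@node01'`. After running the printed command, `qstat -f` should show the node's queue instances with a `d` in the states column, though the exact output may vary between SGE versions.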
Kevin Panko
user322498
2

If you're running SGE 6.1 or later, here's the best way. Create a new host group called @disabled:

qconf -ahgrp @disabled

Create a new resource quota set with

qconf -arqs

and, in the editor that opens, give it the rule

limit hosts @disabled to slots=0

Now, to disable a host, just add it to the host group:

qconf -aattr hostgroup hostlist MYHOST @disabled

To re-enable the host, remove it from the host group:

qconf -dattr hostgroup hostlist MYHOST @disabled

This stops new jobs from being scheduled on the machine while letting the jobs already running there finish.
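
If you would rather not go through the editor for the resource quota set, the definition can be written to a file and loaded from there. A minimal sketch, with some assumptions: the function name `write_disabled_rqs`, the rule set name `disabled_hosts` and the default file name are all made up for illustration. As far as I know, a new rule set is not active unless `enabled` is TRUE, so the sketch sets it explicitly.

```shell
#!/bin/sh
# Hypothetical helper: write a resource quota set definition that gives
# hosts in the @disabled host group zero slots.
write_disabled_rqs() {
  out="${1:-disabled_hosts.rqs}"   # assumed file name, pick your own
  cat > "$out" <<'EOF'
{
   name         disabled_hosts
   description  NONE
   enabled      TRUE
   limit        hosts @disabled to slots=0
}
EOF
}
```

After calling `write_disabled_rqs`, the file should be loadable with `qconf -Arqs disabled_hosts.rqs`, and `qconf -srqs disabled_hosts` should print it back. Check both against the qconf man page for your version first.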

Kevin Panko
0

gridsuspend - Suspends one or more hosts from executing grid jobs. Example: gridsuspend -s -r "reason comment here" <host_name> 1d

0

Without knowing your SGE version I can't say for certain that this will do what you want, but qconf -de foo deletes the execution host foo, and qconf -ae foo then adds foo back to the execution host list.

Tok