I'm having some trouble with a specific node. Until I resolve it, I don't want any jobs to run on ii. How can I temporarily take this node out of the nodes "pool"?
Asked
Active
Viewed 1.2k times
4 Answers
7
To disable:
qmod -d *@node_name
To re-enable:
qmod -e *@node_name
Kevin Panko
- 7,466
user322498
- 71
- 1
- 2
2
If you're running 6.1 or better, here's the best way. Create a new hostgroup called @disabled
qconf -ahgrp @disabled
Create a new resource quota set with
qconf -arqs limit hosts @disabled to slots=0
Now, to disable a host, just add it to the host group
qconf -aattr hostgroup hostlist MYHOST @disabled
To reenable the host, remove it from the host group
qconf -dattr hostgroup hostlist MYHOST @disabled
This process will stop new jobs from being scheduled to the machine and allow the currently running jobs to complete.
Kevin Panko
- 7,466
0
gridsuspend - Suspends one or more hosts from executing grid jobs. Example: gridsuspend -s -r "reason comment here" <host_name> 1d
0
Without knowing your SGE version I cannot say for certain that this will achieve the desired outcome, however, qconf -de foo will delete the execution host foo. qconf -ae foo will then add the host foo back to the execution list.
Tok
- 489
- 4
- 3