Submitted by tensai on
Second class I unfortunately came in a little late, so I missed a few things (including a seat!), which is sad because it was a good presentation. Scott McQuay showed us TORQUE. It's a cluster resource manager, which is a fancy way to say that it handles distributing jobs to a cluster of computers. Overall I was impressed with it. The only weak point appears to be the scheduler where the default scheduler is naive and a more advanced one is proprietary only. And unfortunately I can't see anywhere that I could actually use this software, but it's a good one to keep in mind.
Miscellaneous Notes:
- resource manager
- pbs_mom - per-node daemon
- pbs_server - master
- qstat
- pbs_sched - basic fifo scheduler
- moab - commercial scheduler with more brains
- set number of cpus per back-end
- jobs can be wall time limited
- dancer shell w/ public keys
Recent comments