wiki:GCCluster

Version 3 (modified by Pieter Neerincx, 12 years ago) (diff)

--

GCC cluster

The GCC has its own 480 core cluster. The main workhorses are 10 servers with 48 cores, 256 GB, 1 GBit management NIC and a 10 GBit NIC for a dedicated IO connection to a 2 PB shared GPFS for storage.

Servers

FunctionDNSIPDeamonsComments
User interface nodecluster.gcc.rug.nl195.169.22.156- (clients only)Login node to submit and inspect jobs.
Relatively powerful machine.
Users can run code outside the scheduler for debugging purposes.
scheduler VMscheduler01195.169.22.214pbs_server
maui
Dedicated scheduler
No user logins if this one is currently the production scheduler
scheduler VMscheduler02195.169.22.190pbs_server
maui
Dedicated scheduler
No user logins if this one is currently the production scheduler
Execution nodetargetgcc01192.168.211.191pbs_momDedicated test node: only the test-short and test-long queues run on this node.
Crashing the test node shall not affect production!.
Execution nodetargetgcc02192.168.211.192pbs_momRedundant production node: only the default gcc and priority gaf queues run on this node.
Execution nodetargetgcc03192.168.211.193pbs_momRedundant production node: only the default gcc and priority gaf queues run on this node.
Execution nodetargetgcc04192.168.211.194pbs_momRedundant production node: only the default gcc and priority gaf queues run on this node.
Execution nodetargetgcc05192.168.211.195pbs_momRedundant production node: only the default gcc and priority gaf queues run on this node.
Execution nodetargetgcc06192.168.211.196pbs_momRedundant production node: only the default gcc and priority gaf queues run on this node.
Execution nodetargetgcc07192.168.211.197pbs_momRedundant production node: only the default gcc and priority gaf queues run on this node.
Execution nodetargetgcc08192.168.211.198pbs_momRedundant production node: only the default gcc and priority gaf queues run on this node.
Execution nodetargetgcc09192.168.211.199pbs_momRedundant production node: only the default gcc and priority gaf queues run on this node.
Execution nodetargetgcc10192.168.211.200pbs_momRedundant production node: only the default gcc and priority gaf queues run on this node.

PBS software / flavour

The current setup uses the resource manager Torque 2.5.12 combined with the scheduler Maui 3.3.1.

Maui

Runs only on the schedulers with config files in

/usr/local/maui/

Torque

Torque clients are available on all servers.
Torque's pbs_server deamon runs only on the schedulers.
Torque's pbs_mom daemon runs only on the execution nodes where the real work is done.
Torque config files are installed in

/var/spool/torque/

Dual scheduler setup

Installation details

Attachments (9)

Download all attachments as: .zip