Job monitoring
Computer Center
Job monitoring
Your jobs in the queue (use -nobatch for alternative listing):
[wgs] ~ condor_q [-nobatch]
Check why your jobs might not be able to start at all:
[wgs] ~ condor_q -better-analyze <job-id>
Check for held jobs (i.e.: ran into issues during runtime) and show the failure reason:
[wgs] ~ condor_q -hold
Determine where your jobs are running:
[wgs] ~ condor_status -constraint 'RemoteUser == "<account>@zeuthen.desy.de"'
Simple cluster overview (alternatively limiting to nodes with special constraints):
[wgs] ~ condor_status -compact
[wgs] ~ condor_status -constraint 'GPUs_DeviceName == "NVIDIA GeForce RTX 3090"' -compact
Simple cluster monitoring dashboard