The page contains a collection of command hints. These are commands that you might find useful for determining the status of jobs on RA.
Example:
[tkaiser@ra ~]$ showbf Partition Tasks Nodes StartOffset Duration StartDate --------- ----- ----- ------------ ------------ -------------- ALL 24 3 00:00:00 00:59:57 13:04:34_09/15 ALL 16 2 00:00:00 INFINITY 13:04:34_09/15
This shows that I could start a job on 3 nodes with a run time of under 1 hour and have it run immediately.
Example: get the esstimated start time of job 83862.
[tkaiser@ra ~]$ showstart 83862 job 83862 requires 24 procs for 1:06:00:00 Estimated Rsv based start in 2:51:33 on Tue Sep 15 16:06:35 Estimated Rsv based completion in 1:08:51:33 on Wed Sep 16 22:06:35 Best Partition: base [tkaiser@ra ~]$
Example: check the status of job 79042.
[tkaiser@ra ~]$ checkjob 79042 job 79042 AName: DFT_config6 State: Idle Creds: user:tkaiser group:tkaiser account:challenge class:MEDIUM2 WallTime: 00:00:00 of 2:00:00:00 SubmitTime: Mon Aug 24 16:45:18 (Time Queued Total: 17:51:09 Eligible: 16:25:34) Total Requested Tasks: 32 Req[0] TaskCount: 32 Partition: ALL Memory >= 0 Disk >= 0 Swap >= 0 Opsys: --- Arch: --- Features: --- BypassCount: 12 Partition List: base StartPriority: 10219 rejected for State - rejected for Reserved - NOTE: job cannot run in partition base (idle procs do not meet requirements : 0 of 32 procs found) idle procs: 32 feasible procs: 0 Node Rejection Summary: [State: 264][Reserved: 4] [tkaiser@ra ~]$
This shows that job 79042, with name DFT_config6 is currently not running, (Idle) It was submitted by user tkaiser who is in the challenge accounting group. See:http://petra.mines.edu/cgi-bin/rastats/index.cgi for a listing of machine usage by accounting groups.
2 days were requested for this job on 32 processors.
It is currently not running because 32 free processors were not found the last time the scheduler checked for available nodes. Note: Other 32 node jobs might run first if they have a higher priority or if they are submitted for a short time.
Example:
[tkaiser@ra ~]$ mdiag -p
diagnosing job priority information (partition: ALL)
Job PRIORITY* Cred(Accnt:Class) FS( User:Accnt) Serv(QTime)
Weights -------- 1( 1: 1) 20( 5: 21) 1( 2)
64023 78030 0.0( 0.0: 0.0) 2.4( 5.0: -5.8) 97.6(39978)
78157 11456 0.0( 0.0: 0.0) 71.7( 2.3: 19.0) 28.3(1622.)
...
...
79029 10300 0.0( 0.0: 0.0) 79.7( 2.3: 19.0) 20.3(1044.)
79040 10254 0.0( 0.0: 0.0) 80.1( 2.3: 19.0) 19.9(1021.)
79042 10236 0.0( 0.0: 0.0) 80.2( 2.3: 19.0) 19.8(1012.)
78138 8389 0.0( 0.0: 0.0) 53.4( -6.2: 12.2) 46.6(1953.)
78143 8389 0.0( 0.0: 0.0) 53.4( -6.2: 12.2) 46.6(1953.)
78144 8389 0.0( 0.0: 0.0) 53.4( -6.2: 12.2) 46.6(1953.)
78145 8389 0.0( 0.0: 0.0) 53.4( -6.2: 12.2) 46.6(1953.)
78146 8389 0.0( 0.0: 0.0) 53.4( -6.2: 12.2) 46.6(1953.)
78148 8363 0.0( 0.0: 0.0) 53.6( -6.2: 12.2) 46.4(1940.)
...
...
...
78384 -8558 0.0( 0.0: 0.0) 96.8(-26.8:-14.7) 3.2(148.0)
78385 -8558 0.0( 0.0: 0.0) 96.8(-26.8:-14.7) 3.2(148.0)
79051 -6103 0.0( 0.0: 0.0) 95.2( -2.5:-14.7) 4.8(162.0)
79052 -1693 0.0( 0.0: 0.0) 86.8( 4.3: -5.8) 13.2(152.0)
Percent Contribution -------- 0.0( 0.0: 0.0) 55.2( 11.8: 47.8) 44.8( 44.8)
* indicates absolute/relative system prio set on job
[tkaiser@ra ~]$
The first column gives the jobs number and the second the priority. Priorities change over time. A higher number indicates that a greater priority has been assigned to the job scheduler. There are two factors that effect this priority: the time a job has been waiting; and the "Fair Share" algorithm.
The right-hand column contains the time a job has been eligible to run in parentheses (minutes.) Next to this time is a priority related to the wait time.
Under the "FS( User:Accnt)" column there are three numbers. The number outside of the parentheses is again a priority. The inside numbers are the weighting factors for a user and the users account group. If a user has recently made heavy use of RA, then they will have a lower individual weighting. If the rest of the people in the account group have used a lot of time recently, then the second weighting will also be low.