You can retrieve a report on all jobs which have failed within the last 30 days using the following variation of the sacct command:
sacct
$ sacct --state=F --starttime now-30days --endtime now
You can adjust the time window using the –starttime and –endtime parameters and the job status with the –state option. Some common options:
–starttime
–endtime
–state
# All failed jobs in the previous week: $ sacct --state=F --starttime now-7days --endtime now
# Jobs that ran out of memory in the last twelve hours: $ sacct --state=OOM --starttime now-12hours --endtime now
# Jobs that hit a runtime timeout in the last 45 minutes: $ sacct --state=TO --starttime now-45minutes --endtime now
For reference, the common Slurm job status codes include:
Back to FAQ
Table of Contents
Main Content Sections
Documentation Tools