Alec Koumjian
02/22/2023, 1:45 PM
BackoffLimitExceeded. It is easy for a k8s job to enter this state if the cluster's scaling has reached capacity and the job cannot be scheduled for a sufficient period of time. This leaves runs hanging indefinitely, as Kubernetes will not continue trying to schedule the job once capacity is available again.
daniel
02/22/2023, 2:03 PM
Alec Koumjian
02/22/2023, 2:04 PM
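
A minimal sketch of how the state described in the 1:45 PM message might be detected, so that a hanging run could be flagged instead of waiting forever. It assumes the Job object has already been fetched from the Kubernetes API as a plain dict (for example, parsed from `kubectl get job <name> -o json`). The function name `job_backoff_limit_exceeded` and the sample payload are hypothetical and are not part of any library or of the thread above.

```python
from typing import Any, Mapping


def job_backoff_limit_exceeded(job: Mapping[str, Any]) -> bool:
    """Return True if the Job reports a Failed condition with reason BackoffLimitExceeded.

    `job` is assumed to be a Job object as a plain dict, with the usual
    `status.conditions` list of {type, status, reason, ...} entries.
    """
    conditions = (job.get("status") or {}).get("conditions") or []
    return any(
        cond.get("type") == "Failed"
        and cond.get("status") == "True"
        and cond.get("reason") == "BackoffLimitExceeded"
        for cond in conditions
    )


# Hypothetical payload, trimmed to the fields the check reads.
example_job = {
    "metadata": {"name": "example-flow-run-job"},
    "status": {
        "conditions": [
            {
                "type": "Failed",
                "status": "True",
                "reason": "BackoffLimitExceeded",
                "message": "Job has reached the specified backoff limit",
            }
        ]
    },
}

print(job_backoff_limit_exceeded(example_job))
```

A Job in this state is terminal from Kubernetes' point of view, so anything polling it would need to treat the result as a failure (or resubmit the job) rather than keep waiting.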