Alec Koumjian02/22/2023, 1:45 PM
. It is easy for a k8s job to enter this state if the cluster's scaling has reached capacity and a job is unable to be scheduled for a sufficient period of time. This leaves runs hanging indefinitely as kubernetes will not continue to try to schedule the job after capacity is available again.
daniel02/22/2023, 2:03 PM
Alec Koumjian02/22/2023, 2:04 PM