Eliott Bigiaoui
02/01/2023, 9:51 PMENGINE_EVENT Still waiting for compute resources to spin up.
, it displayed RUN_FAILURE Agent db8a94e8 detected that run worker failed.
and the run stopped.
It is a job that usually runs correctly every two hours for a few seconds, not needing a lot of memory or compute resources.
Is there a reason why that run failed? Is there a way to automatically relaunch the job when the run failed for that reason?
Thanks!
(cc @Kevin Martin)daniel
02/01/2023, 10:23 PMfrom dagster import job
@job(tags={"dagster/max_retries": 3})
def sample_job():
pass
If you have a link to the run we can take a look at why it failedEliott Bigiaoui
02/03/2023, 4:40 PM