Hi, we had a failed job because engine could not f...
# dagster-plus
d
Hi, we had a failed job because engine could not find resources to spin up for about 4-5 minutes Can we do something to avoid this?
j
Hey @Deividas Jodogalvis what agent are you using? depending on the agent type i can recommend a few ways to make it more unlikely to fail to start
d
serverless agent metadata of agent:
Copy code
{
  "image_tag": "6522cfca-f55f630b",
  "version": "1.3.14rc2",
  "type": "ServerlessUserCodeLauncher"
}
and a question: would retry policy on an op help? would it work if no resources were spinned up? 😅
j
ah in serverless a sensor that triggers on run failures or retry policy on the job would help
d
ok, thanks! will add retry policy then