How long is "a long time to spin up"?
This article makes it sound like ECS+Fargate can spin up thousands of tasks in a few minutes. We're also hoping to scale horizontally: our job has some setup ops, but the bulk of it is ~1k ops that can run in parallel and take 1-20 minutes each. So a 1-minute startup overhead would be fine.
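To put rough numbers on that, here is a back-of-envelope sketch. It assumes every op runs in its own task and that all tasks start at once, which is the best case and not something I've confirmed Dagster or Fargate will actually do. The function names are just for illustration.

```python
# Back-of-envelope estimate using the figures above:
# ~1k parallel ops, 1-20 minutes each, ~1 minute of startup per task.
# Assumption: one task per op, all tasks launched at the same time.

def parallel_wall_clock(op_minutes, startup_min):
    """Wall-clock minutes when every op runs in its own task, all started together."""
    return startup_min + max(op_minutes)

def added_task_minutes(n_ops, startup_min):
    """Extra task-minutes spent purely on startup across all ops."""
    return n_ops * startup_min

if __name__ == "__main__":
    # The two extremes of the 1-20 minute range stand in for the full set of ops.
    print(parallel_wall_clock([1, 20], startup_min=1))
    print(added_task_minutes(1000, startup_min=1))
```

Under those assumptions the startup cost adds about a minute to the wall-clock time of the parallel phase, although it does add up across ~1k tasks in terms of billed task time.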
I guess I'd misunderstood something previously, thinking Dagster would run one op per task on ECS+Fargate.
In general, I've been pretty confused by the cross product of job-vs-op and executor. If the docs have, or could add, a summary table of what's available, that would be a big help.
I had been leaning towards ECS over EKS since it has a slightly simpler setup and lower overhead, because you don't have to run persistent clusters.