swenzel
01/14/2020, 10:50 AMalex
01/14/2020, 4:10 PMIs there any reason why you’re using your own k8s launcher in dagster-k8s instead of Dask Kubernetes?We are currently building out
dagster-celery
& dagster-k8s
together. I expect when we expand dagster-k8s
to support Dask we will use Dask Kubernetes.max
01/14/2020, 7:31 PMswenzel
01/15/2020, 7:06 AMmax
01/15/2020, 6:07 PMswenzel
01/16/2020, 1:29 PMwould love to know more about the issues you had observing jobsIt's hard to figure out the reason why a job is in a certain state. If it's running, it might actually be running but might as well be that the pod which was spawned by the job is stuck in pending. Why? Maybe because of an image pull error, Maybe because it's not schedulable. Is the image pull error due to a missing/wrong image or due to missing/wrong credentials? Is it not schedulable because the cluster is full? Does it fit on a node at all? Most of that you can find out by reading the pod events, but you won't find that information on the job resource. Therefore, since we have to watch the pods anyway, we'll just manage them ourselves and thereby only need to watch and deal with one instead of two k8s resources.
max
01/16/2020, 6:02 PM