# announcements
a
Hi all, does anyone know if Dagster supports executing Google Cloud Dataflow jobs, specifically those that are implemented in Scala? We have a substantial amount of code in that world and are interested in potentially using dagster as an orchestrator for those jobs.
n
👋 hey Andrew, we haven’t directly built support for Cloud Dataflow, but it should be pretty straightforward to implement a Dataflow resource if you’re interested in giving it a shot
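For what it's worth, here is a rough sketch of what such a resource could look like. It is not something that ships with Dagster. It assumes the Scala job is a Beam/Scio pipeline packaged as a JAR and launched with the standard Beam `--runner=DataflowRunner`, `--project`, and `--region` options. The names `build_dataflow_command`, `DataflowLauncher`, `make_dataflow_resource`, and the `GCP_PROJECT` / `GCP_REGION` environment variables are all made up for illustration.

```python
import os
import subprocess
from typing import Dict, List, Optional


def build_dataflow_command(
    jar_path: str,
    main_class: str,
    project: str,
    region: str,
    job_args: Optional[Dict[str, str]] = None,
) -> List[str]:
    """Build the JVM command line that submits a Beam/Scio job to Dataflow."""
    cmd = [
        "java", "-cp", jar_path, main_class,
        "--runner=DataflowRunner",
        f"--project={project}",
        f"--region={region}",
    ]
    # Job-specific options are appended in sorted order so the command is deterministic.
    for key, value in sorted((job_args or {}).items()):
        cmd.append(f"--{key}={value}")
    return cmd


class DataflowLauncher:
    """Hypothetical helper object that a Dagster resource could hand to solids."""

    def __init__(self, project: str, region: str):
        self.project = project
        self.region = region

    def run_jar(self, jar_path: str, main_class: str,
                job_args: Optional[Dict[str, str]] = None) -> subprocess.CompletedProcess:
        cmd = build_dataflow_command(jar_path, main_class, self.project, self.region, job_args)
        # check=True raises CalledProcessError if the launcher exits non-zero.
        return subprocess.run(cmd, check=True)


def make_dataflow_resource():
    """Wrap the launcher in a Dagster resource (dagster is imported lazily)."""
    from dagster import resource

    @resource
    def dataflow_resource(_init_context):
        # Assumption: project and region come from environment variables.
        return DataflowLauncher(
            project=os.environ["GCP_PROJECT"],
            region=os.environ.get("GCP_REGION", "us-central1"),
        )

    return dataflow_resource
```

A solid that requires this resource could then call something like `context.resources.dataflow.run_jar(...)`, where the resource key `dataflow` is whatever name you register it under in your mode definition.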
a
@nate sure, any pointers on where to start with such a resource?
a
excellent, thanks
@nate one more question: is it possible to invoke docker containers via dagster as well? Would the approach be similar?
n
sure, when you say docker containers do you mean on GKE?
or some other substrate
a
gke
n
awesome. and are you interested in instigating execution of externally-defined containers, or running Dagster solids as containers? for the former, it's pretty straightforward to write a solid where the function body kicks off execution of externally-defined containers. for the latter, we’ve got a Helm chart which you can use to deploy Dagster to GKE (we use GKE for our Dagster deployments internally) and execute pipelines there
a
it’d be the former
i.e., a constituent solid in a pipeline would run a container
n
yeah, I don’t know if we have a great example of that in the codebase, but we use this for instigating execution of solids-as-containers and entire pipelines-as-containers on k8s. As long as you’re using the Python kubernetes API and have the appropriate k8s privileges in place, it should be pretty straightforward to instigate container execution on k8s from a solid
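A minimal sketch of that idea, in case it helps: a solid whose body submits a Kubernetes Job that runs an externally-defined image. This is an illustration under assumptions, not how Dagster does it internally. It assumes the `kubernetes` Python client is installed and that the code runs inside the cluster with permission to create Jobs. The helper names, the job name, and the image are hypothetical.

```python
from typing import Any, Dict, List, Optional


def build_job_manifest(name: str, image: str,
                       command: Optional[List[str]] = None) -> Dict[str, Any]:
    """Build a plain-dict batch/v1 Job manifest that runs one container once."""
    container: Dict[str, Any] = {"name": name, "image": image}
    if command:
        container["command"] = list(command)
    return {
        "apiVersion": "batch/v1",
        "kind": "Job",
        "metadata": {"name": name},
        "spec": {
            "backoffLimit": 0,  # do not retry a failed pod
            "template": {
                "spec": {"containers": [container], "restartPolicy": "Never"},
            },
        },
    }


def launch_job(manifest: Dict[str, Any], namespace: str = "default"):
    """Submit the Job via the kubernetes Python client (imported lazily)."""
    from kubernetes import client, config

    # Assumption: running inside the cluster; use config.load_kube_config() from a laptop.
    config.load_incluster_config()
    return client.BatchV1Api().create_namespaced_job(namespace=namespace, body=manifest)


def make_container_solid():
    """Define a solid whose body kicks off the external container."""
    from dagster import solid

    @solid
    def run_external_container(context):
        # Hypothetical job name and image; substitute your own.
        manifest = build_job_manifest("scala-job", "gcr.io/my-project/scala-job:latest")
        job = launch_job(manifest)
        context.log.info("Submitted k8s Job {}".format(job.metadata.name))

    return run_external_container
```

Note that this only submits the Job. If downstream solids depend on the container finishing, the solid would also need to poll the Job status before returning.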
a
oh got it