Hey 👋
I’m looking for advice on which deployment approach I should look at before I invest in learning about ECS or K8s
Keep in mind this is all new territory for me.
We are starting to build a forecasting framework around dagster which uses a combination of DBT and python and will run on AWS. The DBT steps are for pre-processing / post-processing and obviously don’t need a lot of resources.
We also have python ops/ assets downstream that will have various resource requirements. At a minimum we will want probably X1 instances for loading large datasets and doing some light work with multiprocessing.
Then we have a model training op / asset which will need a GPU.
Given that I will need different instances for each op in the compute graph, what are my options for deployment?
The ECS deployment tutorial seems to be doable with my level of knowledge but there’s nothing in there about specific compute resources per op (from what I can see) - any resources on this would be greatly appreciated.
P.s. I use ops / assets above because I have used ops in the past but will try and use the asset based approach going forward if there is no reason not to