Alvaro Arias
05/30/2022, 10:53 AM
from dagster import op

def process_clients_file():
    dataframe = get_dataframe()
    clients_array = process_dataframe(dataframe)
    store_clients(clients_array)

@op(required_resource_keys={"dgraph_manager"})
def store_clients(context, clients_array):
    # Load each client into Dgraph, one at a time.
    for client in clients_array:
        context.resources.dgraph_manager.load_client(client)
I need to speed up the "for" loop, but I don't know if it's possible to do that in Dagster. Maybe I need to run this step in another container or something like that. Looking into the documentation, I found that Dagster has support for Dask, but I read "we use Dask to orchestrate execution of the steps in a job, not to parallelize computation within those steps." so now I'm lost.
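For reference, one way the loop could be parallelized inside the op itself is a plain thread pool from the Python standard library. This is only a sketch: it assumes `load_client` is I/O-bound (a network call to Dgraph) and safe to call from several threads at once, neither of which is confirmed in this thread. The helper name `load_clients_concurrently` and the `max_workers` default are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def load_clients_concurrently(load_client, clients, max_workers=8):
    """Call load_client(client) for every client using a pool of threads.

    Returns the number of clients processed. An exception raised by
    load_client is re-raised here when its result is collected.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # list() consumes the iterator so that worker exceptions surface here.
        results = list(pool.map(load_client, clients))
    return len(results)


# Hypothetical use inside the op from the question (assumes load_client
# is thread-safe, which is not confirmed here):
#
# @op(required_resource_keys={"dgraph_manager"})
# def store_clients(context, clients_array):
#     load_clients_concurrently(
#         context.resources.dgraph_manager.load_client, clients_array
#     )
```

Threads only help if the time is spent waiting on the database. If `load_client` is CPU-bound, a process pool or splitting the work across several ops would be the more likely fit.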
Thanks!
Isaac Harris-Holt
05/30/2022, 11:00 AM
Alvaro Arias
05/30/2022, 11:03 AM
owen
05/31/2022, 10:02 PM
Alvaro Arias
06/01/2022, 3:19 PM
owen
06/01/2022, 3:43 PM