Bennett Kanuka
04/26/2021, 5:40 PM
FileManager
but I can't get any further than that.
Could someone explain how I would do this or point to an example of a pipeline that passes file handles rather than pickled data?
Edit: this would be running on GCP in production, so I would be using the dagster_gcp.gcs_file_manager, but I don't know how to use it.
alex
04/26/2021, 5:56 PM
The IOManager abstraction should help with this use case:
https://docs.dagster.io/concepts/io-management/io-managers#io-managers
Bennett Kanuka
04/26/2021, 5:58 PM
alex
04/26/2021, 6:21 PM
FileManager is a resource:
https://docs.dagster.io/concepts/modes-resources
The idea is that you can have different implementations of that resource, so that you use the file system locally and GCS in prod.
With the file manager setup, you interact with that resource in the body of your solids to read and write FileHandles <-> file data.
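To make the pattern concrete, here is a rough sketch in plain Python. It deliberately does not import Dagster: Handle, LocalFiles, InMemoryFiles, produce, consume and run are all hypothetical names made up for illustration, and InMemoryFiles only stands in for a remote store such as GCS. The point is just that the step bodies call the file manager themselves and pass a small handle between them, while the implementation behind the file manager can be swapped.

```python
import os
import tempfile
from dataclasses import dataclass


@dataclass(frozen=True)
class Handle:
    """Hypothetical file handle: a small reference to data, not the data itself."""
    path: str


class LocalFiles:
    """Hypothetical local implementation that writes bytes under a directory."""

    def __init__(self, base_dir):
        self.base_dir = base_dir
        self._count = 0

    def write_data(self, data):
        self._count += 1
        path = os.path.join(self.base_dir, f"file_{self._count}")
        with open(path, "wb") as f:
            f.write(data)
        return Handle(path)

    def read_data(self, handle):
        with open(handle.path, "rb") as f:
            return f.read()


class InMemoryFiles:
    """Hypothetical stand-in for a remote store such as GCS (no network calls)."""

    def __init__(self):
        self._blobs = {}

    def write_data(self, data):
        key = f"mem://blob_{len(self._blobs) + 1}"
        self._blobs[key] = data
        return Handle(key)

    def read_data(self, handle):
        return self._blobs[handle.path]


def produce(file_manager):
    # The step body writes the data itself and returns only the handle.
    return file_manager.write_data(b"hello")


def consume(file_manager, handle):
    # The downstream step receives the handle and reads the data explicitly.
    return file_manager.read_data(handle).upper()


def run(file_manager):
    # Same steps, whichever file manager implementation is supplied.
    return consume(file_manager, produce(file_manager))
```

Calling run(LocalFiles(tempfile.mkdtemp())) and run(InMemoryFiles()) exercises the same two steps against different storage, which is roughly what swapping the resource per environment buys you.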
IOManager is another resource, but it's a special one in that Dagster will take care of invoking it behind the scenes, so that your @solid code can be oriented around just working with the data, completely separate from how it's stored.
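As a rough illustration of that separation, here is a toy sketch in plain Python. It is not Dagster's API: InMemoryStorage, run_steps and the two step functions are hypothetical names, and the runner is a stand-in for what the framework does for you. The step functions only deal with data, and the runner is the only code that touches storage.

```python
class InMemoryStorage:
    """Hypothetical storage layer, keyed by the name of the step that produced a value."""

    def __init__(self):
        self.saved = {}

    def store(self, step_name, value):
        self.saved[step_name] = value

    def load(self, step_name):
        return self.saved[step_name]


def load_numbers():
    # Step body: no knowledge of where its output will be stored.
    return [1, 2, 3]


def total(numbers):
    # Step body: receives plain data, not a path or a handle.
    return sum(numbers)


def run_steps(steps, storage):
    """Toy runner: stores each step's output and loads it back for the next step."""
    previous = None
    for step in steps:
        if previous is None:
            result = step()
        else:
            result = step(storage.load(previous))
        storage.store(step.__name__, result)
        previous = step.__name__
    return storage.load(previous)
```

Swapping InMemoryStorage for something that writes to disk or a bucket would not require touching load_numbers or total, which is the property being described here.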