Rory LaMendola
08/24/2023, 3:08 PMs3_prefix
in the io manager resource definition on the job itself, but i'm wondering if there's a way of dynamically setting the s3 prefix in an op depending on some inputs to the op. for example, if we're running a pipeline that takes in customer_id
as an input for some of the op configs, is there a way to include that in the s3_prefix so that things are more easily searchable in s3?jamie
08/24/2023, 3:19 PMRory LaMendola
08/24/2023, 3:21 PMjamie
08/24/2023, 3:25 PMRory LaMendola
08/24/2023, 3:25 PMjamie
08/24/2023, 3:27 PMRory LaMendola
08/24/2023, 3:27 PMRory LaMendola
08/24/2023, 3:28 PMjamie
08/24/2023, 3:30 PMjamie
08/24/2023, 3:31 PMRory LaMendola
08/24/2023, 3:32 PMjamie
08/24/2023, 3:34 PM@asset
function in dagster is responsible for updating that data asset. So each time you materialize an asset, the contents of the “data asset” is replaced. You can do partitioned assets, or incremental updates to the asset, but the basic case is that the asset is replaced each time.
You don’t loose the ability to see old runs (like you can still see the logs and stuff), but we don’t keep old versions of the data assets around by default. You could write that functionality yourself with I/O managers thoughjamie
08/24/2023, 3:35 PMRory LaMendola
08/24/2023, 3:45 PM