Tom Reilly
07/26/2022, 9:40 PMAssetMaterialization
. This is working, but when viewing the assets in dagit there is no lineage. Is there a way we can link these so that the lineage shows sftp file asset --> local compressed file --> local uncompressed file(s) --> s3 file
? Is there a better approach to replicate this pattern using assets instead of ops?sean
07/26/2022, 10:12 PMTom Reilly
07/26/2022, 10:49 PMsandy
07/27/2022, 1:34 AMTom Reilly
07/27/2022, 2:51 PMcontext.log_event(
AssetMaterialization(
asset_key="vendor_abc_s3_files",
partition=partition_date,
metadata={
...attributes about the file...
}
)
)
We currently derive runtime asset keys like job_name/run_id/op_name/filename
but I'm wondering if that is too narrowly scoped and a more generalized asset key with multiple partitions is a better approach.sandy
07/27/2022, 11:35 PMI'm understanding you correctly regarding runtime asset partitions, would the SDA be something like "vendor_abc_s3_files" with a 1:1 relationship between the asset partitions and s3 files? Would a similar pattern in ops be logging multiple materializations for "vendor_abc_s3_files" with a partition date and more specific file info attached via metadata?Exactly
Keshav
07/28/2022, 5:51 AMHebo Yang
07/29/2022, 10:07 PMsandy
07/29/2022, 10:08 PMHebo Yang
07/29/2022, 10:10 PMsandy
07/29/2022, 10:12 PMHebo Yang
07/29/2022, 10:13 PM