Timo Vink
03/30/2023, 7:07 PM
job
• Use an S3 sensor to create a dynamically partitioned asset for each output written to S3 by this job
• Use `asset`s for the (pure) downstream tasks
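A rough sketch of the bookkeeping such an S3 sensor would need on each tick, written in plain Python with no Dagster or S3 calls. The function names, the `job-output/` prefix, and the file names are all made up for illustration; in a real sensor the resulting keys would presumably be registered as dynamic partitions, with one run requested per new key.

```python
from typing import Iterable, List, Set


def partition_key_for(s3_key: str, prefix: str) -> str:
    """Derive a partition key from an S3 object key by stripping the prefix."""
    return s3_key[len(prefix):] if s3_key.startswith(prefix) else s3_key


def new_partition_keys(
    s3_keys: Iterable[str], known: Set[str], prefix: str = ""
) -> List[str]:
    """Return partition keys for objects that are not registered yet, sorted."""
    candidates = {partition_key_for(k, prefix) for k in s3_keys}
    return sorted(candidates - known)


# Hypothetical tick: two objects in the bucket, one of them already registered.
listed = ["job-output/2023-03-29.parquet", "job-output/2023-03-30.parquet"]
known = {"2023-03-29.parquet"}
to_add = new_partition_keys(listed, known, prefix="job-output/")
```

Only the keys in `to_add` would need a new partition, so re-listing the same bucket on every tick stays idempotent.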
But in this model I don't have a single place in Dagit to see my data end-to-end. The jobs and their logs don't show up in the asset graph. The link between the job and the assets is implicit via S3 rather than explicit, as it is between assets that depend on other assets.
Timo Vink
03/30/2023, 7:10 PM
sean
03/31/2023, 10:31 PM
Timo Vink
03/31/2023, 10:35 PM
Timo Vink
03/31/2023, 10:37 PM
sean
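04/03/2023, 4:56 PM
from dagster import AssetKey, asset

@asset
def my_asset(context):
    key = AssetKey(["my_asset"])
    # Private API; assumed to take a list of keys and return {asset_key: {partition_key: count}}.
    counts = context.instance.get_materialization_count_by_partition([key])
    if counts.get(key, {}).get(context.partition_key, 0) > 0:
        raise Exception("partition already materialized")
    ...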
That’s a private API, but I can expand the public context.get_asset_provenance to work with partitions this week; that will also let you tell whether it’s been materialized.
Timo Vink
04/03/2023, 7:49 PM
Timo Vink
04/03/2023, 7:52 PM
Timo Vink
04/04/2023, 1:39 AM