nq8tc3y6
07/18/2023, 6:31 AMdef get_partitions(current_time):
return build_partitions_from_api(call_api())
partition_definition = DynamicPartitionsDefinition(get_partitions)
def get_run_config_for_partition(partition):
build_config(blah blah)
partitioned_config = PartitionedConfig(partition_definition, get_run_config_for_partition)
@job(config=partitioned_config)
def my_etl():
blah blah
I see severals problems :
- API is called too often
- I use a configured job. I would prefer use partition_def
My second solution is :
Use a DynamicPartitionsDefinition with a name, and create an asset that add a partition with context.instance.add_dynamic_partitions()
Then, I can add a schedule that materialize this asset and run my job.
Is there a better solution?YH
07/18/2023, 7:20 AMDailyPartitionsDefinition
https://docs.dagster.io/_apidocs/partitions#dagster.DailyPartitionsDefinitionnq8tc3y6
07/18/2023, 7:38 AMYH
07/18/2023, 7:39 AMYH
07/18/2023, 7:40 AMhas_dynamic_partition
allows me to keep track if the partition has been processed or notclaire
07/18/2023, 9:19 PM