Jonny Wray
02/14/2023, 2:08 PMowen
02/14/2023, 6:32 PMNothing
, which indicates to Dagster that the io_manager does not need to be invoked when handling the outputs/inputs. See: https://dagster.slack.com/archives/C01U954MEER/p1676310924565129?thread_ts=1676257567.623189&cid=C01U954MEER for a bit more discussion there.
For question 2: I'm most familiar with dbt/Dagster here. The load_assets_from_dbt_project/manifest
functions support a partition_key_to_vars
function, which allows you to define a translation between a dagster partition key (e.g. 2023-01-01
) to a dictionary of dbt vars (e.g. {"run_date": "2023-01-01"}
). So you can define your dbt assets to be daily partitioned, as well as a function to turn those partitions in to variables in the dbt runtime. The dagster-airbyte integration does not currently support partitions as far as I'm aware, but it seems like something similar might make sense here, if the incremental updates to the airbyte stream are regular (i.e. in 1-day chunks). If they're not, I think it'd be fine to just model the airbyte sync as an unpartitioned assetJonny Wray
02/14/2023, 6:53 PMBenoit Fayolle
02/24/2023, 9:06 AMpartition_key_to_vars
function working but I don't understand why only the partition key is exposed. In my view, massive value of partitions come from the fact that the user is able to backload or rerun past partitions. Without the partition end, I have to basically rewrite my own partition logic inside partition_key_to_vars
.Benoit Fayolle
02/24/2023, 9:12 AM