Fede Isas
03/14/2023, 7:22 PMtenant_id
). <100 million rows total count (between all tables and tenants).
I'd like to periodically (hourly?) generate S3 parquet files per table per tenant. /lake/{tenant_id}/{table}.parquet
.
Transformations are basically plucking and renaming columns.
My research lead me to consider Airbyte + Dagster to accomplish this. Am I on the right track?Adam Bloom
03/14/2023, 7:25 PMFede Isas
03/14/2023, 7:28 PMAdam Bloom
03/14/2023, 7:56 PMFede Isas
03/14/2023, 8:38 PMAdam Bloom
03/14/2023, 8:42 PMFede Isas
03/14/2023, 8:43 PMAdam Bloom
03/14/2023, 9:49 PMFede Isas
03/14/2023, 9:53 PM