I was curious if one can put together a 100% open-source equivalent to what is considered Modern Data Stack and wrote a blog to share my findings. Very curious to hear your thoughts and if I missed any great products!
08/19/2021, 10:25 PM
You appear to have accidentally put dagster at the end rather than the beginning of the list of transformation tool options 😉
08/19/2021, 10:26 PM
I would love to see Dagster disrupt the Airflow family and become the go-to orchestrator!
08/19/2021, 10:28 PM
Jesting aside - I propose adding Impala to the warehousing category.
I'm also finding the venerable SQLite a powerful addition to my toolbox, both for local dev / test environments and for "single user" datamarts via cross-compliations like https://github.com/sql-js/sql.js/
08/19/2021, 10:29 PM
Is there a SaaS product that Impala “replaces”?
just to make the reference easy to relate to for the readers
08/20/2021, 8:57 AM
Not sure. The company I work for went from Redshift to Impala (on prem)