https://dagster.io/ logo
Title
m

Mykola Palamarchuk

01/17/2022, 3:32 PM
Hi Team! I'd like to understand how Dagster uses a database for internal storage. Is it recommended to use external one for better reliability (instead of the one created by Helm chart)? Does Dagster make a lot of load on the database? Is it a good idea to use the same database for Dagster internal storage and for ETL jobs?
a

Alex Service

01/17/2022, 3:35 PM
Good questions. Somewhere in the documentation, it mentions that using an external db is a good idea. I can’t speak to load, as my use-case is currently pretty small, but when I floated the idea of using an existing database instance server to a devops guy I know, he didn’t raise any concerns
m

Mykola Palamarchuk

01/17/2022, 3:56 PM
I'm also curious about table name collisions...
a

Alex Service

01/17/2022, 4:06 PM
How so?
m

Mykola Palamarchuk

01/17/2022, 4:43 PM
I can't find a list of database tables that dagster creates for itself. Where does it live in documentation?
a

Anatoly Laskaris

01/17/2022, 5:03 PM
I think you should give dagster it's own database that only dagster will use. This way there will be no collisions.
By database I mean not whole postgres server, but
database
that you can create with
CREATE DATABASE dagster
statement
👆 1
a

Alex Service

01/17/2022, 6:15 PM
I wasn’t clear in my original message, but Anatoly says it more clearly 🙂
d

daniel

01/17/2022, 7:09 PM
Once you're up and running we do recommend using an external database, yeah: https://docs.dagster.io/deployment/guides/kubernetes/deploying-with-helm#database - that way if you uninstall your helm chart you won't lose all your data, and you can take advantage of managed DB solutions like RDS