Eduardo Muñoz
03/20/2023, 4:58 PMalex
03/22/2023, 2:45 PMAs I’m new to Dagster, I’m wondering if this is the best approach. The current UI is extremely slow, sometimes unresponsive when loading runs, for example.That set-up should be fine, but it depends on the resources available to the processes in question. What version of dagster is running? How much cpu/mem does the dagit server have? How much cpu/mem does the database have? If this is an older instance, running
dagster instance migrate
/ dagster instance reindex
may improve database performance.
thinking about proposing a migration to a different deployment system. One Dagit Instance per repository, each one with its own local storage and DBThis may address resource contention by making more resources available, and if this achieves usability you desire then it should be a fine option.
delete runs older than a certain age. Is this the best approach?Yea at this point. Looks something like this im guessing https://github.com/dagster-io/dagster/discussions/12047
Eduardo Muñoz
03/22/2023, 3:11 PMThat set-up should be fine, but it depends on the resources available to the processes in question.Dagster version: 1.0.12 Server specs: t3a.xlarge, 4vCPU and 16GiB (for the past 7 days, cpu has never been more than 70%, usually around 30% and RAM always between 3-6 GiB) DB specs: RDS db.t3.micro (2 vCPU and 1GiB) Postgres 12.11 (for the past 7 days, cpu has been around 10-20% and read iops around 75-150; im going to enable performance insights to get more visibility into this; however, using
pgcli
, the query times are not that high, maybe around 5-10 seconds to get all the runs for example)
We already run migrate as part of the startup script
This may address resource contention by making more resources available, and if this achieves usability you desire then it should be a fine option.thanks!
Yea at this point. Looks something like this im guessing https://github.com/dagster-io/dagster/discussions/12047yes, it mostly looks like that 🙂
alex
03/22/2023, 3:32 PMim going to enable performance insightsGreat - ya some additional profiling information will likely be very useful.
The current UI is extremely slow, sometimes unresponsive when loading runs, for example.Is this the page that shows a paginated list of recent runs or the event log for a specific run? Ild be surprised if the former was an issue but the latter can vary quite dramatically based on how you use the system. For example, using
context.log
in large volumes or with large data can drastically degrade performance.Eduardo Muñoz
03/22/2023, 3:42 PMIs this the page that shows a paginated list of recent runs or the event log for a specific run?sometimes the schedules do not even get displayed 😅 deleting some runs did help with loading runs for a specific job we do use context.log in every job would it be better to use a different logging method? what's the benefit of context.log, besides being stored with the runs?
alex
03/22/2023, 3:57 PMsometimes the schedules do not even get displayedwhich page is this exactly? The schedule pages im thinking of have been rewritten since 1.0.12 so upgrading may address this issue.
deleting some runs did help with loading runs for a specific jobI believe a DB index was added to address this in 1.1.18, so upgrading may address this issue.
would it be better to use a different logging method? what’s the benefit of context.log, besides being stored with the runs?
context.log
entries show up next to the structured events in the main event feed. High signal messages are useful to surface in this way. “Compute logs” or the raw stderr/stdout captured during execution is a more efferent way to capture various log output https://docs.dagster.io/deployment/dagster-instance#compute-log-storage. These are available to view in dagit with this button:Eduardo Muñoz
03/22/2023, 4:02 PMwhich page is this exactly? The schedule pages im thinking of have been rewritten since 1.0.12 so upgrading may address this issue.navbar -> deployment -> schedules anything under deployment is very slow to load, sometimes, does not load at all
“Compute logs” or the raw stderr/stdouthere you mean print statement and/or built in logger?
alex
03/22/2023, 4:16 PMEduardo Muñoz
03/22/2023, 4:16 PM