# announcements
m
Hi everyone! I've updated my dagster setup to 0.8.2 and found that there is no reload button anymore, and at the same time the dagit instance doesn't watch files, so I have to redeploy my containers or run dagit and the celery workers under pm2 or some other watching process manager. Is this correct or am I missing something? Why have you eliminated the reload functionality?
s
Hey matas we had to temporarily remove that function to account for the new architecture. We are re-adding it this week. The backend is already implemented (see https://dagster.phacility.com/D3519) and the frontend will be done on Thursday.
Were you relying on that capability in prod?
m
Just trying to figure out how to organize code updates. We're not using dagster in prod yet, but wanted to start this week after you released 0.8.0. So what was your initial idea about it? Should dagit watch files, or should it be manually restarted / redeployed?
And the same question about celery workers. Would it be OK if I put them all under pm2 control and use CliApiRunLauncher?
s
so with any of the run launchers we always launch a fresh process
so it will pick up the latest code on disk
m
and what happens with the workers then? Imagine I have a long-running pipeline with several solids. I start it with the run launcher and it spawns an independent process for communication with the workers, right? But what happens if I change the last solid's code under a worker while the pipeline is running? Will it immediately pick up the updated version, or does the worker have to be restarted to update?
s
i’m not sure precisely what configuration you have running. and @nate is more intimately familiar with how the workers are structured
what engine are you using within the celery worker?
m
what do you mean by "engine"?
Oh, I see, it seems that brings us back to the DinD issue 😃
n
hey @matas yes by default in the current implementation, the celery workers execute user-code in process (this function is what is executed on the workers), which means they need to be redeployed on code change right now
m
and if I restart a running worker it would break the pipeline, right?
n
only if there is a task currently executing on that particular worker - if you use TERM this should happen gracefully: https://docs.celeryproject.org/en/stable/userguide/workers.html#restarting-the-worker
m
so if I tell my pm2 watcher to restart the worker process gracefully with TERM on code changes - that should do the trick?
n
yes I think that should do the trick, but let me know if you run into any issues
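For reference, the pm2 side of that setup might look roughly like this (a sketch only: the script path, watch directory, and app name are hypothetical, and the `--kill-timeout` value is arbitrary; note also that pm2 stops processes with SIGINT by default, so the worker entrypoint may still need to translate that into SIGTERM for celery):

```shell
# Illustrative pm2 invocation (paths and names are hypothetical).
# pm2 restarts the process when files under ./my_pipelines change, and
# waits up to 10 minutes before escalating to SIGKILL, so that in-flight
# celery tasks have time to finish their warm shutdown.
pm2 start ./run_worker.sh \
  --name dagster-celery-worker \
  --watch ./my_pipelines \
  --kill-timeout 600000
```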
m
sure, thank you!
eeee it is working! Redeploying seems not to affect running tasks. I had to make a little wrapper around dagster-celery to trap SIGINT and substitute it with SIGTERM
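A minimal sketch of such a wrapper, assuming the worker command is passed in as arguments (this is an illustration of the idea, not the actual script used here):

```shell
#!/usr/bin/env bash
# Hypothetical sigterm-wrapper.sh: run the given command, e.g.
#   ./sigterm-wrapper.sh dagster-celery worker start
# and translate SIGINT (which pm2 sends on stop/restart) into SIGTERM,
# which celery treats as a warm shutdown: finish in-flight tasks, then exit.

"$@" &            # start the worker as a child process
child=$!

# On SIGINT (or SIGTERM), forward SIGTERM to the worker.
trap 'kill -TERM "$child" 2>/dev/null' INT TERM

# 'wait' returns early when a trapped signal arrives; wait a second time
# so the wrapper only exits once the worker has actually shut down.
wait "$child"
trap - INT TERM
wait "$child"
```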
n
🎉
m
bad news @nate it was working because of my misconfiguration - all the jobs were executing not in celery but in the same process. Now I've made a simple example pipeline of 5 solids and a celery executor with 2 workers. Each solid has a sleep(x) statement inside; I change x and redeploy while the pipeline is running. That led me to a very strange state: I deployed solids with 3, 6, and 10 second sleeps, and it is predictable that when I redeploy mid-run, each newly started solid picks up the freshest code (though of course that is not the desired behavior). But after my last deploy (the 6s sleep) both workers restarted and I still had random versions of the code executing O_o. I even waited for all the pipelines to complete, redeployed again (with a 1s sleep), and still got 2 and 3 versions of the code executing in one pipeline at random
So the main question is still the same: how can I isolate my pipeline runs from new code deployments (without losing solid parallelism)? Can this be achieved with dagster now, or what would we have to implement to get it?