Ilya Pogorelsky
08/31/2023, 2:51 PMSYNC_JOB_CONFIG = {
"execution": {
"config": {
"multiprocess": {
"max_concurrent": 1,
},
}
}
}
@job(name="sync", config=SYNC_JOB_CONFIG)
def sync_job():
op3(op2(op1()))
but getting errors from dagster on startup:
re.errors.DagsterInvalidConfigError: Error in config when building job 'sync'
Error 1: Missing required config entry "ops" at the root. Sample config for missing entry: {'ops': {'op1': {'config': {'s3_bucket': '...'}}, 'op2': {'config': {'filename': '...', 's3_bucket': '...'}}}}
I don’t want to specify any configuration for these ops, as the config will be injected by the sensor RunRequest
Ilya Pogorelsky
08/31/2023, 3:04 PMZach
08/31/2023, 3:43 PMIlya Pogorelsky
08/31/2023, 4:07 PMk8s_job_executor
we actually wanted to have in-process
like behavior for some of the jobs we launch inside a pod. need to experiment a bit.Ilya Pogorelsky
08/31/2023, 5:56 PMin_process
nor multiprocess_executor.configured({"max_concurrent": 1})
is constraining the number of job runs that happen concurrently.
may be these are meant to constraint how many ops/assets can run within a given job run instance, but not across ALL job run requests.Zach
08/31/2023, 5:58 PMIlya Pogorelsky
08/31/2023, 5:59 PMZach
08/31/2023, 6:00 PMZach
08/31/2023, 6:01 PMIlya Pogorelsky
08/31/2023, 6:02 PMZach
08/31/2023, 6:02 PMIlya Pogorelsky
08/31/2023, 6:02 PMZach
08/31/2023, 6:03 PM