Davi
10/11/2022, 11:20 AMIssac Loo
10/11/2022, 1:18 PMRunRequest(run_config={config_from_files(...),
"resources": {
"SOMETHING": {
"config": {...}
}
}
})
Zachary Bluhm
10/11/2022, 2:09 PMMax Wong
10/11/2022, 2:24 PMdagster_gcp.dataproc_op
under dagster-io, but nowhere elseBinoy Shah
10/11/2022, 3:23 PMAbdul Salama
10/11/2022, 4:48 PMMax Wong
10/11/2022, 4:56 PMLiam Evans
10/11/2022, 5:22 PM2022-10-11 18:22:28 +0100 - dagster.daemon.SensorDaemon - INFO - Not checking for any runs since no sensors have been started.
2022-10-11 18:23:23 +0100 - dagster.daemon.SchedulerDaemon - INFO - Checking for new runs for the following schedules: twitter_bot_graph_schedule
2022-10-11 18:23:23 +0100 - dagster.daemon.SchedulerDaemon - WARNING - Using UTC as the timezone for twitter_bot_graph_schedule as it did not specify an execution_timezone in its definition.
2022-10-11 18:23:23 +0100 - dagster.daemon.SchedulerDaemon - INFO - No new tick times to evaluate for twitter_bot_graph_schedule
2022-10-11 18:23:23 +0100 - dagster.daemon.SensorDaemon - INFO - Not checking for any runs since no sensors have been started.
2022-10-11 18:24:25 +0100 - dagster - DEBUG - twitter_bot_graph - faa141de-63f2-44d1-b695-101959254621 - 42154 - RUN_START - Started execution of run for "twitter_bot_graph".
2022-10-11 18:24:25 +0100 - dagster - DEBUG - twitter_bot_graph - faa141de-63f2-44d1-b695-101959254621 - 42154 - ENGINE_EVENT - Executing steps in process (pid: 42154)
2022-10-11 18:24:25 +0100 - dagster - DEBUG - twitter_bot_graph - faa141de-63f2-44d1-b695-101959254621 - 42154 - RESOURCE_INIT_STARTED - Starting initialization of resources [io_manager].
2022-10-11 18:24:25 +0100 - dagster - DEBUG - twitter_bot_graph - faa141de-63f2-44d1-b695-101959254621 - 42154 - RESOURCE_INIT_SUCCESS - Finished initialization of resources [io_manager].
2022-10-11 18:24:25 +0100 - dagster - DEBUG - twitter_bot_graph - faa141de-63f2-44d1-b695-101959254621 - 42154 - get_next_fixture_obj - LOGS_CAPTURED - Started capturing logs for step: get_next_fixture_obj.
Daniel Katz
10/11/2022, 7:03 PMbuild_asset_reconciliation_sensor
to keep some assets up to date as up-stream assets change, but one problem I ran into when deploying on K8s is that I can’t specify the tags for auto generated job the sensor kicks off. I’m using the K8sRunLauncher and I want to control the TTL of the K8s jobs that get spun up to run the sensors job. Would it be possible to expose the tags
param in build_asset_reconciliation_sensor
which would be added the the Job runs the sensor requests?Pablo Beltran
10/11/2022, 8:07 PMsaravan kumar
10/11/2022, 8:33 PMdagster._core.errors.DagsterExecutionHandleOutputError: Error occurred while handling output "result" of step "multiply_the_word":
File "/usr/local/lib/python3.7/site-packages/dagster/_core/execution/plan/execute_plan.py", line 224, in dagster_event_sequence_for_step
for step_event in check.generator(step_events):
File "/usr/local/lib/python3.7/site-packages/dagster/_core/execution/plan/execute_step.py", line 366, in core_dagster_event_sequence_for_step
for evt in _type_check_and_store_output(step_context, user_event, input_lineage):
File "/usr/local/lib/python3.7/site-packages/dagster/_core/execution/plan/execute_step.py", line 423, in _type_check_and_store_output
for evt in _store_output(step_context, step_output_handle, output, input_lineage):
File "/usr/local/lib/python3.7/site-packages/dagster/_core/execution/plan/execute_step.py", line 578, in _store_output
handle_output_gen,
File "/usr/local/lib/python3.7/site-packages/dagster/utils/_init__.py", line 432, in iterate_with_context
return
File "/usr/local/lib/python3.7/contextlib.py", line 130, in _exit_
self.gen.throw(type, value, traceback)
File "/usr/local/lib/python3.7/site-packages/dagster/_core/execution/plan/utils.py", line 82, in solid_execution_error_boundary
) from e
The above exception was caused by the following exception:
PermissionError: [Errno 13] Permission denied: '/opt/dagster'
File "/usr/local/lib/python3.7/site-packages/dagster/_core/execution/plan/utils.py", line 47, in solid_execution_error_boundary
yield
File "/usr/local/lib/python3.7/site-packages/dagster/utils/_init__.py", line 430, in iterate_with_context
next_output = next(iterator)
File "/usr/local/lib/python3.7/site-packages/dagster/_core/execution/plan/execute_step.py", line 557, in _gen_fn
gen_output = output_manager.handle_output(output_context, output.value)
File "/usr/local/lib/python3.7/site-packages/dagster/_core/storage/fs_io_manager.py", line 143, in handle_output
mkdir_p(os.path.dirname(filepath))
File "/usr/local/lib/python3.7/site-packages/dagster/utils/_init__.py", line 182, in mkdir_p
os.makedirs(path)
File "/usr/local/lib/python3.7/os.py", line 213, in makedirs
makedirs(head, exist_ok=exist_ok)
Reid Beels
10/12/2022, 12:17 AMJB
10/12/2022, 2:06 AMrun_coordinator:
module: dagster._core.run_coordinator
class: QueuedRunCoordinator
config:
max_concurrent_runs: 25
What am I missing?Roman Maliushkin
10/12/2022, 10:30 AM@run_status_sensor
decorator as well as implemented in @schedule
decorator with context.scheduled_execution_time
attribute?
With @schedule
I can use get previous date and pass it to config, like:
date = (context.scheduled_execution_time - timedelta(days=1)).strftime(DATE_FORMAT)
How can I do the same but with @run_status_sensor
decorator?Issac Loo
10/12/2022, 1:59 PMGerben van der Huizen
10/12/2022, 2:32 PMdagster/dagster-k8s
image we are using from 0.14.9
to 1.0.12
. After running dagster instance migrate
it seemed that all our ops, jobs, schedules etc. were all still intact and working fine. However, when browsing through the schedules interface I noticed that the Partition Set
was no longer being displayed (given a None
value) . I can still view all the partitions when I navigate to the run view of individual jobs + the partition_set
column in the runs
table seems to be filled as well.
Are we missing some migration step for this particular field? Or maybe we need to add some additional config to the job in the repository?Eegan K
10/12/2022, 2:40 PMSlackbot
10/12/2022, 2:44 PMJordan
10/12/2022, 3:22 PMsensor
to detect files with high minimum_interval_seconds
parameter (e.g. 12 hours) because the associated job performs many expensive queries on a database. In my daily use, I would like to be able to trigger the sensor tick manually (without having to wait 11 hours in the worst case) if I notice that an important file has been dropped.
Is there a way to meet this need?
I also wonder if it is possible to get the cursor of a sensor inside an op/asset to update it.Laurent
10/12/2022, 4:17 PMDuncan
10/12/2022, 6:17 PMdagster._core.errors.DagsterLaunchFailedError: Error during RPC setup for executing run: dagster._core.errors.DagsterRunNotFoundError: gRPC server could not load run 03e55140-2491-41b5-9560-496679471899 in order to execute it. Make sure that the gRPC server has access to your run storage.
Pablo Beltran
10/12/2022, 6:40 PMDavid Eklund
10/12/2022, 7:37 PMtga
10/12/2022, 10:04 PMAdam Bloom
10/12/2022, 11:51 PMretry_policy
argument that ops do. Is there another way to add retries to asset materializations?
2. If I move an existing partitioned job to be a partitioned asset, I'm assuming I'll need to do some sort of manual migration to the database to keep materialization history? Or is there some magic way of keeping the names consistent to avoid that?Apoorv Yadav
10/13/2022, 8:11 AMYogic Wahyu
10/13/2022, 10:25 AME1012 14:14:23.690197467 86 <http://fork_posix.cc:76]|fork_posix.cc:76]> Other threads are currently calling into gRPC, skipping fork() handlers
The last line of log is like this (attached images) which shows several dagster daemon with different pid, I think. My guess is the daemon create replica or duplicated process and that affect the heartbeat mechanism (not detected by dagit then the daemon is hanged/stuck/restarted/killed. Therefore, the job/pipeline is not triggered). Any keen ideas about this?Jamie Lee
10/13/2022, 10:34 AMdbt_build_op
to use the --resource-type
parameter that is available in the dbt build
cli command?Denis Maciel
10/13/2022, 11:55 AMdagster
and dagster-gcp
are incompatible.
To reproduce the issue, create a file called <http://requirements.in|requirements.in>
and list your dependencies in it. In my case,
dagster
dagster-gcp
After having installed pip-tools, run pip-compile <http://requirements.in|requirements.in>
.
I am getting the following error:
Could not find a version that matches grpcio<1.48.1,<2.0dev,>=1.32.0,>=1.33.2,>=1.38.1,>=1.43.0,>=1.49.1 (from dagster==1.0.12->-r <http://requirements.in|requirements.in> (line 1))
...
There are incompatible versions in the resolved dependencies:
grpcio>=1.49.1 (from grpcio-status==1.49.1->google-api-core[grpc]==2.10.2->google-cloud-storage==2.5.0->dagster-gcp==0.16.12->-r <http://requirements.in|requirements.in> (line 2))
grpcio<2.0dev,>=1.38.1 (from google-cloud-bigquery==2.34.4->dagster-gcp==0.16.12->-r <http://requirements.in|requirements.in> (line 2))
grpcio<1.48.1,>=1.32.0 (from dagster==1.0.12->-r <http://requirements.in|requirements.in> (line 1))
grpcio>=1.43.0 (from grpcio-health-checking==1.43.0->dagster==1.0.12->-r <http://requirements.in|requirements.in> (line 1))
grpcio<2.0dev,>=1.33.2 (from google-api-core[grpc]==2.10.2->google-cloud-storage==2.5.0->dagster-gcp==0.16.12->-r <http://requirements.in|requirements.in> (line 2))
Is that a known issue? Any workarounds for that?Eegan K
10/13/2022, 1:39 PM