Bennett Norman
03/01/2023, 9:26 PM
2023-03-01 12:24:01 -0900 - dagster - ERROR - epacems_job - ebe6e649-285d-4915-874e-b5adee4f0ce3 - 79821 - RUN_FAILURE - Execution of run for "epacems_job" failed. Steps failed: ['hourly_emissions_epacems'].
Has anyone else experienced this issue?
daniel
03/01/2023, 10:11 PM
Bennett Norman
03/01/2023, 11:33 PM
`build_reconstructable_job` to run jobs via a Python script. When I use `execute_in_process` I get the full stack trace.
daniel
03/01/2023, 11:47 PM
Bennett Norman
03/01/2023, 11:48 PM
2023-03-01 14:43:28 [ INFO] catalystcoop.pudl.extract.ferc1:653 Dropping the old FERC Form 1 SQLite DB if it exists.
2023-03-01 14:43:28 -0900 - dagster - DEBUG - ferc_to_sqlite_job - 328c2bdb-24e8-41a8-96a3-fab9e0788198 - 98735 - xbrl2sqlite - STEP_START - Started execution of step "xbrl2sqlite".
2023-03-01 14:43:28 [ INFO] catalystcoop.pudl.extract.xbrl:84 Dropping the old FERC Form 1 XBRL derived SQLite DB if it exists.
2023-03-01 14:43:29 -0900 - dagster - ERROR - ferc_to_sqlite_job - 328c2bdb-24e8-41a8-96a3-fab9e0788198 - 98735 - xbrl2sqlite - STEP_FAILURE - Execution of step "xbrl2sqlite" failed.
dagster._core.errors.DagsterExecutionStepExecutionError: Error occurred while executing op "xbrl2sqlite"::
AssertionError: You are attempting to drop your database without setting clobber to False
Stack Trace:
File "/Users/bendnorman/mambaforge/envs/pudl-dev/lib/python3.10/site-packages/dagster/_core/execution/plan/utils.py", line 55, in op_execution_error_boundary
yield
File "/Users/bendnorman/mambaforge/envs/pudl-dev/lib/python3.10/site-packages/dagster/_utils/__init__.py", line 471, in iterate_with_context
next_output = next(iterator)
File "/Users/bendnorman/mambaforge/envs/pudl-dev/lib/python3.10/site-packages/dagster/_core/execution/plan/compute_generator.py", line 121, in _coerce_solid_compute_fn_to_iterator
result = invoke_compute_fn(
File "/Users/bendnorman/mambaforge/envs/pudl-dev/lib/python3.10/site-packages/dagster/_core/execution/plan/compute_generator.py", line 115, in invoke_compute_fn
return fn(context, **args_to_pass) if context_arg_provided else fn(**args_to_pass)
File "/Users/bendnorman/catalyst/pudl/src/pudl/extract/xbrl.py", line 143, in xbrl2sqlite
sqlite_engine = _get_sqlite_engine(form.value, output_path, clobber)
File "/Users/bendnorman/catalyst/pudl/src/pudl/extract/xbrl.py", line 91, in _get_sqlite_engine
pudl.helpers.drop_tables(sqlite_engine, clobber=clobber)
File "/Users/bendnorman/catalyst/pudl/src/pudl/helpers.py", line 1031, in drop_tables
raise AssertionError(
2023-03-01 14:43:29 -0900 - dagster - DEBUG - ferc_to_sqlite_job - 328c2bdb-24e8-41a8-96a3-fab9e0788198 - 98713 - ENGINE_EVENT - Multiprocess executor: parent process exiting after 7.74s (pid: 98713)
2023-03-01 14:43:29 -0900 - dagster - ERROR - ferc_to_sqlite_job - 328c2bdb-24e8-41a8-96a3-fab9e0788198 - 98713 - RUN_FAILURE - Execution of run for "ferc_to_sqlite_job" failed. Steps failed: ['dbf2sqlite', 'xbrl2sqlite'].
Bennett Norman
03/01/2023, 11:48 PM
daniel
03/02/2023, 12:47 AM
Bennett Norman
03/02/2023, 12:55 AM
daniel
03/02/2023, 3:51 PM
`all_events` property that's a list of events - iterating through those and printing them out would probably have the failure message in it every time
Bennett Norman
03/02/2023, 6:58 PM
daniel
03/02/2023, 7:07 PM
daniel
03/02/2023, 7:07 PM
Bennett Norman
03/02/2023, 7:08 PM
Bennett Norman
03/02/2023, 7:10 PM
daniel
03/02/2023, 7:13 PM
run_config={"execution": {"config": {"in_process": {}}}}
Bennett Norman
03/02/2023, 7:15 PM
`"execution": {"config": {"in_process": {}}}` is added to `execute_job`
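A rough sketch of the two suggestions in this exchange: select the in-process executor through run config, then collect any failure events from the result. The helper names `with_in_process_executor` and `failure_messages` are made up for illustration. The event attributes used (`is_failure`, `step_key`, `message`) are assumed to match Dagster's `DagsterEvent`, and the stand-in events are not real Dagster output.

```python
from copy import deepcopy
from types import SimpleNamespace


def with_in_process_executor(run_config=None):
    """Return a copy of run_config with the in-process executor selected."""
    merged = deepcopy(run_config) if run_config else {}
    # Same fragment as quoted in the thread; replaces any existing "execution" key.
    merged["execution"] = {"config": {"in_process": {}}}
    return merged


def failure_messages(events):
    """Collect one readable line for every failure event in an iterable."""
    lines = []
    for event in events:
        # Assumption: each event exposes is_failure, step_key and message,
        # as Dagster's DagsterEvent is believed to.
        if getattr(event, "is_failure", False):
            step = getattr(event, "step_key", None) or "<run>"
            lines.append(f"{step}: {getattr(event, 'message', '')}")
    return lines


# Stand-in events for demonstration only. With Dagster installed you would
# pass result.all_events from execute_job(...) instead.
demo_events = [
    SimpleNamespace(is_failure=False, step_key="dbf2sqlite",
                    message="Started execution of step."),
    SimpleNamespace(is_failure=True, step_key="xbrl2sqlite",
                    message='Execution of step "xbrl2sqlite" failed.'),
]

run_config = with_in_process_executor({"resources": {}})
for line in failure_messages(demo_events):
    print(line)
```

Running everything in one process trades parallelism for simpler debugging, since step errors are raised and logged in the same process that launched the run.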
daniel
03/02/2023, 7:16 PM
Bennett Norman
03/02/2023, 7:16 PM
daniel
03/02/2023, 7:16 PM
Bennett Norman
03/02/2023, 7:18 PM
pudl_settings is being depcrated in favor of environment variablesPUDL_OUTPUT and PUDL_CACHE
pudl_settings is being depcrated in favor of environment variablesPUDL_OUTPUT and PUDL_CACHE. For more infosee: <https://catalystcoop-pudl.readthedocs.io/en/dev/dev/dev_setup.html>
sqlite and parquet directories are no longer being used. Make sure there is a single directory named 'output' at the root of your workspace. For more info see: <https://catalystcoop-pudl.readthedocs.io/en/dev/dev/dev_setup.html>
pudl_settings is being depcrated in favor of environment variablesPUDL_OUTPUT and PUDL_CACHE
pudl_settings is being depcrated in favor of environment variablesPUDL_OUTPUT and PUDL_CACHE. For more infosee: <https://catalystcoop-pudl.readthedocs.io/en/dev/dev/dev_setup.html>
sqlite and parquet directories are no longer being used. Make sure there is a single directory named 'output' at the root of your workspace. For more info see: <https://catalystcoop-pudl.readthedocs.io/en/dev/dev/dev_setup.html>
2023-03-02 10:17:33 [ WARNING] catalystcoop.pudl.workspace.setup:107 pudl_settings is being depcrated in favor of environment variablesPUDL_OUTPUT and PUDL_CACHE. For more infosee: <https://catalystcoop-pudl.readthedocs.io/en/dev/dev/dev_setup.html>
2023-03-02 10:17:33 [ WARNING] catalystcoop.pudl.workspace.setup:128 sqlite and parquet directories are no longer being used. Make sure there is a single directory named 'output' at the root of your workspace. For more info see: <https://catalystcoop-pudl.readthedocs.io/en/dev/dev/dev_setup.html>
/Users/bendnorman/catalyst/pudl/src/pudl/ferc_to_sqlite/cli.py:135: ExperimentalWarning: "build_reconstructable_job" is an experimental function. It may break in future versions, even between dot releases. To mute warnings for experimental functionality, invoke warnings.filterwarnings("ignore", category=dagster.ExperimentalWarning) or use one of the other methods described at <https://docs.python.org/3/library/warnings.html#describing-warning-filters>.
ferc_to_sqlite_reconstructable_job = build_reconstructable_job(
No dagster instance configuration file (dagster.yaml) found at /Users/bendnorman/catalyst/pudl-work. Defaulting to loading and storing all metadata with /Users/bendnorman/catalyst/pudl-work. If this is the desired behavior, create an empty dagster.yaml file in /Users/bendnorman/catalyst/pudl-work.
/Users/bendnorman/catalyst/pudl/src/pudl/ferc_to_sqlite/cli.py:141: ExperimentalWarning: "execute_job" is an experimental function. It may break in future versions, even between dot releases. To mute warnings for experimental functionality, invoke warnings.filterwarnings("ignore", category=dagster.ExperimentalWarning) or use one of the other methods described at <https://docs.python.org/3/library/warnings.html#describing-warning-filters>.
result = execute_job(
2023-03-02 10:17:34 -0900 - dagster - DEBUG - ferc_to_sqlite_job - 8ff4c146-9654-4a8b-8c94-a9a8a76b79de - 97201 - RUN_START - Started execution of run for "ferc_to_sqlite_job".
2023-03-02 10:17:34 -0900 - dagster - DEBUG - ferc_to_sqlite_job - 8ff4c146-9654-4a8b-8c94-a9a8a76b79de - 97201 - ENGINE_EVENT - Executing steps using multiprocess executor: parent process (pid: 97201)
2023-03-02 10:17:34 -0900 - dagster - DEBUG - ferc_to_sqlite_job - 8ff4c146-9654-4a8b-8c94-a9a8a76b79de - 97201 - dbf2sqlite - STEP_WORKER_STARTING - Launching subprocess for "dbf2sqlite".
2023-03-02 10:17:34 -0900 - dagster - DEBUG - ferc_to_sqlite_job - 8ff4c146-9654-4a8b-8c94-a9a8a76b79de - 97201 - xbrl2sqlite - STEP_WORKER_STARTING - Launching subprocess for "xbrl2sqlite".
pudl_settings is being depcrated in favor of environment variablesPUDL_OUTPUT and PUDL_CACHE
pudl_settings is being depcrated in favor of environment variablesPUDL_OUTPUT and PUDL_CACHE. For more infosee: <https://catalystcoop-pudl.readthedocs.io/en/dev/dev/dev_setup.html>
sqlite and parquet directories are no longer being used. Make sure there is a single directory named 'output' at the root of your workspace. For more info see: <https://catalystcoop-pudl.readthedocs.io/en/dev/dev/dev_setup.html>
pudl_settings is being depcrated in favor of environment variablesPUDL_OUTPUT and PUDL_CACHE
pudl_settings is being depcrated in favor of environment variablesPUDL_OUTPUT and PUDL_CACHE. For more infosee: <https://catalystcoop-pudl.readthedocs.io/en/dev/dev/dev_setup.html>
sqlite and parquet directories are no longer being used. Make sure there is a single directory named 'output' at the root of your workspace. For more info see: <https://catalystcoop-pudl.readthedocs.io/en/dev/dev/dev_setup.html>
pudl_settings is being depcrated in favor of environment variablesPUDL_OUTPUT and PUDL_CACHE
pudl_settings is being depcrated in favor of environment variablesPUDL_OUTPUT and PUDL_CACHE. For more infosee: <https://catalystcoop-pudl.readthedocs.io/en/dev/dev/dev_setup.html>
sqlite and parquet directories are no longer being used. Make sure there is a single directory named 'output' at the root of your workspace. For more info see: <https://catalystcoop-pudl.readthedocs.io/en/dev/dev/dev_setup.html>
pudl_settings is being depcrated in favor of environment variablesPUDL_OUTPUT and PUDL_CACHE
pudl_settings is being depcrated in favor of environment variablesPUDL_OUTPUT and PUDL_CACHE. For more infosee: <https://catalystcoop-pudl.readthedocs.io/en/dev/dev/dev_setup.html>
sqlite and parquet directories are no longer being used. Make sure there is a single directory named 'output' at the root of your workspace. For more info see: <https://catalystcoop-pudl.readthedocs.io/en/dev/dev/dev_setup.html>
2023-03-02 10:17:42 -0900 - dagster - DEBUG - ferc_to_sqlite_job - 8ff4c146-9654-4a8b-8c94-a9a8a76b79de - 97213 - STEP_WORKER_STARTED - Executing step "xbrl2sqlite" in subprocess.
2023-03-02 10:17:42 -0900 - dagster - DEBUG - ferc_to_sqlite_job - 8ff4c146-9654-4a8b-8c94-a9a8a76b79de - 97212 - STEP_WORKER_STARTED - Executing step "dbf2sqlite" in subprocess.
2023-03-02 10:17:42 -0900 - dagster - DEBUG - ferc_to_sqlite_job - 8ff4c146-9654-4a8b-8c94-a9a8a76b79de - 97213 - xbrl2sqlite - RESOURCE_INIT_STARTED - Starting initialization of resources [datastore, ferc_to_sqlite_settings, io_manager].
2023-03-02 10:17:42 -0900 - dagster - DEBUG - ferc_to_sqlite_job - 8ff4c146-9654-4a8b-8c94-a9a8a76b79de - 97212 - dbf2sqlite - RESOURCE_INIT_STARTED - Starting initialization of resources [datastore, ferc_to_sqlite_settings, io_manager].
2023-03-02 10:17:42 -0900 - dagster - DEBUG - ferc_to_sqlite_job - 8ff4c146-9654-4a8b-8c94-a9a8a76b79de - 97213 - xbrl2sqlite - RESOURCE_INIT_SUCCESS - Finished initialization of resources [datastore, ferc_to_sqlite_settings, io_manager].
2023-03-02 10:17:42 -0900 - dagster - DEBUG - ferc_to_sqlite_job - 8ff4c146-9654-4a8b-8c94-a9a8a76b79de - 97212 - dbf2sqlite - RESOURCE_INIT_SUCCESS - Finished initialization of resources [datastore, ferc_to_sqlite_settings, io_manager].
2023-03-02 10:17:42 -0900 - dagster - DEBUG - ferc_to_sqlite_job - 8ff4c146-9654-4a8b-8c94-a9a8a76b79de - 97213 - LOGS_CAPTURED - Started capturing logs in process (pid: 97213).
2023-03-02 10:17:42 -0900 - dagster - DEBUG - ferc_to_sqlite_job - 8ff4c146-9654-4a8b-8c94-a9a8a76b79de - 97213 - xbrl2sqlite - STEP_START - Started execution of step "xbrl2sqlite".
2023-03-02 10:17:42 [ INFO] catalystcoop.pudl.extract.xbrl:84 Dropping the old FERC Form 1 XBRL derived SQLite DB if it exists.
2023-03-02 10:17:42 -0900 - dagster - DEBUG - ferc_to_sqlite_job - 8ff4c146-9654-4a8b-8c94-a9a8a76b79de - 97212 - LOGS_CAPTURED - Started capturing logs in process (pid: 97212).
2023-03-02 10:17:42 -0900 - dagster - DEBUG - ferc_to_sqlite_job - 8ff4c146-9654-4a8b-8c94-a9a8a76b79de - 97212 - dbf2sqlite - STEP_START - Started execution of step "dbf2sqlite".
2023-03-02 10:17:42 [ INFO] catalystcoop.pudl.extract.ferc1:653 Dropping the old FERC Form 1 SQLite DB if it exists.
2023-03-02 10:17:44 -0900 - dagster - DEBUG - ferc_to_sqlite_job - 8ff4c146-9654-4a8b-8c94-a9a8a76b79de - 97201 - ENGINE_EVENT - Multiprocess executor: parent process exiting after 9.99s (pid: 97201)
2023-03-02 10:17:44 -0900 - dagster - ERROR - ferc_to_sqlite_job - 8ff4c146-9654-4a8b-8c94-a9a8a76b79de - 97201 - RUN_FAILURE - Execution of run for "ferc_to_sqlite_job" failed. Steps failed: ['dbf2sqlite', 'xbrl2sqlite'].
Bennett Norman
03/02/2023, 7:20 PM
daniel
03/02/2023, 7:29 PM
Bennett Norman
03/02/2023, 7:45 PM
daniel
03/02/2023, 7:49 PM
Bennett Norman
03/02/2023, 7:52 PM
`dagster.reconstructable` to `dagster.build_reconstructable_job()`. This PR explains the context.
daniel
03/02/2023, 7:54 PM
Bennett Norman
03/02/2023, 7:56 PM
Bennett Norman
03/02/2023, 7:57 PM
daniel
03/02/2023, 7:57 PM
`reconstructable_kwargs={"loglevel": args.loglevel, "logfile": args.logfile},` makes a difference (all this back and forth debugging is entirely optional btw, you've already gone above and beyond)
daniel
03/02/2023, 7:58 PM
Bennett Norman
03/02/2023, 8:00 PM
reconstructable_kwargs={"loglevel": args.loglevel, "logfile": args.logfile},
Bennett Norman
03/02/2023, 8:01 PM
`reconstructable` to `build_reconstructable_job`
daniel
03/02/2023, 8:12 PM
daniel
03/02/2023, 8:13 PM
daniel
03/02/2023, 8:14 PM
Bennett Norman
03/02/2023, 8:17 PM
daniel
03/02/2023, 8:26 PM
Bennett Norman
03/02/2023, 8:27 PM
`ferc_to_sqlite` command.
daniel
03/02/2023, 8:32 PM
daniel
03/02/2023, 8:32 PM
Bennett Norman
03/02/2023, 8:33 PM
ferc_to_sqlite src/pudl/package_data/settings/etl_fast.yml
Bennett Norman
03/02/2023, 8:33 PM
daniel
03/02/2023, 8:33 PM
Bennett Norman
03/02/2023, 8:33 PM
daniel
03/02/2023, 8:33 PM
Bennett Norman
03/02/2023, 8:33 PM
Bennett Norman
03/02/2023, 8:34 PM
daniel
03/02/2023, 8:37 PM
2023-03-02 14:36:33 -0600 - dagster - DEBUG - ferc_to_sqlite_job - ba895ed4-8280-4453-8a80-e53f95906c8d - 7927 - STEP_WORKER_STARTED - Executing step "xbrl2sqlite" in subprocess.
2023-03-02 14:36:33 -0600 - dagster - DEBUG - ferc_to_sqlite_job - ba895ed4-8280-4453-8a80-e53f95906c8d - 7920 - STEP_WORKER_STARTED - Executing step "dbf2sqlite" in subprocess.
2023-03-02 14:36:33 -0600 - dagster - DEBUG - ferc_to_sqlite_job - ba895ed4-8280-4453-8a80-e53f95906c8d - 7927 - xbrl2sqlite - RESOURCE_INIT_STARTED - Starting initialization of resources [datastore, ferc_to_sqlite_settings, io_manager].
2023-03-02 14:36:33 -0600 - dagster - DEBUG - ferc_to_sqlite_job - ba895ed4-8280-4453-8a80-e53f95906c8d - 7920 - dbf2sqlite - RESOURCE_INIT_STARTED - Starting initialization of resources [datastore, ferc_to_sqlite_settings, io_manager].
2023-03-02 14:36:33 -0600 - dagster - DEBUG - ferc_to_sqlite_job - ba895ed4-8280-4453-8a80-e53f95906c8d - 7927 - xbrl2sqlite - RESOURCE_INIT_SUCCESS - Finished initialization of resources [datastore, ferc_to_sqlite_settings, io_manager].
2023-03-02 14:36:33 -0600 - dagster - DEBUG - ferc_to_sqlite_job - ba895ed4-8280-4453-8a80-e53f95906c8d - 7920 - dbf2sqlite - RESOURCE_INIT_SUCCESS - Finished initialization of resources [datastore, ferc_to_sqlite_settings, io_manager].
2023-03-02 14:36:33 -0600 - dagster - DEBUG - ferc_to_sqlite_job - ba895ed4-8280-4453-8a80-e53f95906c8d - 7927 - LOGS_CAPTURED - Started capturing logs in process (pid: 7927).
2023-03-02 14:36:33 -0600 - dagster - DEBUG - ferc_to_sqlite_job - ba895ed4-8280-4453-8a80-e53f95906c8d - 7920 - LOGS_CAPTURED - Started capturing logs in process (pid: 7920).
2023-03-02 14:36:33 -0600 - dagster - DEBUG - ferc_to_sqlite_job - ba895ed4-8280-4453-8a80-e53f95906c8d - 7927 - xbrl2sqlite - STEP_START - Started execution of step "xbrl2sqlite".
2023-03-02 14:36:33 [ INFO] catalystcoop.pudl.extract.xbrl:84 Dropping the old FERC Form 1 XBRL derived SQLite DB if it exists.
2023-03-02 14:36:33 -0600 - dagster - DEBUG - ferc_to_sqlite_job - ba895ed4-8280-4453-8a80-e53f95906c8d - 7920 - dbf2sqlite - STEP_START - Started execution of step "dbf2sqlite".
2023-03-02 14:36:33 [ INFO] catalystcoop.pudl.extract.ferc1:653 Dropping the old FERC Form 1 SQLite DB if it exists.
2023-03-02 14:36:33 [ INFO] catalystcoop.pudl.extract.ferc1:667 Creating a new database schema based on 2020.
2023-03-02 14:36:33 -0600 - dagster - ERROR - ferc_to_sqlite_job - ba895ed4-8280-4453-8a80-e53f95906c8d - 7927 - xbrl2sqlite - STEP_FAILURE - Execution of step "xbrl2sqlite" failed.
dagster._core.errors.DagsterExecutionStepExecutionError: Error occurred while executing op "xbrl2sqlite"::
Exception: SIMULATED FAILURE
Stack Trace:
File "/Users/dgibson/dagster/python_modules/dagster/dagster/_core/execution/plan/utils.py", line 55, in op_execution_error_boundary
yield
File "/Users/dgibson/dagster/python_modules/dagster/dagster/_utils/__init__.py", line 471, in iterate_with_context
next_output = next(iterator)
File "/Users/dgibson/dagster/python_modules/dagster/dagster/_core/execution/plan/compute_generator.py", line 121, in _coerce_solid_compute_fn_to_iterator
result = invoke_compute_fn(
File "/Users/dgibson/dagster/python_modules/dagster/dagster/_core/execution/plan/compute_generator.py", line 115, in invoke_compute_fn
return fn(context, **args_to_pass) if context_arg_provided else fn(**args_to_pass)
File "/Users/dgibson/pudl/src/pudl/extract/xbrl.py", line 145, in xbrl2sqlite
raise Exception("SIMULATED FAILURE")
daniel
03/02/2023, 8:37 PM
Bennett Norman
03/02/2023, 8:39 PM
daniel
03/02/2023, 8:48 PM
Bennett Norman
03/02/2023, 8:49 PM
daniel
03/02/2023, 8:49 PM
Bennett Norman
03/02/2023, 9:02 PM
daniel
03/02/2023, 9:03 PM
alex
03/02/2023, 9:15 PM
Although logging is thread-safe, and logging to a single file from multiple threads in a single process is supported, logging to a single file from multiple processes is not supported, because there is no standard way to serialize access to a single file across multiple processes in Python.
https://docs.python.org/3/howto/logging-cookbook.html#logging-to-a-single-file-from-multiple-processes
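One pattern the linked cookbook describes for this situation is to have worker processes send log records to a queue and let a single listener write the file. A minimal sketch, assuming a POSIX system where the `fork` start method is available; `worker`, `run_demo`, and the log format are illustrative names and choices, not anything taken from PUDL or Dagster.

```python
import logging
import logging.handlers
import multiprocessing
import os
import tempfile


def worker(queue, number):
    """Log from a child process by putting records on the shared queue."""
    logger = logging.getLogger(f"worker.{number}")
    logger.setLevel(logging.INFO)
    logger.propagate = False
    logger.addHandler(logging.handlers.QueueHandler(queue))
    logger.info("worker %d starting", number)
    try:
        raise ValueError(f"simulated failure in worker {number}")
    except ValueError:
        # QueueHandler folds the formatted traceback into the message text.
        logger.exception("worker %d failed", number)


def run_demo(log_path, n_workers=2):
    """Run workers that all log to log_path through one listener thread."""
    ctx = multiprocessing.get_context("fork")  # assumption: POSIX only
    queue = ctx.Queue()
    processes = [ctx.Process(target=worker, args=(queue, n)) for n in range(n_workers)]
    for process in processes:
        process.start()
    # The listener starts after forking, so the parent is single-threaded
    # at fork time. Only this listener ever touches the file.
    file_handler = logging.FileHandler(log_path)
    file_handler.setFormatter(logging.Formatter("%(processName)s %(name)s %(message)s"))
    listener = logging.handlers.QueueListener(queue, file_handler)
    listener.start()
    for process in processes:
        process.join()
    listener.stop()  # handles records already queued before returning
    file_handler.close()


log_path = os.path.join(tempfile.mkdtemp(), "combined.log")
run_demo(log_path)
with open(log_path) as f:
    contents = f.read()
print(contents)
```

Because only the parent process opens the file, writes from different workers cannot interleave mid-record, and tracebacks logged in a child still reach the file as text.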
Bennett Norman
03/02/2023, 9:16 PM
Bennett Norman
03/02/2023, 9:18 PM
loggingshutdown is still swallowing the stack traces.
Bennett Norman
03/02/2023, 9:18 PM
daniel
03/02/2023, 9:18 PM