Binh Pham
07/02/2020, 8:32 PMalex
07/02/2020, 9:37 PMsandy
07/02/2020, 9:39 PMsandy
07/02/2020, 9:40 PMBinh Pham
07/02/2020, 10:09 PMsandy
07/02/2020, 10:14 PMcluster:
existing: <your cluster id>
sandy
07/02/2020, 10:14 PMBinh Pham
07/02/2020, 10:16 PMBinh Pham
07/02/2020, 11:24 PMKeyError Traceback (most recent call last)
<command--1> in <module>
13
14 with open(filename, "rb") as f:
---> 15 exec(f.read())
16
<string> in <module>
<string> in main(step_run_ref_filepath, pipeline_zip)
<string> in setup_storage(step_run_ref)
KeyError: 'storage'
sandy
07/02/2020, 11:51 PMresources:
pyspark_step_launcher:
config:
storage: s3
Binh Pham
07/02/2020, 11:54 PMsandy
07/03/2020, 12:43 AMBinh Pham
07/03/2020, 2:53 AMsandy
07/03/2020, 4:36 PMstorage
goes at the same level as resources
):
storage:
s3:
config:
s3_bucket: dagster-scratch-80542c2
s3_prefix: simple-pyspark
sandy
07/04/2020, 3:38 PMBinh Pham
07/06/2020, 5:14 PMsandy
07/06/2020, 5:14 PMBinh Pham
07/06/2020, 7:16 PMDagsterRunNotFoundError Traceback (most recent call last)
<command--1> in <module>
13
14 with open(filename, "rb") as f:
---> 15 exec(f.read())
16
<string> in <module>
<string> in main(step_run_ref_filepath, pipeline_zip)
/databricks/python/lib/python3.7/site-packages/dagster/core/execution/plan/external_step.py in run_step_from_ref(step_run_ref)
188
189 def run_step_from_ref(step_run_ref):
--> 190 step_context = step_run_ref_to_step_context(step_run_ref)
191 return core_dagster_event_sequence_for_step(step_context, step_run_ref.prior_attempts_count)
/databricks/python/lib/python3.7/site-packages/dagster/core/execution/plan/external_step.py in step_run_ref_to_step_context(step_run_ref)
175 DagsterInstance.ephemeral(),
176 )
--> 177 for _ in initialization_manager.generate_setup_events():
178 pass
179 pipeline_context = initialization_manager.get_object()
/databricks/python/lib/python3.7/site-packages/dagster/utils/__init__.py in generate_setup_events(self)
373 try:
374 while self.object is None:
--> 375 obj = next(self.generator)
376 if isinstance(obj, self.object_cls):
377 self.object = obj
/databricks/python/lib/python3.7/site-packages/dagster/core/execution/context_creation_pipeline.py in pipeline_initialization_event_generator(execution_plan, run_config, pipeline_run, instance, scoped_resources_builder_cm, system_storage_data, raise_on_error)
214 else:
215 # pipeline teardown failure
--> 216 raise dagster_error
217
218 if raise_on_error:
/databricks/python/lib/python3.7/site-packages/dagster/core/execution/context_creation_pipeline.py in pipeline_initialization_event_generator(execution_plan, run_config, pipeline_run, instance, scoped_resources_builder_cm, system_storage_data, raise_on_error)
188 )
189
--> 190 _validate_plan_with_context(pipeline_context, execution_plan)
191
192 yield pipeline_context
/databricks/python/lib/python3.7/site-packages/dagster/core/execution/context_creation_pipeline.py in _validate_plan_with_context(pipeline_context, execution_plan)
222 # perform any plan validation that is dependent on access to the pipeline context
223 def _validate_plan_with_context(pipeline_context, execution_plan):
--> 224 validate_reexecution_memoization(pipeline_context, execution_plan)
225
226
/databricks/python/lib/python3.7/site-packages/dagster/core/execution/memoization.py in validate_reexecution_memoization(pipeline_context, execution_plan)
24 raise DagsterRunNotFoundError(
25 'Run id {} set as parent run id was not found in instance'.format(parent_run_id),
---> 26 invalid_run_id=parent_run_id,
27 )
28
DagsterRunNotFoundError: Run id 5c34579a-2876-497c-9127-d1697f134c2d set as parent run id was not found in instance
sandy
07/06/2020, 10:31 PMBinh Pham
07/06/2020, 10:59 PM