Haydar Ali Ismail
11/02/2020, 4:26 PMspark-submit
parameter to run it on a cluster mode but now I’m getting this error when the it’s trying to import dagster:
from dagster import (
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/__init__.py", line 110, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/core/launcher/__init__.py", line 2, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/core/launcher/cli_api_run_launcher.py", line 6, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/api/execute_run.py", line 2, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/cli/__init__.py", line 8, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/cli/api.py", line 13, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/cli/workspace/__init__.py", line 1, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/cli/workspace/cli_target.py", line 13, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/core/host_representation/__init__.py", line 47, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/core/host_representation/repository_location.py", line 8, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/api/snapshot_execution_plan.py", line 9, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/grpc/__init__.py", line 11, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/grpc/client.py", line 7, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/grpc/__init__.py", line 23, in <module>
ImportError: cannot import name 'cygrpc' from 'grpc._cython' (/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/grpc/_cython/__init__.py)
any idea? I have tried to include grpcio
separately from dagster
as well but the issue still persistssandy
11/03/2020, 1:16 AMHaydar Ali Ismail
11/03/2020, 7:42 AMdagster
itself on the cluster to make the solution as portable as much as we can. I think I do need to have dagster
because when I try to run the pipeline it complains that it cannot import dagster
.
I am actually trying to make my own step launcher for a self-hosted Spark but having a bit of an issue trying to mimic the step and events stuff based from the Amazon EMR pyspark step launcher.
To make a bare minimum step launcher, what are the stuff that has to be implemented?sandy
11/06/2020, 4:44 PM