Noah K
10/31/2020, 11:17 AM
deadvoid
10/31/2020, 11:22 AM
deadvoid
10/31/2020, 11:27 AM
dagster table?
deadvoid
10/31/2020, 11:28 AM
Iman Encarnacion
10/31/2020, 3:17 PM
Note: You can turn off any of following running schedules, but you cannot turn them back on.
Haydar Ali Ismail
11/02/2020, 4:26 PM
spark-submit parameter to run it in cluster mode, but now I’m getting this error when it’s trying to import dagster:
from dagster import (
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/__init__.py", line 110, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/core/launcher/__init__.py", line 2, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/core/launcher/cli_api_run_launcher.py", line 6, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/api/execute_run.py", line 2, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/cli/__init__.py", line 8, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/cli/api.py", line 13, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/cli/workspace/__init__.py", line 1, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/cli/workspace/cli_target.py", line 13, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/core/host_representation/__init__.py", line 47, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/core/host_representation/repository_location.py", line 8, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/api/snapshot_execution_plan.py", line 9, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/grpc/__init__.py", line 11, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/dagster/grpc/client.py", line 7, in <module>
File "/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/grpc/__init__.py", line 23, in <module>
ImportError: cannot import name 'cygrpc' from 'grpc._cython' (/appdata/hdfs/v8/yarn/nm/usercache/hismail/appcache/application_1602600045749_421754/container_e168_1602600045749_421754_01_000009/dependencies.zip/grpc/_cython/__init__.py)
any idea? I have tried to include grpcio separately from dagster as well, but the issue still persists.
Tobias Macey
11/02/2020, 4:52 PM
sashank
11/02/2020, 6:09 PM
Frederic Abiven
11/02/2020, 6:51 PM
@solid(
    input_defs=[
        InputDefinition(name="df_X", dagster_type=DataFrame),
        InputDefinition(name='scaler', dagster_type=Optional[Any])],
    output_defs=[
        OutputDefinition(name="df_scaled_X", dagster_type=DataFrame),
        OutputDefinition(name="scaler", dagster_type=Any)],
)
def min_max_scaler(_, df_X, scaler):
    if not scaler:
        scaler = MinMaxScaler().fit(df_X)
    yield Output(pd.DataFrame(scaler.transform(df_X), columns=df_X.columns), "df_scaled_X")
    yield Output(scaler, "scaler")
And then I call the solid like this:
min_max_scaler_train = min_max_scaler.alias("min_max_scaler_train")
df_X_train, scaler = min_max_scaler_train(df_X_train)
min_max_scaler_test = min_max_scaler.alias("min_max_scaler_test")
df_X_test, _ = min_max_scaler_test(df_X_test, scaler)
But then Dagit complains about a missing Input definition for the scaler, in the configuration.
Steve Gustafson
11/02/2020, 10:44 PM
@pipeline
def dostuff():
    handle = init_stuff()
    try:
        step_a(handle)
        step_b(handle)
        ...
    except:
        handle_exception(handle)
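A minimal sketch of why a try/except written inside a pipeline body may not catch step failures, assuming the body only wires steps together when the pipeline is defined and the steps themselves run later. This is plain Python, not dagster's API: `Plan`, `build_plan`, and `on_failure` are hypothetical names used only for illustration.

```python
from typing import Callable, List, Optional


class Plan:
    """Hypothetical stand-in for a pipeline: records steps now, runs them later."""

    def __init__(self) -> None:
        self.steps: List[Callable[[], None]] = []
        self.on_failure: Optional[Callable[[Exception], None]] = None

    def add(self, fn: Callable[[], None]) -> None:
        # Wiring only: the step is stored, not executed.
        self.steps.append(fn)

    def run(self) -> bool:
        # Execution happens here, long after the wiring code has returned.
        for fn in self.steps:
            try:
                fn()
            except Exception as exc:
                if self.on_failure is not None:
                    self.on_failure(exc)
                return False
        return True


events: List[str] = []


def step_a() -> None:
    events.append("step_a ran")


def step_b() -> None:
    raise RuntimeError("step_b failed")


def build_plan() -> Plan:
    plan = Plan()
    try:
        plan.add(step_a)
        plan.add(step_b)
    except Exception:
        # Never reached: nothing has executed yet, so nothing can fail here.
        events.append("caught while wiring")
    # Failure handling has to be attached to execution instead.
    plan.on_failure = lambda exc: events.append(f"handled: {exc}")
    return plan


plan = build_plan()
events_after_wiring = list(events)  # still empty, no step has run
succeeded = plan.run()
```

In this sketch the except branch in `build_plan` never fires, and the failure is only seen by the handler attached to the run. Whether dagster offers an equivalent hook for this case is not established in this thread.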
dwall
11/03/2020, 4:47 PM
Andy H
11/05/2020, 12:17 AM
dagster_aws won't be a valid option in production.
sashank
11/05/2020, 9:44 PM
user
11/05/2020, 10:38 PM
Franz
11/06/2020, 8:25 AM
dagster pipeline execute -f pipeline.py -e config.yaml
Dominic Pham
11/06/2020, 8:56 PM
Basil V
11/06/2020, 11:57 PM
my_solid.alias('some alias') command with and getting this error:
dagster.core.errors.DagsterUserCodeProcessError: (AttributeError) - AttributeError: module 'repository.utils.solids.my_solid' has no attribute 'alias'
Is the .alias function still valid on solids or have things changed for how to do this? Thanks!
Steve Gustafson
11/07/2020, 2:27 AM
Noah K
11/07/2020, 5:11 AM
Steve Gustafson
11/09/2020, 9:46 PM
Noah K
11/09/2020, 9:47 PM
Noah K
11/09/2020, 9:47 PM
Noah K
11/09/2020, 9:47 PM
Mose
11/10/2020, 4:18 PM
context object, and are not solids...also they can be in completely different modules).
1. I tried print and logging, but I don't see them in dagit when I run the pipeline.
2. I don't see any of my logs using context.log.info('My logs here') in GCP logs (deployed using k8s), only the dagster logs (e.g. Engine events...Started process for pipeline (pid: 17024).) can be seen.
Yichen
11/10/2020, 4:59 PM
Tobias Macey
11/10/2020, 5:27 PM
Tobias Macey
11/10/2020, 5:27 PM
Yichen
11/10/2020, 5:58 PM
matas
11/10/2020, 6:02 PM
schrockn
11/10/2020, 9:57 PM