user
09/10/2020, 11:40 PMcat
09/10/2020, 11:55 PMKevin
09/11/2020, 4:55 PMsashank
09/11/2020, 5:00 PMsashank
09/11/2020, 5:02 PMschrockn
09/11/2020, 5:02 PMschrockn
09/11/2020, 5:03 PM@pipeline
and @composite_solid
into a single abstraction: @graph
Tobias Macey
09/11/2020, 8:12 PMcat
09/11/2020, 11:46 PMuser
09/11/2020, 11:48 PMWali
09/12/2020, 2:00 AMSlackbot
09/14/2020, 2:36 PMSachit Shivam
09/14/2020, 2:50 PMTobias Macey
09/14/2020, 10:54 PMSachit Shivam
09/15/2020, 11:11 AMuser
09/15/2020, 10:48 PMyuhan
09/15/2020, 11:00 PMRichard Fisher
09/16/2020, 8:14 AMexample_pipeline.py
file (I’m using import src.load_and_store
in the example_pipeline.py
file (which is in the same directory as src
). This works correctly when running the pipeline using the Python API (through execute_pipeline
in _`__name__ == "__main__"`_ in example_pipeline.py
) and when running it through dagit
with a workspace.yaml
file:
load_from:
- python_file: example_pipeline.py
However, when I try run dagster pipeline execute -f example_pipeline.py
(from the same directory) I receive the error ModuleNotFoundError: No module named 'src'
.
I’m sure this is a fairly basic import error, but could someone please provide a solution/best practice for this?Rizky Eko Putra
09/16/2020, 8:23 AMdagit -f hello_world.py
on it and it say Serving on <http://127.0.0.1:3000>
how can I access that localhost:3000 via external IP gcp VM? any other config should I set?Ben Sully
09/16/2020, 8:44 AMssh -L localhost:3000:localhost:3000 <GCP IP>
I think, I forget the exact magic commands), or pass -h 0.0.0.0
to your dagit command (which make it accessible to everyone, if your GCP VM is open)Sergii Ivakhno
09/16/2020, 9:21 AMError 1: Missing required field "storage" at the root. Available Fields: "['execution', 'intermediate_storage', 'loggers', 'resources', 'solids', 'storage']".
I haven't seen storage
option in any docs so far. Could you kindly point out me to the resource on how to set this up. We are launching via Python API rather than CLI so I presume it will be some kind of @storage
decorator? Thanks in advance!Georg Bauerfeind
09/17/2020, 3:34 PMuser
09/17/2020, 9:28 PMbob
09/17/2020, 9:43 PMManas Jain
09/18/2020, 8:04 AMSergii Ivakhno
09/18/2020, 9:06 AMrun_config
defined in presets https://docs.dagster.io/tutorial/advanced_pipelines#pipeline-config-presets and execute_pipeline
? When I specify intermediate_storage
it works when put in run_config
within execute_pipeline
but not when I create PresetDefinition
object that is then passed to preset_defs
. Thanks for advice in advance!Jaakko Kangasharju
09/18/2020, 12:26 PMProcess SpawnProcess-1:
Traceback (most recent call last):
File "/usr/local/lib/python3.7/multiprocessing/process.py", line 297, in _bootstrap
self.run()
File "/usr/local/lib/python3.7/multiprocessing/process.py", line 99, in run
self._target(*self._args, **self._kwargs)
File "/usr/local/lib/python3.7/site-packages/dagster/grpc/impl.py", line 184, in start_run_in_subprocess
run_event_handler=lambda x: None,
File "/usr/local/lib/python3.7/site-packages/dagster/grpc/impl.py", line 133, in _run_in_subprocess
EngineEventData.in_process(pid, marker_end="cli_api_subprocess_init"),
File "/usr/local/lib/python3.7/site-packages/dagster/core/instance/__init__.py", line 943, in report_engine_event
check.inst_param(pipeline_run, "pipeline_run", PipelineRun)
File "/usr/local/lib/python3.7/site-packages/dagster/check/__init__.py", line 183, in inst_param
obj, ttype, param_name, additional_message=additional_message
File "/usr/local/lib/python3.7/site-packages/future/utils/__init__.py", line 446, in raise_with_traceback
raise exc.with_traceback(traceback)
dagster.check.ParameterCheckError: Param "pipeline_run" is not a PipelineRun. Got None which is type <class 'NoneType'>.
I can't find anything about this error message. Has anyone seen this? Any ideas what I could do to fix it?Kevin
09/18/2020, 6:39 PMdagster.core.errors.DagsterInvalidDefinitionError: In @pipeline LDA_pipeline, received invalid type <class 'str'> for input "base_path" (passed by keyword) in solid invocation "ETL". Must pass the output from previous solid invocations or inputs to the composition function as inputs when invoking solids during composition.
just trying to create some pipelines with dagstermill attached is my .py which defines my solids -- each of the notebooks does execute:
dagstermill.yield_result(...)
Nikhil Raut
09/19/2020, 12:01 PMNikhil Raut
09/21/2020, 6:33 AMNikhil Raut
09/21/2020, 6:33 AMSergii Ivakhno
09/21/2020, 11:31 AMimportlib.import_module(pipeline)
, where pipeline is a path to dagster pipeline, i.e. blah/pipeline.py
. When I run poetry run python dagtsrer.py --path blah/pipeline.py
I get an error
import dagster_pandas as dagster_pd
File "/Users/apple/Library/Caches/pypoetry/virtualenvs/pwmf-qUT5bdGW-py3.7/lib/python3.7/site-packages/dagster_pandas/__init__.py", line 1, in <module>
from dagster.core.utils import check_dagster_package_version
ModuleNotFoundError: No module named 'dagster.core'; 'dagster' is not a package
importlib.import_module(pipeline)
from poetry shell. Other third-party packages also seem to import dynamically with no problems. Any suggestions would be most appreciated - thanks in advance!dagtsrer.py
masked the dagtsrer
package during import 😱