Simon Späti
04/02/2020, 7:47 AMTobias Macey
04/02/2020, 2:10 PMWill Brown
04/02/2020, 4:28 PMChris Roth
04/02/2020, 8:41 PMdagster-celery worker start -y celery_config.yaml
with an sqs broker and i get ValueError: invalid literal for int() with base 10:
. i think the solution is to safequote these strings (eg https://docs.celeryproject.org/en/latest/getting-started/brokers/sqs.html), but i'm not sure if there's a way to do this without forking dagster-celery. any ideas?Chris Roth
04/02/2020, 8:42 PMZach
04/02/2020, 10:08 PMChris Roth
04/02/2020, 11:01 PMcelery_config.yaml
that is passed to dagster-celery
?user
04/03/2020, 1:12 AMprha
04/03/2020, 1:13 AMprha
04/03/2020, 1:20 AMschrockn
04/03/2020, 1:27 AMOwen
04/03/2020, 10:18 AMBen Smith
04/03/2020, 1:00 PMOwen
04/03/2020, 1:31 PM@pipleine
def my_pipeline():
run_cmd_in_venv = bash_script_solid(
'.../dagster_pipelines/hello_world.sh',
name = 'hello'
)
execute_solid(
run_cmd_in_venv
)
the output on starting dagit is as follows:
Loading repository...
2020-04-03 13:29:16 - dagster - DEBUG - ephemeral_hello_solid_pipeline - ed74cc0e-4ab8-4f0d-ad3f-3a0839c2ab3c - PIPELINE_START - Started execution of pipeline "ephemeral_hello_solid_pipeline".
2020-04-03 13:29:16 - dagster - DEBUG - ephemeral_hello_solid_pipeline - ed74cc0e-4ab8-4f0d-ad3f-3a0839c2ab3c - ENGINE_EVENT - Executing steps in process (pid: 29701)
event_specific_data = {"error": null, "marker_end": null, "marker_start": null, "metadata_entries": [["pid", null, ["29701"]], ["step_keys", null, ["['hello.compute']"]]]}
step_key = "hello.compute"
2020-04-03 13:29:16 - dagster - DEBUG - ephemeral_hello_solid_pipeline - ed74cc0e-4ab8-4f0d-ad3f-3a0839c2ab3c - STEP_START - Started execution of step "hello.compute".
solid = "hello"
solid_definition = "hello"
step_key = "hello.compute"
2020-04-03 13:29:16 - dagster - INFO - system - ed74cc0e-4ab8-4f0d-ad3f-3a0839c2ab3c - using temporary directory: /tmp
solid = "hello"
solid_definition = "hello"
step_key = "hello.compute"
2020-04-03 13:29:16 - dagster - INFO - system - ed74cc0e-4ab8-4f0d-ad3f-3a0839c2ab3c - Temporary script location: /tmp/hellop5yj64ld
solid = "hello"
solid_definition = "hello"
step_key = "hello.compute"
2020-04-03 13:29:16 - dagster - INFO - system - ed74cc0e-4ab8-4f0d-ad3f-3a0839c2ab3c - Running command:
echo "Hello"
solid = "hello"
solid_definition = "hello"
step_key = "hello.compute"
2020-04-03 13:29:16 - dagster - INFO - system - ed74cc0e-4ab8-4f0d-ad3f-3a0839c2ab3c - Hello
solid = "hello"
solid_definition = "hello"
step_key = "hello.compute"
2020-04-03 13:29:16 - dagster - INFO - system - ed74cc0e-4ab8-4f0d-ad3f-3a0839c2ab3c - Command exited with return code 0
... you can see that it executes the solid, which is pretty undesirable.
Am I doing something wrong?user
04/03/2020, 6:56 PMChris Roth
04/03/2020, 11:28 PMSimon Späti
04/06/2020, 7:04 AMJava gateway process exited before sending its port number
(full stack trace see in first comment). It works fine on my Macbook and linux, but in my a bit more complicated work environment with windows and WSL plus proxy, my local spark connection gives the above error. I tried several things with JAVA options for proxy etc. but before I search more, do you might have an idea where I should start searching or how to fix that? Would be much appreciated.Zach
04/06/2020, 10:48 PMZach
04/06/2020, 10:48 PMZach
04/06/2020, 10:49 PMJustin Swaney
04/07/2020, 1:48 PMqsub -sync y
calls within solids, but it is only submitting and running one job at a time. When I omit the -sync y
part, the pipeline completes immediately once all the jobs are submitted, which doesn't let me see the progress in dagit
Milos Tomic
04/08/2020, 11:24 AMdagstermill.yield_result
didn't help
Also why is documentation so wild, you start intro with defining solids with decorator and after a while you are using SolidDefinitions, is there a way for dagstermill solid to be defined with decorator?
Sorry if my questions are stupid, really just trying to learn
this is basically my structure and I would like my solid transformation_joins.. to be ipynb that can return value / variable from notebook, if that's possible.Vincent Goffin
04/08/2020, 2:46 PMViolated InRangeColumnConstraint (Column must have values between -inf and inf inclusive.)
for that column with the nans as offending values, is that normal?Milos Tomic
04/09/2020, 10:06 AMOmar Sultan
04/09/2020, 8:51 PMVincent Goffin
04/10/2020, 6:03 AMZach
04/10/2020, 3:25 PMDaniel Olausson
04/14/2020, 8:31 AM@solid(...)
def my_solid(context, **kwargs):
#some stuff
and then:
@pipeline
def my_pipeline():
task1 = my_solid.alias('task1')
task2 = my_solid.alias('task2')
task3 = my_solid.alias('task3')
task4 = my_solid.alias('task4')
task4(task1(), task2(), task3())
Chris Roth
04/14/2020, 8:43 PMcompute_log_manager
as the dagster.yaml
property but it only worked when i changed it to compute_logs
- maybe docs need to be updated?Chris Roth
04/14/2020, 8:45 PM