Levan
09/26/2022, 3:44 PMINPUT_PR
from in branch deployments as in here? When I export INPUT_PR
from github actions, it still doesn’t read it.claire
09/28/2022, 11:50 PMdaniel
09/29/2022, 7:54 PMclaire
09/29/2022, 9:01 PMCirdes Henrique
09/30/2022, 2:43 PMOren Lederman
09/30/2022, 7:31 PMDAGSTER_CLOUD_API_TOKEN
and ORGANIZATION_ID
, updated the registry in dagster_cloud.yml
and setup the auth to login to my gcp registry so far so good.
When I try to run the Hybrid Branch Deployments action, the build works, but it’s failing on one of the notification steps:
Traceback (most recent call last):
File "/create_or_update_comment.py", line 86, in <module>
main()
File "/create_or_update_comment.py", line 27, in main
pr_id = int(os.getenv("INPUT_PR"))
ValueError: invalid literal for int() with base 10: ''
Any clue to why this is happening? From what I can tell, the value of the pr
variable gets set in this action https://github.com/dagster-io/dagster-cloud-action/blob/main/actions/hybrid_branch_deploy/action.yml based on "${{ github.event.number }}"
. Does this mean it’s getting an empty value? Did I miss a setup in setting up this example repo?dish
09/30/2022, 9:23 PMOren Lederman
10/03/2022, 10:08 PMParse cloud workspace
script fails to checkout the code because it’s attempting to pull a branch that no longer exists.
(I’m using a sample repo creating using the quickstart template - https://github.com/dagster-io/dagster-cloud-hybrid-quickstart). Was able to recreate and temporarily fix by turning on and off the Automatically delete head branches
feature for my repo.
jobs:
parse_workspace:
runs-on: ubuntu-latest
outputs:
build_info: ${{ steps.parse-workspace.outputs.build_info }}
steps:
- name: Checkout repo
uses: actions/checkout@v3
with:
ref: ${{ github.head_ref }}
Leo Qin
10/05/2022, 8:16 PMMax Wong
10/11/2022, 11:11 AMZach P
10/11/2022, 2:57 PMMax Wong
10/12/2022, 12:47 PMWaiting for agent to sync changes to example_location...
Still waiting for agent to sync changes to example_location. This can take a few minutes.
Still waiting for agent to sync changes to example_location. This can take a few minutes.
Still waiting for agent to sync changes to example_location. This can take a few minutes.
Still waiting for agent to sync changes to example_location. This can take a few minutes.
Still waiting for agent to sync changes to example_location. This can take a few minutes.
Still waiting for agent to sync changes to example_location. This can take a few minutes.
Error: Some locations failed to load after being synced by the agent:
Error loading example_location: {'__typename': 'PythonError', 'message': 'docker.errors.APIError: 500 Server Error for <http+docker://localhost/v1.41/images/create?tag=1357a2e2f206e4f5a33a46efd6bb13b81b6e0b75&fromImage=***.dkr.ecr.***.amazonaws.com%2Fdagster-cloud-hybrid-quickstart>: Internal Server Error ("Head "https://***.dkr.ecr.***.<http://amazonaws.com/v2/dagster-cloud-hybrid-quickstart/manifests/1357a2e2f206e4f5a33a46efd6bb13b81b6e0b75|amazonaws.com/v2/dagster-cloud-hybrid-quickstart/manifests/1357a2e2f206e4f5a33a46efd6bb13b81b6e0b75>": no basic auth credentials")\n', 'stack': [' File "/dagster-cloud/dagster_cloud/workspace/user_code_launcher/user_code_launcher.py", line 779, in _reconcile\n new_server_handles[to_update_key] = self._start_new_server_spinup(\n', ' File "/dagster-cloud/dagster_cloud/workspace/docker/__init__.py", line 190, in _start_new_server_spinup\n for _line in docker.APIClient().pull(metadata.image, stream=True):\n', ' File "/usr/local/lib/python3.8/site-packages/docker/api/image.py", line 429, in pull\n self._raise_for_status(response)\n', ' File "/usr/local/lib/python3.8/site-packages/docker/api/client.py", line 270, in _raise_for_status\n raise create_api_error_from_http_exception(e) from e\n', ' File "/usr/local/lib/python3.8/site-packages/docker/errors.py", line 39, in create_api_error_from_http_exception\n raise cls(e, response=response, explanation=explanation) from e\n']}
• I have tested that on a machine running docker agent, it can pull the dagster image
wondering what could the error be from 🙏Zach
10/12/2022, 3:32 PMOren Lederman
10/12/2022, 5:33 PM.configured()
resources. The dict for the prod
environment was accidentally a set and not a dict. Also, for extra points - is it possible to add an option to Dagster cloud to only replace a running code location pod if the new one starts successfully?geoHeil
10/13/2022, 7:31 AMyuhan
10/13/2022, 9:00 PMSterling Paramore
10/13/2022, 11:55 PMQwame
10/14/2022, 3:33 PMMax Wong
10/15/2022, 6:11 PMstorage
config in dagster.yaml
?
https://docs.dagster.io/deployment/dagster-instance#compute-log-storageAki Iwa
10/25/2022, 11:04 AMDAGSTER_HOME
is set in workspace.yaml and dagster.yaml is placed in the DAGSTER_HOME
path of the Docker Image,
but the configuration is not reflected.Mark Fickett
10/25/2022, 1:32 PMDagsterUserCodeUnreachableError("Could not reach user code server") from e
The above exception was caused by the following exception:
grpc._channel._InactiveRpcError: <_InactiveRpcError of RPC that terminated with:
status = StatusCode.UNAVAILABLE
details = "DNS resolution failed for my-job-name-93ab65:4000: C-ares status is not ARES_SUCCESS qtype=A name=my-job-name-93ab65 is_balancer=0: Domain name not found"
debug_error_string = "{"created":"@1666703037.379755003","description":"DNS resolution failed for my-job-name-93ab65:4000: C-ares status is not ARES_SUCCESS qtype=A name=my-job-name-93ab65 is_balancer=0: Domain name not found","file":"src/core/lib/transport/error_utils.cc","file_line":165,"grpc_status":14}"
>
Qwame
10/25/2022, 5:53 PMMax Wong
10/26/2022, 3:30 AMStephen Bailey
10/26/2022, 5:53 PM- name: Run Dagster Test Job
uses: dagster-io/dagster-cloud-action/actions/utils/run@v0.1
with:
dagster_cloud_url: <https://my_url.dagster.cloud>
deployment: my_deployment
location: my_code_location_name
repository: my_repo_name
job: my_job_name
Tried passing in dagster_cloud_api_token
but got:
Warning: Unexpected input(s) 'dagster_cloud_api_token', valid inputs are ['entryPoint', 'args', 'organization_id', 'deployment', 'location', 'location_name', 'repository', 'job', 'tags_json', 'config_json', 'dagster_cloud_url']
Mark Fickett
11/01/2022, 3:00 PMZach P
11/01/2022, 8:42 PMdagster._core.errors.DagsterUserCodeUnreachableError: Could not send request to agent since no agents have recently heartbeated
This happens on a make_slack_run_on_failure_sensor
.
More details in comments 👀Max Wong
11/02/2022, 1:52 PMENTRYPOINT
in Dockerfile
(hybrid mode) (was migrating stuff from existing workflow and forgot to clean them up), when using branch deployment, the runner heartbeat is registered, but it throws DNS resolution failed
error.
Upon removing ENTRYPOINT
directive, everything works normally
I think this edge case could use a better error message 🚀Yang
11/02/2022, 7:28 PMLevan
11/03/2022, 11:01 AMdagster._core.scheduler.scheduler.DagsterSchedulerError: Unable to reach the user code server for schedule job1*. Schedule will resume execution once the server is available.
Any ideas?Robin Hawkes
11/03/2022, 8:31 PMdagster-starter
). How can I change these?