Alexei Zhuchkov
03/10/2023, 5:41 PM
RunRequest(...) when using a sensor?
Alex Shtuchkin
03/10/2023, 6:07 PM
Zach Jablons
03/10/2023, 6:13 PM
Zach
03/10/2023, 6:44 PM
run_id during initialization?
Martin Preusse
03/10/2023, 10:28 PM
Output to individual AssetOut() in a multi_asset?
Harrison Conlin
03/10/2023, 10:36 PM
__init.py from the module to a new file, replace server names in the resource configs, add python_file to the workspace.yaml. (Here is where I'm unsure.) I'll need to change/add key_prefixes and group_names, right?
Jean Gonzalez
03/10/2023, 10:56 PM
dagster._check.CheckError: Expected non-None value: None
  File "/usr/local/lib/python3.9/site-packages/dagster/_core/errors.py", line 206, in user_code_error_boundary
    yield
  File "/usr/local/lib/python3.9/site-packages/dagster/_grpc/impl.py", line 328, in get_external_sensor_execution
    return sensor_def.evaluate_tick(sensor_context)
  File "/usr/local/lib/python3.9/site-packages/dagster/_core/definitions/sensor_definition.py", line 428, in evaluate_tick
    result = list(self._evaluation_fn(context))
  File "/usr/local/lib/python3.9/site-packages/dagster/_core/definitions/sensor_definition.py", line 598, in _wrapped_fn
    for item in result:
  File "/usr/local/lib/python3.9/site-packages/dagster/_core/definitions/run_status_sensor_definition.py", line 589, in _wrapped_fn
    external_repository_origin = check.not_none(
  File "/usr/local/lib/python3.9/site-packages/dagster/_check/__init__.py", line 1081, in not_none
    raise CheckError(f"Expected non-None value: {additional_message}")
Alex Shtuchkin
03/10/2023, 11:32 PM
Anton Peniaziev
03/11/2023, 7:34 PM
Abhishek Agrawal
03/12/2023, 10:06 AM
from dagster import Definitions, build_resources, job, ScheduleDefinition
import datasource_lake_defs
from custom_modules import spina_classes
from custom_modules import helper_functions
from custom_modules import dbt_functions
from custom_modules import armada_functions

customer_code_list = []

@job
def job_generate_customer_dbt_manifests():
    # get the org list from armada
    organisations = armada_functions.get_organisations()
    organisation: spina_classes.Organisation
    for organisation in organisations:
        # get the config from armada
        run_config = armada_functions.get_config_from_armada(organisation)
        if run_config.datasets:
            customer_code = helper_functions.clean_object_name(organisation.organisation_name.lower())
            customer_code_list.append(customer_code)
            customer_name = organisation.organisation_name
            with build_resources(
                resources={"dbt_res": datasource_lake_defs.configured_dbt_cli_resource},
                resource_config={
                    "dbt_res": {
                        "config": {
                            "vars": dbt_functions.get_dbt_vars_for_customer(customer_code),
                            "target_path": f"target/{customer_code}"
                        }
                    }
                }
            ) as resources:
                resources.dbt_res.compile()

schedule_job_generate_customer_dbt_manifests = ScheduleDefinition(job=job_generate_customer_dbt_manifests, cron_schedule="*/10 * * * *")

defs = Definitions(
    jobs=[job_generate_customer_dbt_manifests],
    schedules=[schedule_job_generate_customer_dbt_manifests])
@Manan P
Jonny Wray
03/12/2023, 5:28 PM
Jon Erik Kemi Warghed
03/12/2023, 8:15 PM
Sheng Han Lim
03/13/2023, 6:09 AM
Alexander Whillas
03/13/2023, 6:23 AM
Malo PARIS
03/13/2023, 7:58 AM
@asset(key_prefix=["ETL"], compute_kind="Connection")
def CREATE_CONNECTION():
    my_connection = RFC(user="xxxx", passwd="xxxxx", system="xxxx")
    return my_connection

@asset(key_prefix=["ETL"])
def EXTRACT_TABLE(CREATE_CONNECTION):
    DATA = CREATE_CONNECTION.get_table(arg…)
    return DATA
Daniel Mosesson
03/13/2023, 9:11 AM
Slackbot
03/13/2023, 10:00 AM
Mark
03/13/2023, 10:02 AM
python -m unittest -v and while it's perfectly picking up all tests and running them properly, it's a bit difficult to read due to the built-in DEBUG logs that are generated on every asset creation.
While I'd prefer to only suppress these debug logs during unit tests, I've tried adding the following to my `DAGSTER_HOME\dagster.yaml`:
python_logs:
  python_log_level: INFO
but that didn't help (debug messages still visible). It did disable them in dagit though (which is actually what I didn't want :)).
I've also tried:
import logging
import unittest
from dagster import get_dagster_logger

class TestMyTestCase(unittest.TestCase):
    ...

if __name__ == "__main__":
    logger = get_dagster_logger()
    logger.setLevel(logging.INFO)
    unittest.main()
This also didn't seem to help. I'd also prefer to set the log level in one location for all unit tests, rather than in every file that contains tests (as I'm separating my tests over multiple files). Any suggestions?
Timothy Elder
03/13/2023, 3:40 PM
ModuleNotFoundError: No module named 'dagster_postgres'
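Regarding Mark's unittest logging question above: one stdlib-only option is a shared base class that calls `logging.disable(logging.DEBUG)`, so the setting lives in one place for every test file. This is a minimal sketch, assuming the DEBUG lines pass through Python's standard `logging` machinery; whether Dagster's own event output honors it is not confirmed here. `QuietTestCase` and the `"example"` logger name are hypothetical.

```python
import logging
import unittest


class QuietTestCase(unittest.TestCase):
    """Hypothetical shared base class that silences DEBUG records during tests."""

    @classmethod
    def setUpClass(cls):
        super().setUpClass()
        # logging.disable(level) drops records at `level` and below, for all loggers.
        logging.disable(logging.DEBUG)

    @classmethod
    def tearDownClass(cls):
        # NOTSET removes the process-wide override again.
        logging.disable(logging.NOTSET)
        super().tearDownClass()


class TestMyTestCase(QuietTestCase):
    def test_debug_is_suppressed(self):
        # "example" is a stand-in logger name (assumption, not a Dagster logger).
        logger = logging.getLogger("example")
        logger.setLevel(logging.DEBUG)
        self.assertFalse(logger.isEnabledFor(logging.DEBUG))
        self.assertTrue(logger.isEnabledFor(logging.INFO))
```

Test classes in other files would inherit from `QuietTestCase` instead of `unittest.TestCase`, and `python -m unittest -v` would pick them up as before.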
Joe
03/13/2023, 4:02 PM
Daniel Mosesson
03/13/2023, 4:11 PM
Sheng Han Lim
03/13/2023, 4:37 PM
Gatsby Lee
03/13/2023, 5:24 PM
Daniel Vetter
03/13/2023, 7:02 PM
Timothy Elder
03/13/2023, 8:10 PM
Qwame
03/13/2023, 8:42 PM
load_input function when reading it?
Chaitya
03/13/2023, 8:45 PM
dagit -f without having to use a CLI
Jacob Marcil
03/13/2023, 9:43 PM
Sterling Paramore
03/13/2023, 9:54 PM
Multiprocess executor: child process for step was terminated by signal 7 (SIGBUS).
dagster._core.executor.child_process_executor.ChildProcessCrashException
The error comes up at different times in the pipeline, but it happens within 2 minutes every time. I’m running dagit in a container on an M1 Mac. I’ve assigned 8GB memory to docker, and it doesn’t look like I’m getting at all close to the memory limit. I’m a bit stumped here.
Simon Frid
03/13/2023, 11:50 PM
sqlalchemy.exc.OperationalError: (psycopg2.errors.ProgramLimitExceeded) index row size 3424 exceeds btree version 4 maximum 2704 for index "idx_run_tags"
DETAIL: Index row references tuple (0,27) in relation "run_tags".
HINT: Values larger than 1/3 of a buffer page cannot be indexed.
Consider a function index of an MD5 hash of the value, or use full text indexing.
[SQL: INSERT INTO run_tags (run_id, key, value) VALUES (%(run_id)s, %(key)s, %(value)s)]
[parameters: ({'run_id': 'f09f8a7d-5501-4985-8c88-3046e52e43b9', 'key': '.dagster/repository', 'value': '__repository__@user-code-location'}, {'run_id': 'f09f8a7d-5501-4985-8c88-3046e52e43b9', 'key': 'dagster/partition', 'value': '2023-03-12'}, {'run_id': 'f09f8a7d-....
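The error above reports an index row of 3424 bytes against a btree maximum of 2704, which points to one run tag being too large to index. A rough check for oversized tags before launching a run could look like the sketch below. The `oversized_tags` helper and the `hypothetical/large_tag` key are made up for illustration, and the byte count is only an approximation, since the real index row also carries overhead.

```python
# Limit copied from the error message above. The real per-row budget also
# includes index overhead, so treat this as an approximate upper bound.
BTREE_MAX_INDEX_ROW_BYTES = 2704


def oversized_tags(tags, limit=BTREE_MAX_INDEX_ROW_BYTES):
    """Return the keys whose key plus value, UTF-8 encoded, reach `limit` bytes."""
    return sorted(
        key
        for key, value in tags.items()
        if len(key.encode("utf-8")) + len(value.encode("utf-8")) >= limit
    )


tags = {
    "dagster/partition": "2023-03-12",
    # Made-up tag standing in for whichever value triggered the error.
    "hypothetical/large_tag": "x" * 3400,
}
print(oversized_tags(tags))  # → ['hypothetical/large_tag']
```

Any key this reports is a candidate for shortening or for moving the payload out of run tags.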