Jorge Sánchez (Jorjatorz)
12/21/2021, 4:42 PMmodel_id
UUID) and added this information to the AssetMaterialization metadata. Then the asset_sensor read this metadata and yield a run for the target pipeline with a configuration filled with this metadata. In other words, there was a “root” pipeline who received the initial configuration and this config was cascaded down to other pipelines through assets. If a pipeline had to add new config data, it added to the AssetMat. metadata and the next pipeline would make use of it. Usually the configs data configured a ResourceDefinition.string_resource()
so you can expect the the model_id
I commented before would become a String resource called model_id
, so it is easier to get it from all solids that required it.
I am currently migrating to the new framework, and these interconnected pipelines has now become graphs
The job provides the general configuration to the main resources but there are some resources (like the model_id
) that are created in a subgraph and subsequent graphs/solids will make use of it. The approach I am currently taking is that each graph will return a config_dict
and the next graph will get this config_dict
as input. So the graph that creates a model will insert the model_id
into it and subsequent graphs will have access to it, BUT this time they would get it through this config_dict
instead of a resource.
I would like to know your thoughts about this approach. Ideally I would like to be able to insert resources (for example the model_id
resource) or be able to modify the ResourceDefinition.string_resource()
from a solid, so the model_id
is empty and then populated with the id. Thank you 😄owen
12/21/2021, 5:51 PMJorge Sánchez (Jorjatorz)
12/21/2021, 5:55 PM