If a particular op is hanging and you're not sure why, one way to investigate is to run py-spy dump against the relevant process (for example, py-spy dump --pid <PID>), which prints the current stack trace of each thread. That can be a bit involved in Kubernetes, but I added a guide here with some steps you can take to set that up: https://github.com/dagster-io/dagster/discussions/14771#discussioncomment-6165783
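
If you'd rather script the dump than type it by hand, here is a minimal sketch. It assumes py-spy is installed in the same container as the hung process and that you have permission to attach to it. The helper names build_dump_command and dump_threads are made up for illustration and are not part of Dagster or py-spy.

```python
import shutil
import subprocess


def build_dump_command(pid: int) -> list[str]:
    """Return the argv for `py-spy dump` against the given process ID."""
    # `dump` is a py-spy subcommand; `--pid` selects the target process.
    return ["py-spy", "dump", "--pid", str(pid)]


def dump_threads(pid: int) -> str:
    """Run py-spy against `pid` and return the per-thread stack dump as text.

    Hypothetical helper: assumes py-spy is on PATH and that the caller is
    allowed to attach to the target process.
    """
    if shutil.which("py-spy") is None:
        raise RuntimeError("py-spy not found on PATH; try `pip install py-spy`")
    result = subprocess.run(
        build_dump_command(pid),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if py-spy exits non-zero
    )
    return result.stdout


# Example usage (replace 1234 with the PID of the hung op's process):
# print(dump_threads(1234))
```

If py-spy exits with a permissions error, the process usually needs extra privileges to attach (in containers this typically means the SYS_PTRACE capability), which is the part the guide above helps with.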