Hey there! We have been seeing several cases in which jobs will die before they come up and be stuck in a running state for a long period of time. And a few cases where we can't see the job in dagit but there is a job in k8s that will still be active and continue to run. What are the current best practices for stopping these?