Dominik Liebler
11/16/2022, 8:42 AM
.spec.ttlSecondsAfterFinished in the Job resources it creates, resulting in a lot of accumulated Jobs never being deleted. I tried to set the dagster-k8s/config tag explicitly, as described in the documentation, but to no avail. I also tried updating to 1.0.17, but that didn’t help either. Is there something else I need to consider here?
Adam Bloom
11/16/2022, 3:43 PM
johann
11/17/2022, 9:13 PM
kubectl describe job)? Or is it not getting respected (would point to Adam’s suggestion)
Dominik Liebler
11/18/2022, 6:49 AM
johann
11/18/2022, 4:59 PM
kubectl describe job, it actually won’t appear there. It does show up for me, though, if I run kubectl get job <name> -o yaml. We set a 1-day TTL by default that I’d expect you to see in the spec.
Dominik Liebler
11/21/2022, 7:29 AM
apiVersion: batch/v1
kind: Job
metadata:
  creationTimestamp: "2022-11-16T06:10:00Z"
  labels:
    app.kubernetes.io/component: run_worker
    app.kubernetes.io/instance: dagster
    app.kubernetes.io/name: dagster
    app.kubernetes.io/part-of: dagster
    app.kubernetes.io/version: 1.0.17
    dagster/job: AbcETL
    dagster/run-id: 01f11a33-c9c8-4ce9-bb0e-326b972fb72e
  name: dagster-run-01f11a33-c9c8-4ce9-bb0e-326b972fb72e
  namespace: dagster
  resourceVersion: "270217520"
  uid: eafbdde6-2288-4db1-bc3f-5edc14889f9f
spec:
  backoffLimit: 0
  completions: 1
  parallelism: 1
  selector:
    matchLabels:
      controller-uid: eafbdde6-2288-4db1-bc3f-5edc14889f9f
  template:
    metadata:
      creationTimestamp: null
      labels:
        app.kubernetes.io/component: run_worker
        app.kubernetes.io/instance: dagster
        app.kubernetes.io/name: dagster
        app.kubernetes.io/part-of: dagster
        app.kubernetes.io/version: 1.0.17
        controller-uid: eafbdde6-2288-4db1-bc3f-5edc14889f9f
        dagster/job: AbcETL
        dagster/run-id: 01f11a33-c9c8-4ce9-bb0e-326b972fb72e
        job-name: dagster-run-01f11a33-c9c8-4ce9-bb0e-326b972fb72e
      name: dagster-run-01f11a33-c9c8-4ce9-bb0e-326b972fb72e
    spec:
      containers:
      - args:
        - dagster
        - api
        - execute_run
        - ...
        env:
        - name: DAGSTER_HOME
          value: /opt/dagster/dagster_home
        - name: DAGSTER_PG_PASSWORD
          valueFrom:
            secretKeyRef:
              key: postgresql-password
              name: dagster-postgresql-secret
        envFrom:
        - configMapRef:
            name: dagster-dagster-user-deployments-etl-user-env
        - secretRef:
            name: dagster-slack-secret
        - secretRef:
            name: dagster-trino-credentials
        image: dagster-user-code:3.1.1
        imagePullPolicy: Always
        name: dagster
        resources: {}
        terminationMessagePath: /dev/termination-log
        terminationMessagePolicy: File
      dnsPolicy: ClusterFirst
      imagePullSecrets:
      - name: deployment-token-dagster
      restartPolicy: Never
      schedulerName: default-scheduler
      securityContext: {}
      serviceAccount: dagster-dagster-user-deployments-user-deployments
      serviceAccountName: dagster-dagster-user-deployments-user-deployments
      terminationGracePeriodSeconds: 30
status:
  completionTime: "2022-11-16T06:10:14Z"
  conditions:
  - lastProbeTime: "2022-11-16T06:10:14Z"
    lastTransitionTime: "2022-11-16T06:10:14Z"
    status: "True"
    type: Complete
  startTime: "2022-11-16T06:10:00Z"
  succeeded: 1
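The check discussed above (whether the TTL actually made it into the Job spec) can also be scripted. The following is only a minimal sketch: it assumes the manifest was fetched with kubectl get job <name> -o json rather than -o yaml so that the standard library can parse it, and the helper name ttl_seconds_after_finished and the trimmed sample manifest are made up for illustration.

```python
import json


def ttl_seconds_after_finished(job_manifest: dict):
    """Return spec.ttlSecondsAfterFinished from a Job manifest, or None if unset."""
    return job_manifest.get("spec", {}).get("ttlSecondsAfterFinished")


# Trimmed, illustrative stand-in for the output of
# `kubectl get job <name> -o json`; like the manifest posted above,
# it has no ttlSecondsAfterFinished under spec.
manifest_json = """
{
  "apiVersion": "batch/v1",
  "kind": "Job",
  "metadata": {"name": "dagster-run-01f11a33-c9c8-4ce9-bb0e-326b972fb72e"},
  "spec": {"backoffLimit": 0, "completions": 1, "parallelism": 1}
}
"""

manifest = json.loads(manifest_json)
ttl = ttl_seconds_after_finished(manifest)

if ttl is None:
    # Without the field, Kubernetes does not garbage-collect the finished Job.
    print("ttlSecondsAfterFinished is not set on this Job")
else:
    print(f"Job becomes eligible for cleanup {ttl} seconds after it finishes")
```

For the manifest posted above, a check like this would report the field as missing, which matches the accumulation of finished Jobs.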
johann
11/21/2022, 3:36 PM
When I set it manually after the Job has completed, the job will be cleaned up as expected.
kubectl get job | grep -e dagster-run -e dagster-job | awk 'match($4,/[0-9]+d/) {print $1}' | xargs kubectl delete job
This deletes completed pods older than 1 day:
kubectl get pod | grep -e dagster-run -e dagster-job | awk 'match($3,/Completed/) {print $0}' | awk 'match($5,/[0-9]+d/) {print $1}' | xargs kubectl delete pod
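To make the filtering in these one-liners easier to follow, here is a rough Python sketch of the Job variant. It only mirrors the grep and awk steps on captured text and does not call kubectl; the function names and the sample output are invented for illustration. One caveat worth checking on your cluster: kubectl tends to print ages under two days in hours (for example 26h), so a filter on a "d" component may only pick up resources that are at least two days old.

```python
import re

# Same pattern as the awk filter: any "<digits>d" inside the AGE column.
DAY_COMPONENT = re.compile(r"[0-9]+d")


def has_day_component(age: str) -> bool:
    """True if an age string such as '5d' or '2d3h' contains a day component."""
    return DAY_COMPONENT.search(age) is not None


def jobs_to_delete(kubectl_output: str) -> list:
    """Names of dagster Jobs whose AGE (4th column) contains a day component."""
    names = []
    for line in kubectl_output.splitlines():
        # Equivalent of: grep -e dagster-run -e dagster-job
        if "dagster-run" not in line and "dagster-job" not in line:
            continue
        fields = line.split()
        if len(fields) < 4:
            continue
        # Equivalent of: awk 'match($4,/[0-9]+d/) {print $1}'
        if has_day_component(fields[3]):
            names.append(fields[0])
    return names


# Made-up sample resembling `kubectl get job` output.
sample_output = """\
NAME              COMPLETIONS   DURATION   AGE
dagster-run-aaa   1/1           14s        5d
dagster-run-bbb   1/1           9s         26h
dagster-job-ccc   1/1           30s        2d3h
other-job         1/1           5s         9d
"""

selected = jobs_to_delete(sample_output)
print(selected)
```

The resulting names are what the original pipeline hands to xargs kubectl delete job.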
Dominik Liebler
11/22/2022, 7:17 AM