[ci skip] fix cluster health: GPU tolerations, actualbudget nfs_server, AuthentikDown alert

- Add missing nvidia.com/gpu toleration to ollama and yt-highlights deployments
- Add node_selector gpu=true to ollama deployment
- Pass nfs_server variable through to actualbudget factory modules
- Fix AuthentikDown alert to match actual deployment name (goauthentik-server)
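
For reference, the first two bullets correspond roughly to the following pod spec fields. This is only a sketch in plain Kubernetes YAML: the repository's own definitions use a different format (the `node_selector` spelling suggests Terraform), and the `operator` and `effect` values here are assumptions, since the commit message does not state which taint the GPU node carries.

```yaml
# Sketch only: pod template fields for a GPU workload such as ollama.
spec:
  template:
    spec:
      # Schedule only onto nodes labelled gpu=true (label values are strings).
      nodeSelector:
        gpu: "true"
      tolerations:
        # Tolerate the GPU taint so the pod is not repelled from the GPU node.
        - key: nvidia.com/gpu
          operator: Exists      # assumption: match any taint value
          effect: NoSchedule    # assumption: the taint's effect is not given in the commit
```

The node selector and the toleration do different jobs: the selector keeps the pod off non-GPU nodes, while the toleration allows it onto a tainted GPU node, so a GPU workload generally needs both.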
Viktor Barzin 2026-02-24 22:55:58 +00:00
parent 4fab38da1f
commit c35bef2fd8
4 changed files with 18 additions and 1 deletions

@@ -427,7 +427,7 @@ serverFiles:
annotations:
summary: "Headscale VPN has no available replicas"
- alert: AuthentikDown
- expr: (kube_deployment_status_replicas_available{namespace="authentik", deployment="authentik-server"} or on() vector(0)) < 1
+ expr: (kube_deployment_status_replicas_available{namespace="authentik", deployment="goauthentik-server"} or on() vector(0)) < 1
for: 5m
labels:
severity: critical