infra/stacks/platform/modules
Viktor Barzin 6f2f4c089c fix cluster health: resolve 21/23 failures from healthcheck
- nvidia: change GPU taint NoSchedule -> PreferNoSchedule to allow
  overflow scheduling on k8s-node1 (frees ~7Gi capacity)
- kyverno: increase reports-controller memory 256Mi -> 512Mi (OOMKilled)
- speedtest: add missing DB_PORT=3306 env var (nc: service "" unknown)
- realestate-crawler: increase API memory 64Mi -> 256Mi (OOMKilled)
- calibre: increase liveness probe timeout 1s -> 5s (false restarts)
2026-03-18 08:04:00 +00:00
authentik equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-18 08:04:00 +00:00
cloudflared equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-18 08:04:00 +00:00
cnpg equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-18 08:04:00 +00:00
crowdsec equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-18 08:04:00 +00:00
dbaas add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
headscale equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-18 08:04:00 +00:00
infra-maintenance [ci skip] iSCSI migration, healthcheck fixes, health probes, etcd backup 2026-03-06 19:54:21 +00:00
iscsi-csi add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
k8s-portal equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-18 08:04:00 +00:00
kyverno fix cluster health: resolve 21/23 failures from healthcheck 2026-03-18 08:04:00 +00:00
mailserver equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-18 08:04:00 +00:00
metallb [ci skip] Move Terraform modules into stack directories 2026-02-22 14:38:14 +00:00
metrics-server equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-18 08:04:00 +00:00
monitoring prometheus: increase memory to 4Gi and probe delays for TSDB compaction 2026-03-18 08:04:00 +00:00
nfs-csi equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-18 08:04:00 +00:00
nvidia fix cluster health: resolve 21/23 failures from healthcheck 2026-03-18 08:04:00 +00:00
rbac add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
redis equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-18 08:04:00 +00:00
reverse_proxy Remove all CPU limits cluster-wide to eliminate CFS throttling 2026-03-18 08:03:58 +00:00
sealed-secrets equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-18 08:04:00 +00:00
technitium equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-18 08:04:00 +00:00
traefik equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-18 08:04:00 +00:00
uptime-kuma Remove all CPU limits cluster-wide to eliminate CFS throttling 2026-03-18 08:03:58 +00:00
vaultwarden fix vaultwarden backup image: use docker.io/library/alpine for Kyverno 2026-03-18 08:04:00 +00:00
vpa equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-18 08:04:00 +00:00
wireguard equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-18 08:04:00 +00:00
xray equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-18 08:04:00 +00:00