infra/stacks
Viktor Barzin 9f1b3a53d3 right-size cluster memory: reduce overprovisioned, fix under-provisioned services
Phase 1 - Quick wins (~4.5 Gi saved):
- democratic-csi: add explicit sidecar resources (64-80Mi vs 256Mi LimitRange default)
- caretta: 768Mi → 600Mi (VPA upper 485Mi)
- immich-ml: 4Gi → 3584Mi (VPA upper 2.95Gi, GPU margin)
- onlyoffice: 3Gi → 2304Mi (VPA upper 1.82Gi)

Phase 2 - Safety fixes (prevent OOMKills):
- frigate: 2Gi/8Gi → 5Gi/10Gi (VPA upper 7.7Gi, was 4% headroom)
- openclaw: 1280Mi req → 2Gi req=limit (documented 2Gi requirement)

Phase 3 - Additional right-sizing:
- authentik workers: 1Gi → 896Mi x3 (VPA upper 722Mi)
- shlink: 512Mi/768Mi → 960Mi req=limit (VPA upper 780Mi, safety increase)

Phase 4 - Burstable QoS for lower tiers:
- tier-3-edge: 128Mi/128Mi → 96Mi req / 192Mi limit
- tier-4-aux: 128Mi/128Mi → 64Mi req / 256Mi limit

Phase 5 - Monitoring:
- Add ClusterMemoryRequestsHigh alert (>85% allocatable, 15m)
- Add ContainerNearOOM alert (>85% limit, 30m)
- Add PodUnschedulable alert (5m, critical)

Cluster: 92.7% → 90.8% memory requests. Stirling-pdf now schedulable.
2026-03-18 08:04:01 +00:00
..
actualbudget equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-18 08:04:00 +00:00
affine add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
audiobookshelf add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
blog add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
calibre fix cluster health: resolve 21/23 failures from healthcheck 2026-03-18 08:04:00 +00:00
changedetection add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
city-guesser add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
claude-memory right-size memory: set requests=limits based on actual usage 2026-03-18 08:03:59 +00:00
coturn add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
cyberchef add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
dashy add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
dawarich add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
descheduler migrate all secrets from SOPS to Vault KV 2026-03-18 08:03:59 +00:00
diun add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
ebook2audiobook migrate all secrets from SOPS to Vault KV 2026-03-18 08:03:59 +00:00
echo add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
excalidraw add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
f1-stream add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
forgejo add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
freedify equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-18 08:04:00 +00:00
freshrss add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
frigate right-size cluster memory: reduce overprovisioned, fix under-provisioned services 2026-03-18 08:04:01 +00:00
grampsweb add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
hackmd fix: increase memory for OOMKilled services 2026-03-18 08:04:01 +00:00
health add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
homepage add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
immich right-size cluster memory: reduce overprovisioned, fix under-provisioned services 2026-03-18 08:04:01 +00:00
infra add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
isponsorblocktv add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
jsoncrack migrate all secrets from SOPS to Vault KV 2026-03-18 08:03:59 +00:00
k8s-dashboard migrate all secrets from SOPS to Vault KV 2026-03-18 08:03:59 +00:00
kms add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
linkwarden migrate all secrets from SOPS to Vault KV 2026-03-18 08:03:59 +00:00
matrix Migrate Matrix Synapse from SQLite to PostgreSQL 2026-03-18 08:03:58 +00:00
meshcentral add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
n8n fix: increase memory for OOMKilled services 2026-03-18 08:04:01 +00:00
navidrome add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
netbox add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
networking-toolbox add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
nextcloud equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-18 08:04:00 +00:00
novelapp add AUTH_SECRET and ALLOWED_ORIGIN env vars to novelapp deployment 2026-03-18 08:04:00 +00:00
ntfy add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
ollama add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
onlyoffice right-size cluster memory: reduce overprovisioned, fix under-provisioned services 2026-03-18 08:04:01 +00:00
openclaw right-size cluster memory: reduce overprovisioned, fix under-provisioned services 2026-03-18 08:04:01 +00:00
osm_routing add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
owntracks add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
paperless-ngx add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
platform right-size cluster memory: reduce overprovisioned, fix under-provisioned services 2026-03-18 08:04:01 +00:00
plotting-book add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
poison-fountain right-size memory: set requests=limits based on actual usage 2026-03-18 08:03:59 +00:00
privatebin add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
real-estate-crawler fix cluster health: resolve 21/23 failures from healthcheck 2026-03-18 08:04:00 +00:00
reloader [ci skip] phase 5+6: update CI pipelines for SOPS, add sensitive=true to secret vars 2026-03-07 14:30:36 +00:00
resume add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
rybbit fix: openclaw policy violation + reduce memory requests for capacity 2026-03-18 08:04:01 +00:00
send add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
servarr equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-18 08:04:00 +00:00
shadowsocks add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
speedtest fix: increase memory for OOMKilled services 2026-03-18 08:04:01 +00:00
stirling-pdf fix: openclaw policy violation + reduce memory requests for capacity 2026-03-18 08:04:01 +00:00
tandoor add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
terminal Add terminal stack - reverse proxy to ttyd behind authentik 2026-03-10 23:46:01 +00:00
tor-proxy add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
trading-bot right-size memory: set requests=limits based on actual usage 2026-03-18 08:03:59 +00:00
travel_blog add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
tuya-bridge add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
url right-size cluster memory: reduce overprovisioned, fix under-provisioned services 2026-03-18 08:04:01 +00:00
vault equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-18 08:04:00 +00:00
wealthfolio add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
webhook_handler add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
whisper add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00
woodpecker migrate all secrets from SOPS to Vault KV 2026-03-18 08:03:59 +00:00
ytdlp add vaultwarden daily backup CronJob to NFS 2026-03-18 08:04:00 +00:00