infra/stacks/platform/modules
Viktor Barzin f80e1fa868 cluster health fixes: NFS CSI, Immich ML, dbaas, Redis, DNS, trading-bot removal
- NFS CSI: fix liveness-probe port conflict (29652 → 29653)
- Immich ML: add gpu-workload priority class to enable preemption on node1
- dbaas: right-size MySQL memory limits (sidecar 6Gi→350Mi, main 4Gi→3Gi)
- Redis: add redis-master service via HAProxy for master-only routing,
  update config.tfvars redis_host to use it
- CoreDNS: forward .viktorbarzin.lan to Technitium ClusterIP (10.96.0.53)
  instead of stale LoadBalancer IP (10.0.20.200)
- Trading bot: comment out all resources (no longer needed)
- Vault: remove trading-bot PostgreSQL database role
2026-04-06 11:54:45 +03:00
..
authentik mitigate cluster instability during terraform applies 2026-03-15 17:23:39 +00:00
cloudflared consolidate MetalLB IPs: 5 → 1 (10.0.20.200) 2026-03-24 18:35:43 +02:00
cnpg equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-14 21:46:49 +00:00
crowdsec state(dbaas): update encrypted state 2026-03-19 20:23:59 +00:00
dbaas fix: rybbit init port, mysql memory limit, metallb alert selector 2026-03-24 18:55:07 +02:00
headscale consolidate MetalLB IPs: 5 → 1 (10.0.20.200) 2026-03-24 18:35:43 +02:00
infra-maintenance state(dbaas): update encrypted state 2026-03-19 20:23:59 +00:00
iscsi-csi right-size cluster memory: reduce overprovisioned, fix under-provisioned services 2026-03-15 15:30:18 +00:00
k8s-portal add agent route to k8s-portal 2026-03-23 02:25:08 +02:00
kyverno state(dbaas): update encrypted state 2026-03-19 20:23:59 +00:00
mailserver consolidate MetalLB IPs: 5 → 1 (10.0.20.200) 2026-03-24 18:35:43 +02:00
metallb [ci skip] Move Terraform modules into stack directories 2026-02-22 14:38:14 +00:00
metrics-server equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-14 21:46:49 +00:00
monitoring resilience improvements: MySQL anti-affinity comment, descheduler 5min, prometheus termination 60s 2026-04-06 00:25:49 +03:00
nfs-csi cluster health fixes: NFS CSI, Immich ML, dbaas, Redis, DNS, trading-bot removal 2026-04-06 11:54:45 +03:00
nvidia right-size 14 services and scale down GPU-heavy workloads [ci skip] 2026-03-15 23:00:49 +00:00
rbac multi-user access: fix template memory default, add storage quota, add CONTRIBUTING.md [ci skip] 2026-03-19 23:49:15 +00:00
redis state(dbaas): update encrypted state 2026-03-19 20:23:59 +00:00
reverse_proxy fix: add retry middleware and per-service rate limit for ha-sofia 2026-04-05 20:47:58 +03:00
sealed-secrets equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-14 21:46:49 +00:00
technitium cluster health fixes: NFS CSI, Immich ML, dbaas, Redis, DNS, trading-bot removal 2026-04-06 11:54:45 +03:00
traefik increase global rate limit from 10/50 to 50/200 2026-03-28 23:40:10 +02:00
uptime-kuma Remove all CPU limits cluster-wide to eliminate CFS throttling 2026-03-14 08:51:45 +00:00
vaultwarden add hourly SQLite integrity check for vaultwarden with Prometheus alerting 2026-03-23 00:50:15 +02:00
vpa equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-14 21:46:49 +00:00
wireguard consolidate MetalLB IPs: 5 → 1 (10.0.20.200) 2026-03-24 18:35:43 +02:00
xray equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-14 21:46:49 +00:00