infra/stacks/platform/modules
Viktor Barzin 56583c3825 fix: add retry middleware and per-service rate limit for ha-sofia
The global rate limit (10 req/s, 50 burst) was too aggressive for HA
dashboards that load 30+ JS files on page load, causing 429s. VPN tunnel
blips between London K8s and Sofia caused 502s with no retry fallback.

- Add traefik-retry middleware to reverse-proxy factory (all services)
- Add skip_global_rate_limit variable to both reverse-proxy factories
- Create ha-sofia-rate-limit middleware (100 avg, 200 burst)
- Apply to ha-sofia and music-assistant (both route to Sofia)
2026-04-05 20:47:58 +03:00
..
authentik mitigate cluster instability during terraform applies 2026-03-15 17:23:39 +00:00
cloudflared consolidate MetalLB IPs: 5 → 1 (10.0.20.200) 2026-03-24 18:35:43 +02:00
cnpg equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-14 21:46:49 +00:00
crowdsec state(dbaas): update encrypted state 2026-03-19 20:23:59 +00:00
dbaas fix: rybbit init port, mysql memory limit, metallb alert selector 2026-03-24 18:55:07 +02:00
headscale consolidate MetalLB IPs: 5 → 1 (10.0.20.200) 2026-03-24 18:35:43 +02:00
infra-maintenance state(dbaas): update encrypted state 2026-03-19 20:23:59 +00:00
iscsi-csi right-size cluster memory: reduce overprovisioned, fix under-provisioned services 2026-03-15 15:30:18 +00:00
k8s-portal add agent route to k8s-portal 2026-03-23 02:25:08 +02:00
kyverno state(dbaas): update encrypted state 2026-03-19 20:23:59 +00:00
mailserver consolidate MetalLB IPs: 5 → 1 (10.0.20.200) 2026-03-24 18:35:43 +02:00
metallb [ci skip] Move Terraform modules into stack directories 2026-02-22 14:38:14 +00:00
metrics-server equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-14 21:46:49 +00:00
monitoring fix: bank sync alerts - remove {{ $labels.job }} that Helm provider silently drops [ci skip] 2026-04-05 20:07:51 +03:00
nfs-csi equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-14 21:46:49 +00:00
nvidia right-size 14 services and scale down GPU-heavy workloads [ci skip] 2026-03-15 23:00:49 +00:00
rbac multi-user access: fix template memory default, add storage quota, add CONTRIBUTING.md [ci skip] 2026-03-19 23:49:15 +00:00
redis state(dbaas): update encrypted state 2026-03-19 20:23:59 +00:00
reverse_proxy fix: add retry middleware and per-service rate limit for ha-sofia 2026-04-05 20:47:58 +03:00
sealed-secrets equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-14 21:46:49 +00:00
technitium consolidate MetalLB IPs: 5 → 1 (10.0.20.200) 2026-03-24 18:35:43 +02:00
traefik increase global rate limit from 10/50 to 50/200 2026-03-28 23:40:10 +02:00
uptime-kuma Remove all CPU limits cluster-wide to eliminate CFS throttling 2026-03-14 08:51:45 +00:00
vaultwarden add hourly SQLite integrity check for vaultwarden with Prometheus alerting 2026-03-23 00:50:15 +02:00
vpa equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-14 21:46:49 +00:00
wireguard consolidate MetalLB IPs: 5 → 1 (10.0.20.200) 2026-03-24 18:35:43 +02:00
xray equalize memory req=lim across 70+ containers using Prometheus 7d max data 2026-03-14 21:46:49 +00:00