right-size memory requests to unblock GPU workloads and fix dbaas quota [ci skip]

- nvidia: custom LimitRange (128Mi default, was 1Gi from Kyverno) to stop
  inflating GPU operator init containers; saves ~2.5Gi on GPU node
- nvidia: dcgm-exporter 1536Mi → 768Mi (actual usage 489Mi)
- monitoring: prometheus server 4Gi → 3Gi (actual usage 2.6Gi)
- onlyoffice: 2304Mi → 1536Mi (actual usage 1.3Gi)
- immich: frame explicit 64Mi resources (was getting 1Gi LimitRange default)
- dbaas: quota limits.memory 20Gi → 24Gi to fit 3rd MySQL replica

Root cause: Kyverno tier-2-gpu LimitRange injected 1Gi on every NVIDIA init
container (no explicit resources), wasting ~2.5Gi scheduling overhead on the
GPU node. Combined with over-requesting, frigate and immich-ml couldn't
schedule.
2026-03-17 22:35:54 +00:00 · 12a51c4ffa
commit 12a51c4ffa
parent 73511b1230
6 changed files with 46 additions and 11 deletions
--- a/stacks/immich/frame.tf
+++ b/stacks/immich/frame.tf
@@ -66,6 +66,15 @@ resource "kubernetes_deployment" "immich-frame" {
        container {
          image = "ghcr.io/immichframe/immichframe:latest"
          name  = "immich-frame"
+          resources {
+            requests = {
+              cpu    = "10m"
+              memory = "64Mi"
+            }
+            limits = {
+              memory = "128Mi"
+            }
+          }
          port {
            container_port = 8080
            protocol       = "TCP"