infra

Viktor Barzin 43b49f7f6c cluster recovery: fix resource limits and node1 memory - nvidia quota: requests.memory 8Gi → 12Gi (unblock cuda-validator) - calibre: startup probe initial_delay 60→120s, timeout 1→5s, wait_for_rollout=false (DOCKER_MODS install takes 10+ min) - immich ML: memory 2Gi → 4Gi (OOMKilled loading CLIP models) Also done outside TF (not in this commit): - node1 VM: 16 GiB → 24 GiB RAM (Proxmox) - tigera-operator: kubectl patch 128→256Mi - nvidia-driver-daemonset: kubectl patch 1→4Gi memory - kyverno reports-controller: kubectl patch 128→256Mi - CNPG operator: kubectl rollout restart		2026-03-15 01:44:28 +00:00
..
Dockerfile	[ci skip] Move Terraform modules into stack directories	2026-02-22 14:38:14 +00:00
main.tf	cluster recovery: fix resource limits and node1 memory	2026-03-15 01:44:28 +00:00
values.yaml	[ci skip] Move Terraform modules into stack directories	2026-02-22 14:38:14 +00:00