fix cluster health: resolve 21/23 failures from healthcheck

- nvidia: change GPU taint NoSchedule -> PreferNoSchedule to allow
  overflow scheduling on k8s-node1 (frees ~7Gi capacity)
- kyverno: increase reports-controller memory 256Mi -> 512Mi (OOMKilled)
- speedtest: add missing DB_PORT=3306 env var (nc: service "" unknown)
- realestate-crawler: increase API memory 64Mi -> 256Mi (OOMKilled)
- calibre: increase liveness probe timeout 1s -> 5s (false restarts)
This commit is contained in:
Viktor Barzin 2026-03-15 02:33:46 +00:00 committed by Viktor Barzin
parent dc576aa8b6
commit 6f2f4c089c
5 changed files with 10 additions and 5 deletions

View file

@ -30,11 +30,11 @@ resource "helm_release" "kyverno" {
reportsController = {
resources = {
limits = {
memory = "256Mi"
memory = "512Mi"
}
requests = {
cpu = "100m"
memory = "128Mi"
memory = "384Mi"
}
}
}