After the node2 OOM incident, right-size memory across the cluster by setting requests=limits based on max_over_time(container_memory_working_set_bytes[7d]) with 1.3x headroom. This eliminates the ~37Gi overcommit gap.

Categories:
- Safe equalization (50 containers): set req=lim where max7d is well within the target
- Limit increases (8 containers): raise limits for services spiking above their current limits
- No Prometheus data (12 containers): conservatively set lim=req
- Exception: nextcloud keeps req=256Mi/lim=8Gi due to Apache memory spikes

Also increased the dbaas namespace quota from 12Gi to 16Gi to accommodate mysql 4Gi limits across 3 replicas (3 x 4Gi = 12Gi, which on its own uses the entire old quota).
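The sizing rule (7-day peak working set times 1.3x headroom, applied as both request and limit) can be sketched roughly as follows. This is an illustration only, not the script used for this change: the function name `target_memory_mi` and the rounding step of 16Mi are assumptions made for the example.

```python
import math

MI = 1024 * 1024   # bytes per Mi
HEADROOM = 1.3     # headroom factor stated in the description


def target_memory_mi(max7d_bytes: float, step_mi: int = 16) -> int:
    """Return a request=limit value in Mi for one container.

    max7d_bytes is the result of
    max_over_time(container_memory_working_set_bytes[7d]) for that container.
    step_mi is a hypothetical rounding step, used only to get tidy values.
    """
    raw_mi = max7d_bytes * HEADROOM / MI
    # Round up so the limit is never below peak * headroom.
    return int(math.ceil(raw_mi / step_mi) * step_mi)


if __name__ == "__main__":
    # Example: a container that peaked at 150Mi over the last 7 days.
    value = target_memory_mi(150 * MI)
    print(f"requests.memory = limits.memory = {value}Mi")
```

Rounding up rather than to the nearest step keeps the result at or above the 1.3x target, at the cost of a few extra Mi per container.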
args:
  - "--kubelet-insecure-tls"
resources:
  requests:
    cpu: 50m
    memory: 200Mi
  limits:
    memory: 200Mi