- Remove ClusterMemoryRequestsHigh, ContainerNearOOM, NodeLowFreeMemory, NodeMemoryPressureTrending — all fire regularly due to intentional memory overcommit and are not actionable - Keep ContainerOOMKilled (actionable — container actually died) - Raise HighServiceLatency p99 threshold from 10s to 30s to ignore transient spikes |
||
|---|---|---|
| .. | ||
| modules/monitoring | ||
| main.tf | ||
| secrets | ||
| terragrunt.hcl | ||
| tiers.tf | ||