infra/modules/kubernetes/monitoring
Viktor Barzin 26ba9ea371 [ci skip] Fix Prometheus storage alert and Grafana quota exhaustion
- Enable size-based TSDB retention (45GB) to clean up old blocks
  (including 2021-era blocks with failed compaction)
- Increase monitoring namespace quota from 64/128Gi to 80/160Gi
  CPU/memory limits to allow Grafana rolling updates
2026-02-21 21:04:08 +00:00
..
dashboards [ci skip] Implement multi-user Kubernetes access with OIDC 2026-02-17 21:42:39 +00:00
server-power-cycle remove kubectl manifests bc drone is not happy running them :/ 2021-05-08 14:03:34 +01:00
alloy.yaml [ci skip] Implement multi-user Kubernetes access with OIDC 2026-02-17 21:42:39 +00:00
Dockerfile add repo for the dockerfile for the redifsh exporter [ci skip] 2023-10-24 11:46:18 +00:00
grafana.tf [ci skip] Remove Authentik forward auth from Grafana, add admin password management 2026-02-18 21:40:32 +00:00
grafana_chart_values.yaml [ci skip] Remove Authentik forward auth from Grafana, add admin password management 2026-02-18 21:40:32 +00:00
idrac.tf [ci skip] Fix Alloy OOMKill and iDRAC priority class conflict 2026-02-16 20:09:53 +00:00
k8s-monitoring-values.yaml add loki + alloy deployments for logs collection [ci skip] 2025-05-04 11:25:39 +00:00
loki.tf [ci skip] Bump inotify max_user_instances from 512 to 8192 2026-02-21 20:21:04 +00:00
loki.yaml [ci skip] Fix compactor/ruler paths to use writable /var/loki mount 2026-02-13 23:22:13 +00:00
main.tf [ci skip] Fix Prometheus storage alert and Grafana quota exhaustion 2026-02-21 21:04:08 +00:00
prometheus.tf replace hardcoded namespace with module reference [ci skip] 2025-12-29 10:23:42 +00:00
prometheus_chart_values.tpl [ci skip] Fix Prometheus storage alert and Grafana quota exhaustion 2026-02-21 21:04:08 +00:00
pve_exporter.tf add tier to all deployments [ci skip] 2026-01-10 16:28:14 +00:00
snmp_exporter.tf reduce the frequency of polling idrac and remove some duplicates [ci skip] 2026-01-24 18:47:22 +00:00
ups_snmp_values.yaml add 2 more oids for ups to monitor active and reactive power consumption [ci skip] 2025-03-15 17:54:04 +00:00