[ci skip] Reduce node config drift: GPU label, OIDC idempotency, node-exporter, rebuild docs

- Add gpu=true label to Terraform (nvidia null_resource alongside taint)
- Improve API server OIDC config to detect value changes, not just flag presence
- Add policy_hash trigger to audit-policy so rule changes auto-reapply
- Enable prometheus-node-exporter sub-chart, delete unused Ansible playbook
- Document full node rebuild procedure in CLAUDE.md
- Save Talos Linux migration evaluation for future reference
This commit is contained in:
Viktor Barzin 2026-02-22 22:59:38 +00:00
parent ff66adbe9e
commit cf67e02135
No known key found for this signature in database
GPG key ID: 0EB088298288D958
8 changed files with 369 additions and 78 deletions

View file

@ -101,8 +101,8 @@ alertmanager:
# web.external-url seems to be hardcoded, edited deployment manually
# extraArgs:
# web.external-url: "https://prometheus.viktorbarzin.me"
# prometheus-node-exporter:
# enabled: true
prometheus-node-exporter:
enabled: true
server:
# Enable me to delete metrics
extraFlags: