Commit graph

924 commits

Author SHA1 Message Date
Viktor Barzin
a1d945a0b2 add prometheus alerts for deployment/statefulset/daemonset replica mismatches [ci skip]
- Add DeploymentReplicasMismatch alert
- Add StatefulSetReplicasMismatch alert
- Add DaemonSetMissingPods alert
- Add .claude/ directory with remote executor and knowledge base
2026-01-18 11:04:51 +00:00
Viktor Barzin
e8e18d466e do not unload immich ML model [ci skip] 2026-01-17 23:39:46 +00:00
Viktor Barzin
62be47b2a9 add cronjob for weekly backups of nextcloud [ci skip] 2026-01-17 23:35:10 +00:00
Viktor Barzin
185a138cc5 dedup ram alert and increase threshold to 95% [ci skip] 2026-01-17 22:42:22 +00:00
Viktor Barzin
e837b41b48 add freedify [ci skip] 2026-01-17 22:40:35 +00:00
Viktor Barzin
19184ee22f upgrade nextcloud and add external redis [ci skip] 2026-01-17 20:50:29 +00:00
Viktor Barzin
1ace09f1ea add emo instance for actual budget [ci skip] 2026-01-17 15:01:29 +00:00
Viktor Barzin
474da4efe5 add speedtest deployment [ci skip] 2026-01-13 20:34:44 +00:00
Viktor Barzin
61e318398c scale grafana to 3 pods for resilience [ci skip] 2026-01-12 18:27:54 +00:00
Viktor Barzin
b5db0dd7cf scale pgbouncer to 3 for resilience and run them on separate nodes [ci skip] 2026-01-12 18:27:54 +00:00
Viktor Barzin
155c0dece4 upgrade vaultwarden [ci skip] 2026-01-10 22:47:22 +00:00
Viktor Barzin
84bf53eaca upgrade ollama [ci skip] 2026-01-10 22:47:10 +00:00
Viktor Barzin
da38c7bb30 move some tiers around [ci skip] 2026-01-10 22:47:00 +00:00
Viktor Barzin
8a64640194 run descheduler hourly for more frequent updates [ci skip] 2026-01-10 21:03:42 +00:00
Viktor Barzin
fb84affce6 disable auth-response-headers for idrac and gw ingresses as they cause errors on the upstream [ci skip] 2026-01-10 20:41:00 +00:00
Viktor Barzin
235a469dea add credentials for ab bank sync cronjob [ci skip] 2026-01-10 20:01:06 +00:00
Viktor Barzin
ff9e431544 sclae tuya bridge to 3 pods for resilience [ci skip] 2026-01-10 19:27:57 +00:00
Viktor Barzin
445506b1d5 move crowdsec to croe services [ci skip] 2026-01-10 19:27:32 +00:00
Viktor Barzin
aa6dd13b48 add actualbudget-http-api plus a cronjob to periodically run bank sync [ci skip] 2026-01-10 19:27:14 +00:00
Viktor Barzin
f1e9fb9afe add tier to all deployments [ci skip] 2026-01-10 16:28:14 +00:00
Viktor Barzin
1b5cbeb9c8 monitor idrac more frequently [ci skip] 2026-01-07 18:55:59 +00:00
Viktor Barzin
7e8f73452c add ipv6 addresses to the ingress factory [ci skip] 2026-01-07 18:54:37 +00:00
Viktor Barzin
9edab32199 disable sidekiq as it is not working [ci skip] 2026-01-07 18:54:22 +00:00
Viktor Barzin
8d7a926e6f pin version [ci skip] 2026-01-05 20:17:02 +00:00
Viktor Barzin
934fa34c79 update cpu temp alert to above 60 [ci skip] 2026-01-04 12:26:46 +00:00
Viktor Barzin
01d4c9c3e1 update definition of high cpu usage to use pve metrics in stead for a longer period [ci skip] 2026-01-03 23:30:28 +00:00
Viktor Barzin
75a110c932 store the aiostreams secret key in resource to keep it persistent [ci skip] 2026-01-03 23:14:02 +00:00
Viktor Barzin
3a19f4c8a9 add netbox, ebook2audiobook, audiblez, aiostreams and listenarr; alos reenable prowlarr, qbittorrent [ci skip] 2026-01-03 16:58:57 +00:00
Viktor Barzin
fe01220c6e add netbox [ci skip] 2026-01-03 16:49:16 +00:00
Viktor Barzin
31c403cadb update cpu temp alert to 55C down from 75C [ci skip] 2026-01-03 16:48:54 +00:00
Viktor Barzin
c8354470a0 upgrade dawarich [ci skip] 2026-01-03 16:48:24 +00:00
Viktor Barzin
b1486c1de7 increase leakspeed on 403 rule [ci skip] 2025-12-29 22:07:19 +00:00
Viktor Barzin
d37c693a94 increase idrac scrape timeout in attempt to reduce 499 [ci skip] 2025-12-29 20:34:40 +00:00
Viktor Barzin
26be631088 fix some typos [ci skip] 2025-12-29 20:16:53 +00:00
Viktor Barzin
3b7d295119 add nginx reverse proxy to serialize registyr requests for the same path to avoid race conditions [ci skip] 2025-12-29 20:16:13 +00:00
Viktor Barzin
c03f57d807 refactor cloudflared module to make changing between for_each and count easier [ci skip] 2025-12-29 12:22:55 +00:00
Viktor Barzin
253e77f22d add registry low cache hit rate alert [ci skip] 2025-12-29 10:43:57 +00:00
Viktor Barzin
f1dde96d80 replace hardcoded namespace with module reference [ci skip] 2025-12-29 10:23:42 +00:00
Viktor Barzin
450bc96db8 add startup_shutdown to qemu vms to avoid metadata reset [ci skip] 2025-12-29 10:19:22 +00:00
Viktor Barzin
191abee1b6 reorder defcon services [ci skip] 2025-12-28 21:10:36 +00:00
Viktor Barzin
7551985f12 migrate to for_each when defining cloudflare dns records [ci skip] 2025-12-28 21:04:14 +00:00
Viktor Barzin
5d70f9e602 add depends_on to all modules [ci skip] 2025-12-28 20:51:14 +00:00
Viktor Barzin
cb42771a57 add some more headers when authenticating with authentik [ci skip] 2025-12-28 20:07:50 +00:00
Viktor Barzin
cf9d346cae add more alerts in prometheus and gorup them better [ci skip] 2025-12-28 20:07:33 +00:00
Viktor Barzin
a595c4db56 move out all monitoring resources to separate tf files [ci skip] 2025-12-28 20:07:00 +00:00
Viktor Barzin
26d55c6637 move grafana into separate file and tunr off persistence as we use external db now [ci skip] 2025-12-28 20:05:27 +00:00
Viktor Barzin
8b28288360 add authelia and tnadoor to the defcon levels [ci skip] 2025-12-28 20:04:36 +00:00
Viktor Barzin
7daa8304fa add debug option in authentik helm [ci skip] 2025-12-28 20:03:37 +00:00
Viktor Barzin
14538a02c0 add semi working authelia [ci skip] 2025-12-28 20:02:28 +00:00
Viktor Barzin
a0321d4473 upgrade authentik to 10.3 [ci skip] 2025-12-28 20:01:43 +00:00