Commit graph

197 commits

Author SHA1 Message Date
Viktor Barzin
e8ff760aff [ci skip] openclaw: cache tools on NFS for fast restarts
- Switch /tools volume from emptyDir to NFS (/mnt/main/openclaw/tools)
- Skip download of kubectl, terraform, terragrunt, pip packages if cached
- Startup time: ~2.5min → ~38s on subsequent restarts
2026-03-01 13:59:07 +00:00
Viktor Barzin
2b22c90a56 [ci skip] Phase 2: migrate Redis from NFS to local disk
- Switch from redis/redis-stack:latest to redis:7-alpine
  (modules were completely unused — zero module commands in stats)
- Move data from NFS (/mnt/main/redis) to local-path PVC
  (RDB saves: 39s on NFS → <1s on local disk)
- Start fresh (old RDB had redis-stack module data incompatible with plain redis;
  all Redis data is transient — queues and caches rebuild automatically)
- Add hourly redis-backup CronJob: redis-cli --rdb to NFS for backup pipeline
- Remove RedisInsight UI ingress (port 8001, only in redis-stack)
- Add redis-backup to NFS exports
- 110 clients reconnected immediately after switchover
- Memory savings: ~100MB from dropping unused modules
2026-02-28 19:44:08 +00:00
Viktor Barzin
a1ba218cd2 [ci skip] Phase 1: PostgreSQL migrated to CNPG on local disk
Major milestone - shared PostgreSQL moved from NFS to CloudNativePG:
- CNPG cluster (pg-cluster) running in dbaas namespace on local-path storage
- PostGIS image (ghcr.io/cloudnative-pg/postgis:16) for dawarich compatibility
- All 20 databases and 19 roles restored from pg_dumpall backup
- postgresql.dbaas Service patched to point at CNPG primary
- Old PG deployment scaled to 0 (NFS data intact for rollback)
- All 12+ dependent services verified running:
  authentik, n8n, dawarich, tandoor, linkwarden, netbox, woodpecker,
  rybbit, affine, health, resume, trading-bot, atuin
- Authentik PgBouncer working through the switched endpoint

TODO: codify CNPG cluster in Terraform, add 2nd replica, update backup CronJob
2026-02-28 19:08:06 +00:00
Viktor Barzin
0274cc0722 [ci skip] technitium: add primary-secondary DNS HA with AXFR zone replication
Secondary instance on a separate node replicates all zones from primary via
zone transfer. LoadBalancer routes DNS queries to both pods. PDB ensures at
least 1 DNS pod survives voluntary disruptions. Setup job automates zone
transfer enablement and secondary zone creation via Technitium REST API.
2026-02-28 14:14:20 +00:00
Viktor Barzin
c8de2c4803 [ci skip] Sunset Drone CI: remove all artifacts, DNS, configs, and references
Drone CI has been fully replaced by Woodpecker CI at ci.viktorbarzin.me.
Destroys K8s resources (12), removes DNS records, NFS exports, Uptime Kuma
monitor, dashboard entry, and all code/doc references across 18 files.
2026-02-23 19:38:55 +00:00
Viktor Barzin
cbf041bcc9 [ci skip] Add Woodpecker CI stack (WIP) and claude agents
- Add stacks/woodpecker/ with Helm-based deployment config
- Add .woodpecker/ CI pipeline configs (default, build-cli, renew-tls)
- Add NFS export entry for woodpecker
- Add .claude/agents/ definitions
2026-02-22 21:30:25 +00:00
Viktor Barzin
c277d28bd8 [ci skip] Add NFS export and DNS record for poison-fountain 2026-02-22 19:47:46 +00:00
Viktor Barzin
2fe7fa547c [ci skip] Configure f1-stream: WebAuthn, NFS storage, headless browser
- Set WEBAUTHN_RPID/ORIGIN for f1.viktorbarzin.me domain
- Add NFS volume at /mnt/main/f1-stream for persistent session/stream data
- Enable headless browser extraction (HEADLESS_EXTRACT_ENABLED=true)
- Reduce replicas to 1 (file-based sessions don't work across replicas)
2026-02-21 15:57:25 +00:00
Viktor Barzin
843b9658d5 [ci skip] Rename moltbot to openclaw across Terraform, K8s resources, and DNS
Update terraform version in init container from 1.12.1 to 1.14.5.
2026-02-18 21:53:46 +00:00
Viktor Barzin
a73f3fcb6b Cluster health remediation: cleanup CronJob, disable Collabora, fix GPU probe, add NFS exports [ci skip]
- Add daily CronJob to auto-clean Failed/Evicted pods cluster-wide (infra-maintenance)
- Disable Collabora in Nextcloud (broken HPA caused scaling storm; using OnlyOffice instead)
- Increase gpu-pod-exporter liveness probe timeout from 1s to 5s
- Add osm-routing NFS exports (osrm-data, otp-data)
2026-02-15 17:20:47 +00:00
Viktor Barzin
5a37c26e9b Drone CI Update TLS Certificates Commit 2026-02-15 00:05:36 +00:00
Viktor Barzin
69aae2ec9d [ci skip] Fix code review findings: correct Alertmanager URL, add atomic to Loki, remove dead minio NFS export, update design doc 2026-02-13 23:08:44 +00:00
Viktor Barzin
a44dfac721 [ci skip] Deploy MoltBot (OpenClaw) AI agent gateway
Add new Kubernetes service for OpenClaw gateway connected to in-cluster
Ollama, with kubectl/terraform/git access for infrastructure management.
Protected behind Authentik SSO.
2026-02-13 22:57:36 +00:00
Viktor Barzin
861cd80c64 add the nfs dirs 2026-02-08 02:29:48 +00:00
Viktor Barzin
65a228632b Drone CI Update TLS Certificates Commit 2026-02-08 00:04:51 +00:00
Viktor Barzin
34f13a2afb Drone CI Update TLS Certificates Commit 2026-01-25 00:06:38 +00:00
Viktor Barzin
df14912198 Drone CI Update TLS Certificates Commit 2026-01-11 00:26:26 +00:00
Viktor Barzin
888d995e3f Drone CI Update TLS Certificates Commit 2026-01-04 00:08:25 +00:00
Viktor Barzin
eccf849e2f Drone CI Update TLS Certificates Commit 2025-12-28 00:06:37 +00:00
Viktor Barzin
dfd33fe323 Drone CI Update TLS Certificates Commit 2025-12-14 00:06:32 +00:00
Viktor Barzin
d153af46d7 Drone CI Update TLS Certificates Commit 2025-11-30 00:06:37 +00:00
Viktor Barzin
9fd0d763c1 Drone CI Update TLS Certificates Commit 2025-11-23 00:06:38 +00:00
Viktor Barzin
dbabafa4d8 Drone CI Update TLS Certificates Commit 2025-11-16 00:13:56 +00:00
Viktor Barzin
eddb3f0484 Drone CI Update TLS Certificates Commit 2025-11-02 00:05:23 +00:00
Viktor Barzin
562040bd45 Drone CI Update TLS Certificates Commit 2025-10-26 00:05:12 +00:00
Viktor Barzin
7e7b6f638a Drone CI Update TLS Certificates Commit 2025-10-19 00:05:29 +00:00
Viktor Barzin
22a32c5de4 Drone CI Update TLS Certificates Commit 2025-10-12 00:13:18 +00:00
Viktor Barzin
ed455b8375 Drone CI Update TLS Certificates Commit 2025-09-28 00:06:34 +00:00
Viktor Barzin
662e4caad1 Drone CI Update TLS Certificates Commit 2025-09-14 00:07:51 +00:00
Viktor Barzin
b99ebcf75d Drone CI Update TLS Certificates Commit 2025-09-07 00:07:08 +00:00
Viktor Barzin
5dea499248 Drone CI Update TLS Certificates Commit 2025-08-31 00:06:53 +00:00
Viktor Barzin
3075b565a2 Drone CI Update TLS Certificates Commit 2025-08-24 00:17:41 +00:00
Viktor Barzin
cd274741f5 Drone CI Update TLS Certificates Commit 2025-08-10 00:06:18 +00:00
Viktor Barzin
3f645a4e2c Drone CI Update TLS Certificates Commit 2025-08-03 00:06:52 +00:00
Viktor Barzin
04fb2397f3 Drone CI Update TLS Certificates Commit 2025-07-27 00:11:18 +00:00
Viktor Barzin
2e648458a9 Drone CI Update TLS Certificates Commit 2025-07-13 00:05:39 +00:00
Viktor Barzin
fc8ecb22db Drone CI Update TLS Certificates Commit 2025-07-06 00:05:59 +00:00
Viktor Barzin
c6e71f4e81 Drone CI Update TLS Certificates Commit 2025-06-29 00:07:00 +00:00
Viktor Barzin
1c4baf51d2 Drone CI Update TLS Certificates Commit 2025-06-22 00:10:18 +00:00
Viktor Barzin
16a524a4e8 Drone CI Update TLS Certificates Commit 2025-06-21 13:10:40 +00:00
Viktor Barzin
a41bb1b150 Drone CI Update TLS Certificates Commit 2025-06-08 00:04:20 +00:00
Viktor Barzin
24c240ec69 Drone CI Update TLS Certificates Commit 2025-05-25 00:03:39 +00:00
Viktor Barzin
75c8e37495 Drone CI Update TLS Certificates Commit 2025-05-18 00:04:18 +00:00
Viktor Barzin
80c25b7cc5 Drone CI Update TLS Certificates Commit 2025-05-11 00:05:34 +00:00
Viktor Barzin
c8ddf1425c Drone CI Update TLS Certificates Commit 2025-05-04 00:07:32 +00:00
Viktor Barzin
c883e5d8b7 Drone CI Update TLS Certificates Commit 2025-04-20 00:05:47 +00:00
Viktor Barzin
fec9a67bb4 Drone CI Update TLS Certificates Commit 2025-04-13 00:03:57 +00:00
Viktor Barzin
ef958a6908 Drone CI Update TLS Certificates Commit 2025-04-06 00:04:54 +00:00
Viktor Barzin
668a736b50 Drone CI Update TLS Certificates Commit 2025-03-30 00:04:10 +00:00
Viktor Barzin
5050f7b63b Drone CI Update TLS Certificates Commit 2025-03-23 00:08:42 +00:00