stem95su: scheduled Drive->site sync CronJob (every 10m)

CronJob stem95su-gdrive-sync (*/10) mounts the content PVC RW and rclone-syncs the read-only Drive folder "claude" (stem claude/files) onto it (rclone/rclone:1.74.3, scope=drive.readonly, empty-source guard + --max-delete 25). ESO ExternalSecret stem95su-rclone <- Vault secret/stem95su. Requires the GCP OAuth app published to Production or the refresh token expires ~weekly. Lands the gdrive-sync stack on master (it had landed on a feature branch by accident on the shared devvm checkout). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-09 08:42:26 +00:00 · 2026-06-09 08:42:26 +00:00 · 6d224861c4
commit 6d224861c4
parent 05b50d2b96
1168 changed files with 120 additions and 358547 deletions
--- a/scripts/setup-forgejo-containerd-mirror.sh
+++ b/scripts/setup-forgejo-containerd-mirror.sh
@ -1,60 +0,0 @@
-#!/usr/bin/env bash
-# One-shot deployment of the forgejo.viktorbarzin.me containerd hosts.toml
-# entry across every k8s node. Cloud-init only fires on VM provision, so
-# existing nodes need this manual rollout.
-#
-# What it does, per node:
-#   1. drain (ignore-daemonsets, delete-emptydir-data)
-#   2. ssh in: mkdir + write /etc/containerd/certs.d/forgejo.viktorbarzin.me/hosts.toml
-#   3. systemctl restart containerd
-#   4. uncordon
-#
-# hosts.toml is documented as hot-reloaded but the post-2026-04-19
-# containerd corruption playbook calls for an explicit restart so the
-# config is unambiguously in effect. Running drain/uncordon around it
-# avoids pulling against an in-flight containerd restart.
-#
-# Re-run is safe: writes are idempotent.
-
-set -euo pipefail
-
-CERTS_DIR=/etc/containerd/certs.d/forgejo.viktorbarzin.me
-HOSTS_TOML='server = "https://forgejo.viktorbarzin.me"
-
-[host."https://10.0.20.203"]
-  capabilities = ["pull", "resolve"]
-  skip_verify = true
-'
-
-NODES=$(kubectl get nodes -o name | sed 's|^node/||')
-if [[ -z "$NODES" ]]; then
-  echo "ERROR: no nodes returned from kubectl get nodes" >&2
-  exit 1
-fi
-
-for n in $NODES; do
-  echo "=== $n ==="
-  kubectl drain "$n" --ignore-daemonsets --delete-emptydir-data --force --grace-period=60
-
-  ssh -o StrictHostKeyChecking=accept-new "wizard@$n" sudo bash <<EOF
-set -euo pipefail
-mkdir -p "$CERTS_DIR"
-cat > "$CERTS_DIR/hosts.toml" <<'TOML'
-$HOSTS_TOML
-TOML
-systemctl restart containerd
-EOF
-
-  kubectl uncordon "$n"
-
-  # Wait for the node to report Ready before moving to the next one.
-  for i in {1..30}; do
-    if kubectl get node "$n" -o jsonpath='{.status.conditions[?(@.type=="Ready")].status}' | grep -q True; then
-      echo "    node Ready"
-      break
-    fi
-    sleep 2
-  done
-done
-
-echo "All nodes updated."