infra/docs/plans
Viktor Barzin 7e558de8f0 openclaw: SSH + tmux task fallback to devvm
Give the OpenClaw pod two new capabilities:

1. Host-tools bundle. New init container `install-host-tools` extracts
   openssh-client + dnsutils + tmux + jq + ripgrep + fd + vault + yq +
   friends into /tools/host-tools/, with the bookworm-slim libs the
   binaries need. PATH + LD_LIBRARY_PATH on the main container point
   ld.so at the bundle. Idempotent via /tools/host-tools/.installed-v1
   marker; smoke test (ldd-based) fails the init at deploy time if any
   binary has unresolved deps. Bundle is ~558 MB on the existing
   /srv/nfs/openclaw/tools NFS.

2. devvm SSH + async task pattern. New init `setup-ssh-config` writes
   id_rsa/config/known_hosts under /home/node/.openclaw/.ssh; main
   container startup symlinks /home/node/.ssh → there. New
   /usr/local/bin/openclaw-task wrapper on devvm manages long-running
   work as tmux sessions on devvm (sessions and logs survive pod
   restarts — they live on devvm, not in the pod). New init container
   `seed-devvm-memory-note` drops a markdown note teaching the pattern;
   main container startup now runs `openclaw memory index --force` so
   the note is searchable on first boot.

Design + verified E2E flow in
docs/plans/2026-05-22-openclaw-devvm-access-design.md. Persistence test
green: spawned a 50s task from pod A, deleted pod A, new pod B saw the
task finish and read its full log.

Pre-existing keel.sh annotation drift on openclaw/{openlobster,
task_webhook} cleaned up in the same apply.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-22 14:17:01 +00:00
..
2026-02-22-anti-ai-scraping-design.md [docs] Update anti-AI and rybbit docs after rewrite-body removal 2026-04-17 21:43:13 +00:00
2026-02-22-anti-ai-scraping-plan.md [ci skip] Add anti-AI scraping implementation plan 2026-02-22 19:41:39 +00:00
2026-02-22-node-drift-quick-wins-design.md [ci skip] Reduce node config drift: GPU label, OIDC idempotency, node-exporter, rebuild docs 2026-02-22 22:59:38 +00:00
2026-02-22-talos-linux-migration-evaluation.md [ci skip] Reduce node config drift: GPU label, OIDC idempotency, node-exporter, rebuild docs 2026-02-22 22:59:38 +00:00
2026-02-23-mailserver-hardening-design.md mailserver: overhaul inbound delivery, monitoring, CrowdSec, and migrate to Brevo relay 2026-04-12 22:24:38 +01:00
2026-02-23-mailserver-hardening-plan.md mailserver: overhaul inbound delivery, monitoring, CrowdSec, and migrate to Brevo relay 2026-04-12 22:24:38 +01:00
2026-02-28-ci-build-caching-design.md [ci skip] add CI build caching design doc 2026-02-28 17:43:42 +00:00
2026-02-28-ci-build-caching-plan.md [ci skip] update CI caching plan: add Terraform provisioning for private registry 2026-02-28 17:51:55 +00:00
2026-02-28-network-visualization-design.md [ci skip] add network traffic visualization design doc 2026-02-28 18:14:42 +00:00
2026-02-28-network-visualization-plan.md [ci skip] add network visualization implementation plan 2026-02-28 18:19:36 +00:00
2026-02-28-storage-reliability-design.md [ci skip] expand k8s worker nodes to 256G, update inventory and extend script 2026-02-28 16:00:16 +00:00
2026-03-01-nfs-csi-migration-design.md [ci skip] add NFS CSI migration design doc and implementation plan 2026-03-01 23:30:27 +00:00
2026-03-01-nfs-csi-migration-plan.md [ci skip] add NFS CSI migration design doc and implementation plan 2026-03-01 23:30:27 +00:00
2026-03-01-traefik-resilience-design.md [ci skip] add Traefik resilience hardening design doc 2026-03-01 13:50:00 +00:00
2026-03-01-traefik-resilience-plan.md [ci skip] add Traefik resilience hardening implementation plan 2026-03-01 13:53:50 +00:00
2026-03-02-security-observability-design.md [ci skip] add security observability layer design document 2026-03-02 21:13:01 +00:00
2026-03-03-cluster-hardening-design.md [ci skip] add auto-generated tiers.tf, planning docs, and helm chart cache 2026-03-06 23:55:57 +00:00
2026-03-07-k8s-portal-onboarding-plan.md [ci skip] k8s portal: fix setup script + add onboarding hub (5 new pages) 2026-03-07 15:06:26 +00:00
2026-03-07-sops-migration-design.md [ci skip] add SOPS multi-user secrets migration design (v3, reviewed 3x) 2026-03-07 13:55:05 +00:00
2026-03-28-storage-migration-truenas-elimination.md migrate ebooks-calibre to proxmox-lvm, update storage docs [ci skip] 2026-04-03 19:45:34 +03:00
2026-04-03-proxmox-csi-cleanup-todo.md add proxmox-csi cleanup TODO for post-migration tasks [ci skip] 2026-04-03 20:02:14 +03:00
2026-04-20-infra-audit-design.md docs/plans: 2026-04-20 infra audit design (post-research, post-challenge) 2026-05-22 14:16:41 +00:00
2026-04-25-nfs-hostile-migration-design.md vault: migrate vault-0 storage to proxmox-lvm-encrypted 2026-04-25 16:19:49 +00:00
2026-04-25-nfs-hostile-migration-plan.md vault: record Phase 3 vault Released-PV cleanup 2026-04-25 23:08:45 +00:00
2026-05-07-forgejo-registry-consolidation-design.md [forgejo] Phase 0 of registry consolidation: prepare Forgejo OCI registry 2026-05-07 23:29:33 +00:00
2026-05-07-forgejo-registry-consolidation-plan.md [forgejo] Phase 0 of registry consolidation: prepare Forgejo OCI registry 2026-05-07 23:29:33 +00:00
2026-05-16-auto-upgrade-apps-design.md Phase 0: install Keel + Kyverno auto-update annotation injector 2026-05-22 14:16:48 +00:00
2026-05-16-auto-upgrade-apps-plan.md Phase 0: install Keel + Kyverno auto-update annotation injector 2026-05-22 14:16:48 +00:00
2026-05-17-agent-presence-plan.md docs/plans: add agent presence implementation plan (2026-05-17) 2026-05-22 14:16:56 +00:00
2026-05-19-mysql-8.4.9-upgrade-design.md docs: design + plan for MySQL 8.4.8 → 8.4.9 upgrade 2026-05-22 14:16:59 +00:00
2026-05-19-mysql-8.4.9-upgrade-plan.md docs: design + plan for MySQL 8.4.8 → 8.4.9 upgrade 2026-05-22 14:16:59 +00:00
2026-05-21-ha-control-plane-design.md docs: HA control plane design (3 masters) 2026-05-22 14:17:00 +00:00
2026-05-22-openclaw-devvm-access-design.md openclaw: SSH + tmux task fallback to devvm 2026-05-22 14:17:01 +00:00