6d224861 came from a --no-checkout worktree whose empty index made the
commit drop every file except two. This restores 05b50d2b's full tree and
correctly adds stacks/stem95su/gdrive-sync.tf + the service-catalog stem95su
entry. Forward-only (parent=6d224861, no force-push); [ci skip] since the
live infra was never applied from the broken commit.
Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
3.8 KiB
Runbook: Proxmox host (pve, 192.168.1.127)
Last updated: 2026-04-19
The Proxmox host is a baremetal hypervisor on the storage LAN
(192.168.1.0/24) with a single IP 192.168.1.127. It hosts every
Kubernetes node VM and the NFS exports that back PVCs. It does not
receive DHCP — its network config is static in
/etc/network/interfaces (ifupdown). Because of that, DNS must be
configured manually and stays out of the scope of Kea/DHCP-DDNS.
DNS configuration
The host uses a plain /etc/resolv.conf with two nameservers. No
systemd-resolved, no resolvconf, no NetworkManager — nothing
manages /etc/resolv.conf; it is a regular file owned by root.
Why plain /etc/resolv.conf and not systemd-resolved
- Installing
systemd-resolvedon an active Proxmox node during business hours is the kind of change that risks breaking the NFS server or VM networking. PVE's Debian base does not shipsystemd-resolvedby default. - The ifupdown
/etc/network/interfacesfile does not manage/etc/resolv.confhere — ifupdown's resolvconf integration is only active if theresolvconfpackage is installed, which it is not (dpkg -l resolvconfreturnsun). - A plain file is the simplest mental model and avoids a second layer of "which tool is running now" confusion during an incident.
If you ever want to migrate to systemd-resolved, install the
package, enable the service, symlink /etc/resolv.conf to
/run/systemd/resolve/stub-resolv.conf, and drop the config in
/etc/systemd/resolved.conf.d/10-internal-dns.conf — but do this
during a maintenance window, not reactively.
Current state
# /etc/resolv.conf
search viktorbarzin.lan
nameserver 192.168.1.2
nameserver 94.140.14.14
options timeout:2 attempts:2
| Field | Value | Purpose |
|---|---|---|
| Primary | 192.168.1.2 |
pfSense LAN interface (dnsmasq forwarder → Technitium LB) — resolves .viktorbarzin.lan |
| Fallback | 94.140.14.14 |
AdGuard public DNS — recursive only, used if pfSense LAN IP unreachable |
search |
viktorbarzin.lan |
Unqualified names (technitium, idrac, etc.) resolve against the internal zone |
timeout:2 attempts:2 |
— | Cap glibc resolver at 2s per server, 2 tries — reasonable fallback latency |
Verification commands
ssh root@192.168.1.127 '
cat /etc/resolv.conf # should show the two nameservers
dig +short idrac.viktorbarzin.lan # expect an A record (192.168.1.4)
dig +short github.com # expect an A record
'
Simulated failover — force the primary unreachable and verify the fallback answers:
ssh root@192.168.1.127 '
ip route add blackhole 192.168.1.2
dig +short +time=3 github.com # glibc times out on primary, tries 94.140.14.14 → A record returned
ip route del blackhole 192.168.1.2 # cleanup
'
Expected behaviour: the first dig prints a warning about the UDP
setup failing for 192.168.1.2 and then prints the GitHub A record
(answered by 94.140.14.14).
Rollback
A pre-change backup of /etc/resolv.conf, /etc/network/interfaces,
and /etc/network/interfaces.d/ lives at
/root/dns-backups/dns-config-backup-YYYYMMDD-HHMMSS.tar.gz on the
host. To roll back:
ssh root@192.168.1.127 '
# pick the backup you want (there may be multiple if this runbook has been applied more than once)
BACKUP=$(ls -t /root/dns-backups/dns-config-backup-*.tar.gz | head -1)
tar -xzf "$BACKUP" -C /
cat /etc/resolv.conf
'
No service restart is needed — glibc re-reads /etc/resolv.conf per
lookup.
Related docs
docs/architecture/dns.md— where each resolver IP lives and which subnet it serves.docs/runbooks/nfs-prerequisites.md— other operations on this host; read before adding new NFS exports.