docs: consolidate all post-mortems under docs/post-mortems/

Move HTML post-mortems from repo root post-mortems/ to docs/post-mortems/.
Update index.html with all 3 incidents (newest first).

[ci skip]

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
This commit is contained in:
Viktor Barzin 2026-04-14 08:24:36 +00:00
parent bdba15a387
commit 4e059b138c
2 changed files with 16 additions and 0 deletions

View file

@ -108,6 +108,22 @@ footer {
<h1>Post-Mortems</h1> <h1>Post-Mortems</h1>
<p class="subtitle">Incident reviews for the viktorbarzin.me Kubernetes cluster</p> <p class="subtitle">Incident reviews for the viktorbarzin.me Kubernetes cluster</p>
<ul class="incident-list"> <ul class="incident-list">
<li class="incident-item">
<a href="2026-04-14-nfs-fsid0-dns-vault-outage.md">
<span class="incident-date">2026-04-14</span>
<span class="sev-tag">SEV 1</span>
<div class="incident-title">NFS fsid=0 Cascade &mdash; DNS + Vault + Multi-Service Outage</div>
<div class="incident-desc">5h outage: fsid=0 in PVE /etc/exports broke NFSv4 subdirectory mounts &rarr; Technitium primary I/O errors &rarr; Vault lost quorum &rarr; Alertmanager blind &rarr; 25+ pods affected across 15+ namespaces.</div>
</a>
</li>
<li class="incident-item">
<a href="2026-03-16-nfs-csi-cascade-failure.md">
<span class="incident-date">2026-03-16</span>
<span class="sev-tag">SEV 1</span>
<div class="incident-title">NFS CSI Cascade Failure</div>
<div class="incident-desc">47h outage: NFS CSI driver liveness-probe port conflict &rarr; all NFS mounts fail &rarr; 40+ pods stuck across 20+ namespaces.</div>
</a>
</li>
<li class="incident-item"> <li class="incident-item">
<a href="2026-03-16-kured-containerd-cascade-outage.html"> <a href="2026-03-16-kured-containerd-cascade-outage.html">
<span class="incident-date">2026-03-16</span> <span class="incident-date">2026-03-16</span>