infra

Viktor Barzin abdef1781c	anubis: strict bot policy — catch-all CHALLENGE for unmatched UAs	2026-05-10 11:12:40 +00:00

The default upstream policy only WEIGHs Mozilla|Opera UAs and lets everything else (curl, wget, python-requests, scrapy, headless CLI scrapers) fall through to the implicit ALLOW. On non-CDN-fronted hosts (kms, anything dns_type=non-proxied) this meant a plain `curl https://kms.viktorbarzin.me/` returned the real backend content with no challenge, defeating the whole point of the "avoid casual scrapers" intent.

Now the module ships a custom POLICY_FNAME mounted via ConfigMap:
- Imports the upstream deny-pathological / ai-block-aggressive / allow-good-crawlers / keep-internet-working snippets unchanged
- Adds a final `path_regex: .*` → action: CHALLENGE catch-all

Result: only IP-verified search engines (Googlebot from Google IPs, Bingbot, etc.) and well-known paths (robots.txt, .well-known, favicon, sitemap) skip the challenge. Everything else, including spoofed-Googlebot-UA-from-random-IP, solves PoW or gets nothing.

Verified post-apply: curl default UA on viktorbarzin.me + kms + travel returns the Anubis challenge HTML; /robots.txt still 200s straight through.
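
The policy change described in the commit message above could look roughly like the fragment below. This is a minimal sketch, not the file actually committed. The import paths are assumptions based on the upstream Anubis data layout, and the rule name `catch-all-challenge` is hypothetical. It also assumes rules are evaluated top to bottom, so the catch-all only applies to requests that no imported rule has already allowed or denied.

```yaml
# Sketch of a custom policy file (the file POLICY_FNAME points at).
# Import paths are assumed from the upstream data layout; check them
# against the Anubis version actually deployed.
bots:
  # Upstream snippets, imported unchanged.
  - import: (data)/bots/_deny-pathological.yaml
  - import: (data)/meta/ai-block-aggressive.yaml
  - import: (data)/crawlers/_allow-good.yaml
  - import: (data)/common/keep-internet-working.yaml

  # Final catch-all: any request not matched above must solve the challenge.
  # The rule name is hypothetical.
  - name: catch-all-challenge
    path_regex: .*
    action: CHALLENGE
```

Placing the catch-all last matters under that ordering assumption: IP-verified crawlers and well-known paths match an earlier rule and pass through, while clients such as curl with a default UA reach the final rule and get the challenge.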
..
create-template-vm	Reduce disk write amplification across cluster (~200-350 GB/day savings) [ci skip]	2026-04-09 19:01:21 +00:00
create-vm	Reduce disk write amplification across cluster (~200-350 GB/day savings) [ci skip]	2026-04-09 19:01:21 +00:00
docker-registry	[forgejo] Phase 4 final decommission: drop registry-private container + port 5050	2026-05-07 23:29:34 +00:00
kubernetes	anubis: strict bot policy — catch-all CHALLENGE for unmatched UAs	2026-05-10 11:12:40 +00:00