infra

Viktor Barzin f201e4573e immich: fix slow context search — prewarm clip_index + latency alert/healthcheck Context (smart) search latency was caused by the 665MB vchord clip_index decaying out of PG shared_buffers (~33% resident -> ~1.8s cold ANN reads vs ~4ms warm), NOT by yesterday's ML MODEL_TTL/clip-keepalive change (CLIP textual is warm ~15ms on GPU). The postStart prewarm runs once at pod start and pg_prewarm.autoprewarm only re-warms at startup, so the index decays under job buffer-pressure over days. - clip-index-prewarm CronJob (immich, /5): pg_prewarm('clip_index') keeps the whole index resident -> searches stay ~4ms. - immich-search-probe CronJob (immich, /5): times a random-vector ANN query + reads clip_index residency, pushes gauges to the Pushgateway. - Prometheus alerts ImmichSmartSearchSlow / ImmichClipIndexColdCache / ImmichSearchProbeStale (+ inhibition when the probe is stale). - cluster_healthcheck.sh check #46 check_immich_search (TOTAL_CHECKS 45->46). - Docs: infra CLAUDE.md immich note, monitoring.md, cluster-health skill. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>		2026-06-05 09:19:07 +00:00
..
agents	k8s-version-upgrade: decompose into Job chain to fix self-preemption	2026-05-11 23:54:22 +00:00
commands	[ci skip] update kubectl skill to use local kubeconfig	2026-02-07 13:42:35 +00:00
reference	t3code: track t3 nightly via health-checked auto-updater	2026-06-02 19:24:30 +00:00
scripts	rename weekly-backup → daily-backup across scripts, timers, services, and docs [ci skip]	2026-04-13 18:37:04 +00:00
skills	immich: fix slow context search — prewarm clip_index + latency alert/healthcheck	2026-06-05 09:19:07 +00:00
calendar-query.py	sync regenerated providers.tf + upstream changes	2026-03-22 02:56:04 +02:00
CLAUDE.md	immich: fix slow context search — prewarm clip_index + latency alert/healthcheck	2026-06-05 09:19:07 +00:00
home-assistant-sofia.py	[ci skip] Add ha-sofia Home Assistant deployment to skills	2026-02-07 21:26:05 +00:00
home-assistant.py	add claude [ci skip]	2026-02-06 20:10:02 +00:00
internet-mode-used_DO_NOT_REMOVE_MANUALLY_SECURITY_RISK	add claude [ci skip]	2026-02-06 20:10:02 +00:00
pfsense.py	[ci skip] Add pfSense firewall management skill	2026-02-14 12:42:10 +00:00
settings.json	add claude files [ci skip]	2026-01-18 15:40:43 +00:00