qwen3-8b emits <think>...</think> chain-of-thought before the JSON
response by default, which trips the JSON parser and forces ~50% of
posts to escalate to claude-agent-service (discovered during first
bulk ingest, 2026-06-05).
Fix: pass chat_template_kwargs.enable_thinking=false in the request
body for the Tier 1 (llama-swap) call. Claude calls are unaffected.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>