34392e4097
The MI50 llama.cpp server was OOM-killed mid-run (LXC RAM limit + 8GB prompt cache), and summarize_all had no error handling, so a single APIConnectionError killed the whole batch.

Add retry-with-backoff around the summarization LLM call, and a try/except per session in summarize_all (log + skip; unsummarized sessions get retried on the next run).

Server-side: CT202 RAM raised and prompt cache disabled.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
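
For illustration only, the two changes described above might look roughly like the sketch below. It is not the code in this commit: the `call_with_backoff` helper, its parameters, the `summarize_one` callback, and the `summarize_all` signature are all assumed here, and the built-in `ConnectionError` stands in for the client library's `APIConnectionError`.

```python
import logging
import time

logger = logging.getLogger(__name__)


def call_with_backoff(fn, retry_on=(ConnectionError,), attempts=4,
                      base_delay=1.0, sleep=time.sleep):
    """Call fn(); on a retryable error, wait base_delay * 2**attempt and retry.

    Hypothetical helper: ConnectionError is a stand-in for APIConnectionError.
    """
    for attempt in range(attempts):
        try:
            return fn()
        except retry_on:
            if attempt == attempts - 1:
                raise  # out of retries: let the caller decide what to do
            sleep(base_delay * 2 ** attempt)


def summarize_all(sessions, summarize_one):
    """Summarize each session independently; log and skip any that fail.

    Assumed signature. Skipped sessions stay unsummarized, so a later run
    that selects unsummarized sessions picks them up again.
    """
    summaries, skipped = {}, []
    for session_id in sessions:
        try:
            summaries[session_id] = summarize_one(session_id)
        except Exception:
            logger.exception("summarization failed for session %s; skipping",
                             session_id)
            skipped.append(session_id)
    return summaries, skipped
```

In this sketch the retry absorbs short outages, such as the server restarting, while the per-session try/except keeps a longer outage from costing more than the sessions that were attempted while the server was down.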