Stabilize Claude Code for Open-Weight Models
A three-layer proxy stack to keep Claude Code stable when routing through LiteLLM to open-weight models.
Running Claude Code with open-weight models like DeepSeek or Qwen through a LiteLLM proxy works. Until it doesn't. The port changes every time LiteLLM restarts because it picks a random dynamic port. The request payload grows until backends reject it with 400 errors. Usage stats come back null and t...More ›