add local model support: Ollama, LM Studio, OpenAI via litellm proxy

2026-06-30 11:36:57 +10:00 · 2026-03-31 19:21:59 -04:00 · 2026-03-31 19:21:59 -04:00 · b48c66b480
commit b48c66b480
parent 1fff29dfdb
1 changed files with 26 additions and 1 deletions
--- a/README.md
+++ b/README.md
@ -49,7 +49,32 @@ export ANTHROPIC_API_KEY=your-key
 node cli.js
 ```

-> **Note:** Claude Code uses the Anthropic Messages API format. It does not support OpenAI-compatible endpoints (Ollama, LM Studio, etc.) directly — you would need an API format translator proxy like [litellm](https://github.com/BerriAI/litellm) in front.
+### With Local Models (Ollama, LM Studio)
+
+Claude Code uses the Anthropic Messages API format. To use local models, run [litellm](https://github.com/BerriAI/litellm) as a translation proxy:
+
+```bash
+# Terminal 1: Start litellm proxy
+pip install litellm
+litellm --model ollama/llama3.1:8b --port 8080
+
+# Terminal 2: Point Claude Code at the proxy
+export ANTHROPIC_BASE_URL=http://localhost:8080
+export ANTHROPIC_API_KEY=not-needed
+node cli.js
+```
+
+Works with any model Ollama supports — llama3.1, codellama, deepseek-coder, mistral, etc.
+
+### With OpenAI / GPT Models
+
+```bash
+# Via litellm proxy
+litellm --model openai/gpt-4o --port 8080
+
+# Or any OpenAI-compatible endpoint (Codex, GPT-5.4, etc.)
+litellm --model openai/o3 --port 8080
+```

 ---