LLM Configuration
Select and configure the language model for the phone assistant
Available Models
Claude Opus 4
Most capable model. Best for complex reasoning, nuanced conversations, and high-stakes interactions.
Claude Sonnet 4
RecommendedBalanced performance and speed. Ideal for real-time phone conversations with good quality.
Claude Haiku 4
Fastest model. Best for high-volume, low-latency phone calls where speed matters most.
Qwen 2.5 72B
Free open-source model via OpenRouter. Good multilingual support including German.
Gemini 2.0 Flash
Google's fast model via OpenRouter free tier. Good for straightforward conversations.
Note: Model selection and system prompt changes are currently display-only. To change the active model, update the ASSISTANT_MODEL environment variable in the assistant service configuration and restart.
A future update will enable live model switching and prompt persistence from this page.