AI Models¶

Configure language models, speech models, embeddings, and batch processing for your AI agents.

Overview¶

The AI Models section allows you to configure all AI models used across your workspace. Add custom API keys, adjust model parameters, and manage access to different model providers.

Model Categories¶

Language Models

Configure GPT-4, Claude, Gemini, and other LLMs for conversational AI.

Configure LLMs
:material-batch-processing:{ .lg .middle } Batch Models

Set up batch processing for high-volume async AI tasks.

Batch Processing
ASR Models

Configure Automatic Speech Recognition (speech-to-text) models.

ASR Setup
TTS Models

Set up Text-to-Speech voices for AI agent responses.

TTS Voices
:material-phone-voice:{ .lg .middle } Realtime Voice Models

Configure real-time voice conversation models for phone/SIP.

Realtime Voice
Embeddings

Manage embedding models for RAG and semantic search.

Embeddings

Supported Providers¶

Language Models¶

OpenAI (GPT-4, GPT-4 Turbo, GPT-3.5)
Anthropic (Claude 3 Opus, Sonnet, Haiku)
Google (Gemini Pro, Gemini Ultra)
Meta (Llama 3, Llama 2)
Custom OpenAI-compatible endpoints

Speech Models¶

OpenAI Whisper (ASR)
Google Speech-to-Text
Azure Speech Services
ElevenLabs (TTS)
OpenAI TTS
Azure Neural Voices

Embeddings¶

OpenAI Embeddings (text-embedding-3-small/large)
Cohere Embeddings
Custom embedding endpoints

My Agents - Use configured models in agents
Multi-LLM Setup - Route between multiple models
Security > Credentials - Store API keys securely