Skip to content

AI Models

Configure language models, speech models, embeddings, and batch processing for your AI agents.


Overview

The AI Models section allows you to configure all AI models used across your workspace. Add custom API keys, adjust model parameters, and manage access to different model providers.


Model Categories

  • Language Models


    Configure GPT-4, Claude, Gemini, and other LLMs for conversational AI.

    Configure LLMs

  • :material-batch-processing:{ .lg .middle } Batch Models


    Set up batch processing for high-volume async AI tasks.

    Batch Processing

  • ASR Models


    Configure Automatic Speech Recognition (speech-to-text) models.

    ASR Setup

  • TTS Models


    Set up Text-to-Speech voices for AI agent responses.

    TTS Voices

  • :material-phone-voice:{ .lg .middle } Realtime Voice Models


    Configure real-time voice conversation models for phone/SIP.

    Realtime Voice

  • Embeddings


    Manage embedding models for RAG and semantic search.

    Embeddings


Supported Providers

Language Models

  • OpenAI (GPT-4, GPT-4 Turbo, GPT-3.5)
  • Anthropic (Claude 3 Opus, Sonnet, Haiku)
  • Google (Gemini Pro, Gemini Ultra)
  • Meta (Llama 3, Llama 2)
  • Custom OpenAI-compatible endpoints

Speech Models

  • OpenAI Whisper (ASR)
  • Google Speech-to-Text
  • Azure Speech Services
  • ElevenLabs (TTS)
  • OpenAI TTS
  • Azure Neural Voices

Embeddings

  • OpenAI Embeddings (text-embedding-3-small/large)
  • Cohere Embeddings
  • Custom embedding endpoints