Create assistant
Assistants
Create assistant
Create a new AI assistant with specified configuration
POST
Create assistant
This endpoint allows you to create a new AI assistant with comprehensive configuration options.Documentation Index
Fetch the complete documentation index at: https://docs.aiployees.com/llms.txt
Use this file to discover all available pages before exploring further.
Engine Modes
The API supports three engine modes, each with different capabilities:| Mode | Description | Required Fields |
|---|---|---|
pipeline | Traditional STT → LLM → TTS pipeline | llm_model_id |
multimodal | Real-time multimodal AI | multimodal_model_id |
dualplex | Multimodal brain + custom TTS voice | multimodal_model_id |
Request Body
Core Required Fields
The name of the assistant (max 255 characters)
The voice ID to use for the assistant. Use the Get Voices endpoint with the
mode parameter to get compatible voices for your engine mode.The language ID for the assistant. Use the Get Languages endpoint to get available languages.
The assistant type. Options:
inbound, outboundThe engine mode. Options:
pipeline, multimodal, dualplexThe timezone for the assistant (e.g., “Europe/Bucharest”, “America/New_York”)
The initial message the assistant will speak when the call starts (max 200 characters)
The system prompt that defines the assistant’s behavior and personality
Mode-Specific Fields
The LLM model ID to use. Required for
pipeline mode.Use the Get Models endpoint to get available models.The multimodal model ID. Required for
multimodal and dualplex modes.Use the Get Models endpoint to get available multimodal models.Fallback LLM model ID for tool calls in multimodal/dualplex modes. Optional.
Turn detection sensitivity for multimodal/dualplex modes (0-1). Default: auto
Secondary Languages
Array of additional language IDs the assistant can speak. The assistant will auto-detect and switch languages.
Knowledgebase Settings
The knowledgebase ID to attach to this assistant
How to use the knowledgebase. Options:
function_call- AI calls a function to search (required for multimodal/dualplex)prompt- Knowledge is injected into prompt (pipeline only)
Phone Number
The ID of a phone number to assign to the assistant. Must belong to your account.
Custom Mid-Call Tools
Array of custom mid-call tool IDs to attach. Each tool must belong to your account.
Built-in Tools
Array of built-in tools to enable. Each tool has a
type and tool-specific fields.Voice and TTS Settings
Whether to enable emotional text-to-speech synthesis
Voice stability setting (0-1). Higher = more consistent voice
Voice similarity setting (0-1). Higher = closer to original voice
Speech speed multiplier (0.7-1.2)
LLM temperature setting (0-1). Lower = more deterministic
Custom TTS provider ID. Auto-selected based on language if not provided. Use the Get Synthesizer Providers endpoint to discover available providers.
Custom STT provider ID. Auto-selected based on language if not provided. Pipeline mode only. Use the Get Transcriber Providers endpoint to discover available providers.
Call Behavior Settings
Whether to allow caller interruptions.
Whether to use filler audio during processing (e.g., “um”, “let me check”).
Custom filler word profiles per category. If not provided, defaults are set based on the assistant’s language. Each category is an array of short phrases.
positive: Filler words for positive/affirmative responses (e.g., “Great!”, “Perfect!”)negative: Filler words for negative/neutral responses (e.g., “Hmm.”, “Mhm.”)question: Filler words when processing a question (e.g., “Hmm.”, “Let me think.”)neutral: Filler words for neutral acknowledgments (e.g., “Ok.”, “I understand.”)
Whether to record the call
Whether to enable noise cancellation
If true, the assistant waits for the customer to speak first
Timing Settings
Maximum call duration in seconds (20-1200)
Maximum silence duration before re-engagement in seconds (1-360)
Maximum silence at call start before ending (1-120 seconds). Optional.
Maximum ringing time before giving up (1-60 seconds)
Re-engagement Settings
Re-engagement interval in seconds (7-600)
Custom prompt for re-engagement messages (max 1000 characters)Example:
"Are you still there? Do you have any other questions?"Voicemail Settings
Whether to end call when voicemail is detected
Message to leave on voicemail before hanging up (max 1000 characters)
Endpoint Detection
Voice activity detection type. Options:
vad, aiEndpoint sensitivity level (0-5)
Interrupt sensitivity level (0-5)
Minimum words before interruption is allowed (0-10). Set to enable.
Ambient Sound
Background ambient sound. Options:
off, office, city, forest, crowded_room, cafe, natureAmbient sound volume level (0-1)
Webhook Configuration
Whether webhook notifications are enabled
The webhook URL for post-call notifications. Required if
is_webhook_active is true.Whether to send webhooks only on completed calls (not failed/no-answer)
Whether to include recording URL in webhook payload
Post-Call Evaluation
Whether to enable AI post-call evaluation
Schema definition for post-call data extraction
Variables
Key-value pairs of custom variables accessible in prompts via
{{variable_name}}Conversation Ended Settings
Minutes of chat inactivity before the conversation is considered ended (1-1440)
Whether to allow re-triggering the conversation after it ends due to inactivity
Webhook URL called when a chat conversation ends due to inactivity. Separate from the main call webhook.
Example Requests
Pipeline Mode Assistant
Multimodal Mode Assistant
Dualplex Mode Assistant
Response
Success message confirming assistant creation
Notes
- All required fields must be provided for successful assistant creation
- Use the Get Voices endpoint with
modeparameter to get compatible voices - For multimodal/dualplex modes,
knowledgebase_modemust befunction_call - For multimodal/dualplex modes,
allow_interruptionsis always enabled - Fillers are only available in pipeline mode
- The assistant is created with
inactivestatus by default

