
Pricing & Wallet

Insighto.ai charges in two layers:

  1. A monthly plan with a generous allowance of every metered resource — queries, voice seconds, bots, and stored words.
  2. A pay-as-you-go Wallet that kicks in once any of those allowances is exhausted, billed per actual unit of usage.

You can also Bring Your Own Keys (BYOK) for LLMs and TTS — when you do, those line items drop out of the wallet equation because Insighto is calling your provider account directly.

This page covers the per-minute and per-query rates. For how to manage your wallet, see Billing & Wallet.

Monthly plan allowances

Each plan ships with a fixed bundle that resets on your billing-cycle anniversary:

Allowance | What it counts
Queries | One text query (one user turn → assistant turn).
Voice seconds | One second of voice call duration.
Bots | Maximum number of assistants you can have configured.
Words | Total indexed words across all data sources.
Live agent seats | Live-agent inbox seats (add-on).
Concurrent calls | Maximum number of simultaneous voice calls.

Plus binary feature flags: BYOK, Workflows, Calendar Pools, Custom voice, Forms, Custom tools, SIP, Human agents, Campaigns, Agency, White-label, Premium support.

Your live usage against each allowance is shown on the Usage page.

Wallet (pay-as-you-go)

Once any allowance is exhausted, the Wallet takes over at the rates below. The Wallet is prepaid — top it up in any amount from $5 upward at Settings → Billing & Wallet. Quick tiles: $10, $25, $50, $100.
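The handover from plan allowance to Wallet can be sketched as follows. This is an illustration only, not Insighto's billing code: the function names are hypothetical, and it assumes that a call straddling the allowance boundary is split per second and that overage is prorated from the per-minute rate. The page itself only states that the Wallet is billed per actual unit of usage.

```python
def split_voice_usage(call_seconds: int, allowance_seconds_left: int) -> tuple[int, int]:
    """Return (seconds covered by the plan, seconds billed to the Wallet).

    Assumption: the plan allowance is consumed first and only the
    remainder of the call falls through to the Wallet.
    """
    covered = min(call_seconds, max(allowance_seconds_left, 0))
    return covered, call_seconds - covered


def wallet_charge_cents(billable_seconds: int, rate_cents_per_min: float) -> float:
    """Cost in cents for the Wallet-billed part of a call.

    Assumption: the per-minute rate is prorated per second.
    """
    return billable_seconds * rate_cents_per_min / 60


# A 300-second call with 120 allowance seconds left, at the 6 cents/min baseline.
covered, billable = split_voice_usage(300, 120)
charge = wallet_charge_cents(billable, 6)
print(covered, billable, charge)
```

Under these assumptions, 120 seconds come out of the plan and the remaining 180 seconds are charged to the Wallet.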

Voice — component breakdown

Every voice minute is the sum of four services. Base rates:

Component | Base rate
Transcription (speech → text) | 1¢ / min
LLM (text → response) | 1¢ / min
Voice / TTS (response → audio) | 2¢ / min
Platform overhead | 2¢ / min
Total (baseline) | 6¢ / min

The baseline assumes a regular text LLM (gpt-4o-mini class, 1× multiplier) and Azure Voice (1× TTS multiplier). Switching the LLM or TTS provider multiplies the relevant line item.

TTS multipliers

TTS provider | Voice multiplier
Azure Speech | 1×
Cartesia | 1.75×
ElevenLabs | 2.5×
PlayHT | 2.5×
Custom-cloned voice (ElevenLabs) | 4×

LLM multipliers (voice)

LLM family | LLM multiplier
gpt-4o-mini, gpt-5.4-mini, gpt-3.5-*, deepseek-*, Llama 3.1 70B | 1×
o3-mini | 2×
gpt-4o (not -mini) | 10×
Anything else (older gpt-4-*, Claude, etc.) | 20×

Voice — pricing matrix

Configuration | Multipliers (STT · LLM · TTS · Platform) | Cost breakdown (¢) | Total (¢ / min)
Regular LLM + Azure Voice | 1 · 1 · 1 · 1 | 1 + 1 + 2 + 2 | 6
Regular LLM + Cartesia | 1 · 1 · 1.75 · 1 | 1 + 1 + 3.5 + 2 | 7.5
Regular LLM + ElevenLabs | 1 · 1 · 2.5 · 1 | 1 + 1 + 5 + 2 | 9
Regular LLM + PlayHT | 1 · 1 · 2.5 · 1 | 1 + 1 + 5 + 2 | 9
Regular LLM + custom-cloned voice | 1 · 1 · 4 · 1 | 1 + 1 + 8 + 2 | 12
Regular LLM + your TTS key (BYOK) | 1 · 1 · 0 · 1 | 1 + 1 + 0 + 2 | 4
o3-mini + Azure Voice | 1 · 2 · 1 · 1 | 1 + 2 + 2 + 2 | 7
o3-mini + ElevenLabs | 1 · 2 · 2.5 · 1 | 1 + 2 + 5 + 2 | 10
OpenAI Realtime (gpt-4o-mini-realtime-preview) | 0 · 10 · 0 · 1 | 0 + 10 + 0 + 2 | 12
OpenAI Realtime (gpt-4o-realtime-preview) | 0 · 46 · 0 · 1 | 0 + 46 + 0 + 2 | 48

Notes:

  • BYOK voice is a hard zero. When your assistant's TTS voice comes from your own provider connection, the voice line item is multiplied by 0. The LLM line is not auto-zeroed by BYOK — that depends on whether the platform-managed OpenAI key or your own OpenAI key ran the turn.
  • Realtime models bundle STT + TTS into the LLM cost. When the assistant type is Realtime OpenAI, the platform doesn't run separate transcription or TTS — the LLM line absorbs all three.
  • Custom-cloned voices cost 4× the base TTS rate.
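The rows of the pricing matrix can be reproduced with a short calculation. The sketch below is not Insighto's billing code: the constant and function names are hypothetical, and only the base rates and multipliers are taken from the tables on this page.

```python
# Base rates in cents per minute, from the component breakdown table.
BASE_CENTS_PER_MIN = {"stt": 1.0, "llm": 1.0, "tts": 2.0, "platform": 2.0}


def voice_rate_cents_per_min(
    stt_mult: float = 1.0,
    llm_mult: float = 1.0,
    tts_mult: float = 1.0,
    platform_mult: float = 1.0,
) -> float:
    """Sum the four components, each scaled by its multiplier."""
    multipliers = {
        "stt": stt_mult,
        "llm": llm_mult,
        "tts": tts_mult,
        "platform": platform_mult,
    }
    return sum(rate * multipliers[name] for name, rate in BASE_CENTS_PER_MIN.items())


baseline = voice_rate_cents_per_min()                               # regular LLM + Azure Voice
o3_elevenlabs = voice_rate_cents_per_min(llm_mult=2, tts_mult=2.5)  # o3-mini + ElevenLabs
byok_tts = voice_rate_cents_per_min(tts_mult=0)                     # your own TTS key
realtime_4o = voice_rate_cents_per_min(stt_mult=0, llm_mult=46, tts_mult=0)
print(baseline, o3_elevenlabs, byok_tts, realtime_4o)
```

Multiplying the resulting rate by the call length in minutes gives the deduction in cents, so the Realtime rate applied to a 5-minute call yields 240¢.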

Chat — per-query rates

Each text query (one user turn → assistant turn) is billed at 1.5¢ × llm_multiplier:

LLM model | Multiplier | Cost per query
gpt-4o-mini, gpt-5.4-mini, gpt-3.5-*, deepseek-* | 1× | 1.5¢
o3-mini | 2× | 3.0¢
gpt-4o (not -mini) | 10× | 15¢
Anything else (gpt-4, gpt-4-turbo, Claude, etc.) | 20× | 30¢
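As a rough illustration of the per-query formula, the sketch below maps a model name to its multiplier and computes the charge in cents. The names are hypothetical and the matching rules are an assumption (exact names plus the `*` wildcards shown in the table, first match wins); how Insighto actually resolves model names is not documented here.

```python
from fnmatch import fnmatchcase

CHAT_BASE_CENTS = 1.5

# (pattern, multiplier) pairs transcribed from the chat table; first match wins.
CHAT_MULTIPLIERS = [
    ("gpt-4o-mini", 1),
    ("gpt-5.4-mini", 1),
    ("gpt-3.5-*", 1),
    ("deepseek-*", 1),
    ("o3-mini", 2),
    ("gpt-4o", 10),
]
DEFAULT_MULTIPLIER = 20  # "Anything else" row


def chat_multiplier(model: str) -> int:
    """Look up the LLM multiplier for a model name (assumed matching rules)."""
    for pattern, multiplier in CHAT_MULTIPLIERS:
        if fnmatchcase(model, pattern):
            return multiplier
    return DEFAULT_MULTIPLIER


def chat_cost_cents(model: str, queries: int = 1) -> float:
    """Cost in cents: queries x 1.5 cents x llm_multiplier."""
    return queries * CHAT_BASE_CENTS * chat_multiplier(model)


print(chat_cost_cents("gpt-4o-mini"))   # one query on a 1x model
print(chat_cost_cents("gpt-4o", 100))   # 100 queries on gpt-4o, in cents
```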

Pick the LLM in Build → Assistants → [your assistant] → Model. gpt-4o-mini is usually the sweet spot — strong reasoning at 1.5¢ per query.

Other usage

Resource | Rate
Stored words (data sources) | Included up to plan limit; overage charged per 100k words / month
Outbound SMS | $0.015 / message (US baseline); varies by destination
Outbound voice (campaigns) | Same per-minute rates as inbound + telephony toll
Telephony (Twilio / Plivo / Telnyx) | Pass-through at provider rates

Example calculations

A 5-minute voice call with regular LLM + Azure Voice:

5 min × 6¢ = 30¢ deducted from wallet

A 10-minute voice call with o3-mini + ElevenLabs:

10 min × 10¢ = $1.00 deducted from wallet

100 chat queries on gpt-4o:

100 × 15¢ = $15.00 deducted from wallet

A 5-minute OpenAI Realtime call (gpt-4o-realtime-preview):

5 min × 48¢ = $2.40 deducted from wallet

Every per-call deduction is itemised on the conversation record — the billing details JSON stores the exact component costs and multipliers used. You can read these back via the conversation API.

BYOK — Bring Your Own Keys

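If you've configured BYOK Credentials (Settings → BYOK Credentials), Insighto calls your provider account for the supported providers, and the line items those keys cover drop out of the bill. When BYOK is fully on and the assistant uses only covered providers, the conversation is marked billed_to = byok and neither plan credits nor the wallet is touched.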

BYOK key slots:

  • OpenAI — used for LLM (chat completions, embeddings) and the Realtime API.
  • ElevenLabs — used for TTS when an ElevenLabs voice is selected.
  • Azure Speech — used for STT and TTS when an Azure voice / STT is selected.

BYOK does not cover:

  • Deepgram STT, Cartesia TTS, PlayHT TTS, Anthropic — no BYOK slot today.
  • Telephony (Twilio / Plivo / Telnyx) — always carrier-billed.
  • Platform overhead (2¢/min on voice) — kept by us for orchestration.
  • Add-on features (Workflows, Calendar Pools, etc.) — gated by plan tier, not wallet.

When BYOK is fully on (master toggle on, all three slots populated) and the assistant uses only providers covered by your keys, the runtime sets deduct_credits = false — the wallet is untouched, the plan credits are untouched, and the conversation is filed as billed_to = byok.
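The condition above can be expressed as a small predicate. This is a sketch under stated assumptions, not the platform's runtime: the slot identifiers, the `ByokSettings` class, and the function name are all hypothetical, and only the rule itself (master toggle on, all three slots populated, only covered providers in use) comes from this page.

```python
from dataclasses import dataclass

# Hypothetical identifiers for the three BYOK key slots listed above.
BYOK_SLOTS = ("openai", "elevenlabs", "azure_speech")


@dataclass
class ByokSettings:
    enabled: bool          # master toggle
    keys: dict[str, str]   # slot name -> API key ("" or missing means unset)


def should_deduct_credits(settings: ByokSettings, providers_used: set[str]) -> bool:
    """Return False only when BYOK fully covers the conversation."""
    fully_on = settings.enabled and all(settings.keys.get(slot) for slot in BYOK_SLOTS)
    all_covered = providers_used <= set(BYOK_SLOTS)
    return not (fully_on and all_covered)


full = ByokSettings(enabled=True, keys={"openai": "k1", "elevenlabs": "k2", "azure_speech": "k3"})
print(should_deduct_credits(full, {"openai", "elevenlabs"}))  # covered providers only
print(should_deduct_credits(full, {"openai", "deepgram"}))    # Deepgram has no BYOK slot
```

In this sketch, any provider without a BYOK slot (such as Deepgram STT or Cartesia TTS) sends the conversation back to normal plan and wallet billing.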

→ Set it up: BYOK Credentials

Auto-recharge

The wallet has auto-recharge fields (threshold, amount, enabled) wired into the schema, but no UI drives them for direct customers today. Top up manually for now.

Agency rebilling

If you're an agency, you set your own plan + wallet pricing for your subaccounts, with optional cost multipliers on top of Insighto's base rates. See Rebilling.
