Pay only for what you use.
Insighto.ai gives you a wallet that you can top up with credits whenever you need.
Each wallet comes preloaded with $10 worth of credits for FREE!
Your wallet is charged as follows:
- Regular Voice (Azure): 6¢ ($0.06) per minute
- Regular Chatbot: 1.5¢ ($0.015) per customer's query
The above rates are based on the base LLM (gpt-3.5-turbo) and regular voices (Azure).
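As a rough illustration of what the free credits cover, the arithmetic can be sketched as below. It assumes the whole $10 is spent on a single service at the base rates listed above; the variable names are illustrative only and not part of any Insighto.ai API.

```python
# Free credits preloaded in each wallet, converted to cents.
FREE_CREDIT_CENTS = 10 * 100

# Base rates from the list above (cents).
VOICE_RATE_CENTS_PER_MINUTE = 6
CHATBOT_RATE_CENTS_PER_QUERY = 1.5

# Assumption: the whole balance goes to one service only.
voice_minutes = FREE_CREDIT_CENTS / VOICE_RATE_CENTS_PER_MINUTE
chat_queries = int(FREE_CREDIT_CENTS // CHATBOT_RATE_CENTS_PER_QUERY)

print(f"Voice only: about {voice_minutes:.1f} minutes")
print(f"Chatbot only: {chat_queries} whole queries")
```

At the base rates this works out to roughly 166.7 voice minutes or 666 chatbot queries. Assistants that use components with higher multipliers will consume the credits faster.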
Voice Components:
The voice service consists of four components. Their standard rates are:
Component | Cost per minute (cents) |
---|---|
Transcription | 1 |
LLM | 1 |
Voice | 2 |
Platform (Insighto.ai) | 2 |
Total | 6 |
Each component has a multiplier. If your Assistant uses a component with a multiplier other than 1, only that component's rate is scaled by the multiplier; the other components keep their standard rates.
Example:
Case | Multipliers (Transcription, LLM, Voice, Platform) | Cost per component (cents per minute) | Total Cost (cents per minute) |
---|---|---|---|
Regular LLM with Azure Voice | 1, 1, 1, 1 | 1+1+2+2 | 6 |
Regular LLM with ElevenLabs Voice | 1, 1, 2.5, 1 | 1+1+5+2 | 9 |
Regular LLM with Cartesia Voice | 1, 1, 1.75, 1 | 1+1+3.5+2 | 7.5 |
o3-mini LLM with ElevenLabs Voice | 1, 2, 2.5, 1 | 1+2+5+2 | 10 |
o3-mini LLM with Azure Voice | 1, 2, 1, 1 | 1+2+2+2 | 7 |
OpenAI Realtime (4o-mini) | 0, 10, 0, 1 | 0+10+0+2 | 12 |
OpenAI Realtime (4o) | 0, 46, 0, 1 | 0+46+0+2 | 48 |
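The rows in the table above can be reproduced with a short calculation. The following is a minimal sketch, not Insighto.ai's billing code: the function name and dictionary keys are hypothetical, and only the base rates and multipliers come from the tables in this article.

```python
# Standard per-minute rates in cents, from the component table.
BASE_RATES_CENTS = {
    "transcription": 1.0,
    "llm": 1.0,
    "voice": 2.0,
    "platform": 2.0,
}


def voice_rate_cents_per_minute(multipliers):
    """Return the per-minute voice rate in cents.

    `multipliers` maps a component name to its multiplier. Components
    that are not listed default to 1, i.e. the standard rate.
    """
    return sum(
        base * multipliers.get(component, 1.0)
        for component, base in BASE_RATES_CENTS.items()
    )


# Regular LLM with ElevenLabs Voice: 1 + 1 + 5 + 2
print(voice_rate_cents_per_minute({"voice": 2.5}))  # 9.0

# OpenAI Realtime (4o-mini): 0 + 10 + 0 + 2
print(voice_rate_cents_per_minute({"transcription": 0, "llm": 10, "voice": 0}))  # 12.0
```

Each multiplier scales only its own component, so changing the voice provider leaves the transcription, LLM, and platform charges untouched.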
Voice consumption is measured per second and prorated accordingly.
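To illustrate the proration, here is a small sketch that assumes a simple linear split of the per-minute rate across 60 seconds. The article does not state how fractions of a cent are rounded, so no rounding is applied; the function name is hypothetical.

```python
def prorated_voice_cost_cents(rate_cents_per_minute, duration_seconds):
    """Return the cost in cents of a call billed per second.

    Assumption: the per-minute rate is divided evenly over 60 seconds,
    with no rounding applied.
    """
    return rate_cents_per_minute * duration_seconds / 60


# A 90-second call at the regular rate of 6 cents per minute.
print(prorated_voice_cost_cents(6, 90))  # 9.0

# A 30-second call with ElevenLabs Voice at 9 cents per minute.
print(prorated_voice_cost_cents(9, 30))  # 4.5
```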
Chatbot Components:
Chatbots are powered by the LLM only. The LLM multiplier is applied to the base rate of 1.5 cents to find the cost per customer's query.
Example:
LLM | Multiplier | Cost per customer's query (in cents) |
---|---|---|
gpt-3.5-turbo | 1 | 1.5 |
o3-mini | 2 | 3.0 |
gpt-4 | 20 | 30 |
gpt-4o | 10 | 15 |
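The chatbot table above follows from multiplying the base rate by the LLM multiplier. A minimal sketch of that calculation is shown below; the names are illustrative, and the multipliers are copied from the table rather than read from any Insighto.ai API.

```python
# Base chatbot rate in cents per customer's query (gpt-3.5-turbo).
BASE_QUERY_COST_CENTS = 1.5

# LLM multipliers from the table above.
LLM_MULTIPLIERS = {
    "gpt-3.5-turbo": 1,
    "o3-mini": 2,
    "gpt-4": 20,
    "gpt-4o": 10,
}


def chatbot_cost_cents(model, queries=1):
    """Return the cost in cents of `queries` customer queries on `model`."""
    return BASE_QUERY_COST_CENTS * LLM_MULTIPLIERS[model] * queries


print(chatbot_cost_cents("o3-mini"))      # 3.0
print(chatbot_cost_cents("gpt-4o", 200))  # 3000.0 cents, i.e. $30
```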