Limits¶
The Limits tab lets you control usage and costs for your chatbot.
Navigate to: Inqyra > Configuration > Limits tab
The Limits tab for controlling usage and costs
Available Limits¶
| Setting | Range | Default | Description |
|---|---|---|---|
| Max Tokens per Response | 100–32,000 | 4,096 | Maximum number of tokens the AI can use in a single response. Higher values allow longer answers but cost more |
| Daily Token Limit | 0 = unlimited | 0 | Maximum total tokens (input + output) the chatbot can use per day across all conversations |
| Daily Cost Limit (USD) | 0 = unlimited | 0 | Maximum amount in USD the chatbot can spend per day. When reached, the widget stops responding |
| Hourly Request Limit | 0 = unlimited | 0 | Maximum number of chat messages per hour across all visitors |
| Archive After (days) | — | 30 | Number of days after which conversations are automatically archived |
Understanding Tokens¶
Tokens are the units AI providers use to measure text. Roughly:
- 1 token = about 4 characters or ¾ of a word in English
- A typical user question = 20–50 tokens
- A typical chatbot response = 100–500 tokens
- Each message uses both input tokens (question + context) and output tokens (response)
Cost Control Strategy¶
For sites with moderate traffic, recommended settings are:
| Setting | Low Budget | Medium Budget | High Budget |
|---|---|---|---|
| Daily Token Limit | 50,000 | 200,000 | Unlimited |
| Daily Cost Limit | $1.00 | $5.00 | Unlimited |
| Max Tokens per Response | 1,024 | 4,096 | 8,192 |
What happens when a limit is reached
When the daily token limit or cost limit is reached, the chat widget will stop responding to visitors for the remainder of the day. The widget remains visible but displays an error message. Limits reset at midnight (server time).
Tip
Monitor your usage on the Dashboard to find the right balance between availability and cost. Start with conservative limits and increase them as you understand your traffic patterns.