Limits

The Limits tab lets you control usage and costs for your chatbot.

Navigate to: Inqyra > Configuration > Limits tab

Figure: The Limits tab, showing token, cost, and rate limit settings.

Available Limits

| Setting | Range | Default | Description |
|---|---|---|---|
| Max Tokens per Response | 100–32,000 | 4,096 | Maximum number of tokens the AI can use in a single response. Higher values allow longer answers but cost more. |
| Daily Token Limit | 0 = unlimited | 0 | Maximum total tokens (input + output) the chatbot can use per day across all conversations. |
| Daily Cost Limit (USD) | 0 = unlimited | 0 | Maximum amount in USD the chatbot can spend per day. When reached, the widget stops responding. |
| Hourly Request Limit | 0 = unlimited | 0 | Maximum number of chat messages per hour across all visitors. |
| Archive After (days) | | 30 | Number of days after which conversations are automatically archived. |

Understanding Tokens

Tokens are the units AI providers use to measure text. Roughly:

  • 1 token = about 4 characters or ¾ of a word in English
  • A typical user question = 20–50 tokens
  • A typical chatbot response = 100–500 tokens
  • Each message uses both input tokens (question + context) and output tokens (response)
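
To make the rule of thumb above concrete, here is a minimal Python sketch that estimates a token count from character length. The function name `estimate_tokens` and the constant `CHARS_PER_TOKEN` are illustrative assumptions, not part of Inqyra. Actual counts depend on the AI provider's tokenizer, so treat the result as a rough guide only.

```python
import math

# Rough English average taken from the rule of thumb above (an approximation,
# not the provider's real tokenizer).
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Return a rough token estimate: about one token per 4 characters."""
    if not text:
        return 0
    return math.ceil(len(text) / CHARS_PER_TOKEN)


if __name__ == "__main__":
    question = "What are your opening hours?"
    print(estimate_tokens(question))  # 28 characters -> 7
```

An estimate like this is only useful for sizing limits in advance. For actual consumption, rely on the usage figures shown on the Dashboard.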

Cost Control Strategy

For sites with moderate traffic, recommended settings are:

| Setting | Low Budget | Medium Budget | High Budget |
|---|---|---|---|
| Daily Token Limit | 50,000 | 200,000 | Unlimited |
| Daily Cost Limit | $1.00 | $5.00 | Unlimited |
| Max Tokens per Response | 1,024 | 4,096 | 8,192 |
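
To judge whether a given Daily Token Limit fits your traffic, it can help to translate it into a number of chat exchanges. The sketch below does that arithmetic in Python. The function name `exchanges_per_day` and the averages of 300 input and 300 output tokens per exchange are assumptions chosen for illustration. Your real averages depend on how much context is sent with each question.

```python
from typing import Optional


def exchanges_per_day(
    daily_token_limit: int,
    avg_input_tokens: int,
    avg_output_tokens: int,
) -> Optional[int]:
    """Estimate how many question/answer exchanges fit in a daily token limit.

    Returns None when the limit is 0, which means unlimited.
    """
    if daily_token_limit == 0:
        return None
    tokens_per_exchange = avg_input_tokens + avg_output_tokens
    if tokens_per_exchange <= 0:
        raise ValueError("average tokens per exchange must be positive")
    # Whole exchanges only: a partial exchange would exceed the limit.
    return daily_token_limit // tokens_per_exchange


if __name__ == "__main__":
    # Assumed averages: 300 input tokens (question + context), 300 output tokens.
    print(exchanges_per_day(50_000, 300, 300))   # 83
    print(exchanges_per_day(200_000, 300, 300))  # 333
    print(exchanges_per_day(0, 300, 300))        # None (unlimited)
```

Under these assumed averages, the Low Budget token limit covers roughly 83 exchanges per day and the Medium Budget limit roughly 333.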

What happens when a limit is reached

When the daily token limit or cost limit is reached, the chat widget will stop responding to visitors for the remainder of the day. The widget remains visible but displays an error message. Limits reset at midnight (server time).
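
The behavior described above can be pictured with a small Python model. This is a sketch for illustration only, not Inqyra's actual implementation: the class names `DailyLimits` and `DailyUsageTracker` are hypothetical, and it assumes that a value of 0 means unlimited and that counters reset when the server's calendar date changes.

```python
from dataclasses import dataclass
from datetime import date, datetime
from typing import Optional


@dataclass
class DailyLimits:
    daily_token_limit: int = 0          # 0 = unlimited
    daily_cost_limit_usd: float = 0.0   # 0 = unlimited


class DailyUsageTracker:
    """Hypothetical model of daily limits that reset at midnight (server time)."""

    def __init__(self, limits: DailyLimits) -> None:
        self.limits = limits
        self.day: Optional[date] = None
        self.tokens_used = 0
        self.cost_used_usd = 0.0

    def _roll_over(self, now: datetime) -> None:
        # Reset the counters the first time we see a new calendar date.
        if self.day != now.date():
            self.day = now.date()
            self.tokens_used = 0
            self.cost_used_usd = 0.0

    def record(self, tokens: int, cost_usd: float, now: datetime) -> None:
        """Add the usage of one exchange to today's totals."""
        self._roll_over(now)
        self.tokens_used += tokens
        self.cost_used_usd += cost_usd

    def limit_reached(self, now: datetime) -> bool:
        """Return True if either daily limit has been reached today."""
        self._roll_over(now)
        token_hit = (
            self.limits.daily_token_limit > 0
            and self.tokens_used >= self.limits.daily_token_limit
        )
        cost_hit = (
            self.limits.daily_cost_limit_usd > 0
            and self.cost_used_usd >= self.limits.daily_cost_limit_usd
        )
        return token_hit or cost_hit
```

In this model, once `limit_reached` returns True it stays True for the rest of that date and returns False again on the first check after midnight, which mirrors the reset described above.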

Tip

Monitor your usage on the Dashboard to find the right balance between availability and cost. Start with conservative limits and increase them as you understand your traffic patterns.