Rate Limits
If you require higher limits, please contact us at support@caesar.xyz.
Scope: These limits apply to research job creation endpoints:
POST /research(native API)POST /compat/completions(OpenAI-compatible)POST /compat/chat/completions(OpenAI-compatible)
The Caesar API has the following limits:
- Concurrent research jobs: 5 per account (in-progress jobs, including
queued). - Monthly reasoning budget: 200 reasoning loops, counted only for successful research jobs.
Billing is based on actual usage. You’re charged for reasoning_loops_consumed, not the reasoning_loops you request. If you set reasoning_loops: 5 but the job completes in 2 loops (via early exit or sufficient information), you’re only billed for 2 loops.
Exceeding limits
When a limit is exceeded the API returns 429 Too Many Requests:
Recommended client behaviour
- Retry with exponential back-off.
- Serialise new jobs when you’re at the concurrency cap.
- Use
reasoning_loopsthoughtfully to stay within your monthly budget.
What is a reasoning loop?
A reasoning loop represents one iteration of Caesar’s research process: gathering information, analyzing findings, and refining the response. Your monthly usage is the sum of reasoning_loops_consumed by jobs that reach the completed status, not the reasoning_loops requested.
What counts as ‘concurrent’?
Any job that is not yet completed or failed (e.g. queued, searching, summarising, analysing) counts towards your concurrency limit.
When do monthly limits reset?
Limits reset on a monthly cadence; your dashboard shows your remaining balance and the next reset window.
Can I request higher limits?
Yes. Email support@caesar.xyz with your expected volumes and use case.
Where can I see my remaining quota?
Visit your dashboard after signing in.