For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
Logo
DocumentationAPI ReferenceChangelog
DocumentationAPI ReferenceChangelog
  • Get Started
    • Introduction
    • Authentication
    • Rate Limits
    • Pricing
  • Integration
    • MCP Server
    • SDKs
    • OpenAI Compatibility
    • MCP Docs Server
    • llms.txt
    • x402
  • Guides
    • Research Parameters
    • Research Chat
    • Brainstorming
    • Model Selection
    • Prompt Guidance
    • Versioning
  • Reference
    • Errors
    • Pagination
On this page
  • Supported models
  • When to use each model
  • Example
  • Quick picker
  • Learn more
Guides

Model Selection

Pick the best model for research synthesis
Built with

The model parameter on POST /research lets you choose which LLM synthesizes the final answer. If you omit it, Caesar automatically selects a model based on your query.

Start by omitting model to let Caesar auto-select. Set model when you need consistent output, a specific latency profile, or a fixed provider.

Supported models

ModelProviderSummary
gpt-5.2OpenAIHighest quality synthesis for complex, multi-step research tasks
gemini-3.1-proGoogleLatest Gemini Pro-tier reasoning model for deep, long-context synthesis
gemini-3-proGoogleAdvanced reasoning with strong long context performance
gemini-3-flashGoogleBest performance for high-volume, low-latency research
claude-opus-4.6AnthropicStrongest Claude-tier synthesis quality for long-form, nuanced analysis

gemini-3-pro remains supported for backward compatibility. Prefer gemini-3.1-pro for new integrations.

The model parameter controls research synthesis. Retrieval and source gathering remain the same.

When to use each model

gpt-5.2

Best for

  • Complex, multi-step reasoning
  • Cross-domain technical synthesis
  • High-stakes decisions that need the strongest accuracy

Trade-offs

  • Typically higher latency than Flash-tier models
gemini-3.1-pro

Best for

  • Deep analysis in code, math, or STEM topics
  • Long-context synthesis over large documents or datasets
  • Highest Gemini-tier quality for technical and research-heavy prompts

Trade-offs

  • Slower than Flash for simple or high-throughput workloads
gemini-3-flash

Best for

  • Large scale processing and batch research
  • Low latency, high volume workloads
  • Agentic or iterative tasks that need fast turns

Trade-offs

  • Less depth than Pro or Opus on very complex analysis
claude-opus-4.6

Best for

  • Maximum Claude-tier synthesis quality
  • Highly nuanced argumentation and editorial polish
  • Long-form outputs where tone and structure matter

Trade-offs

  • Higher latency than Flash-tier and many non-Opus models

Example

1{
2 "query": "Compare major approaches to carbon capture and their performance",
3 "model": "gemini-3-flash",
4 "reasoning_loops": 2
5}

Quick picker

GoalRecommended model
Fast, high volume researchgemini-3-flash
Strong reasoning with long contextgemini-3.1-pro
Maximum accuracy on complex researchgpt-5.2
Maximum Claude-tier writing qualityclaude-opus-4.6

Learn more

  • https://openai.com/index/introducing-gpt-5-2/
  • https://ai.google.dev/gemini-api/docs/models
  • https://www.anthropic.com/news/claude-3-family