Recommended endpoint
Minimal request
cURL example
Python example
Node.js example
Best practices
- Use Pro-class models for deeper reasoning quality
- Increase
thinkingBudgetgradually instead of maxing it out immediately - Keep a
thinkingBudget: 0fallback path for latency-sensitive requests
