Recommended endpoint
Minimal request
cURL example
Python example
Node.js example
Best practices
- Enable thinking only for genuinely hard reasoning tasks
- Start with a smaller
budget_tokensvalue - If tools are involved later, verify that your stream parser supports thinking events too
