Guide to choosing the right models for testing and evaluating your environments
<think>
sections when applied to messages; a collection with pre-modified templates is available here.
extra_body={"thinking": {"type": "enabled", "budget_tokens": 2000}} in sampling_args
reasoning_effort = "low" / "medium" / "high" in sampling_args
reasoning_effort = "none" in sampling_args
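As a rough illustration of the settings listed above, the sketch below builds the three kinds of sampling_args dictionaries in plain Python. The helper name build_sampling_args and its parameters are hypothetical and not part of any library. Only the keys and values (extra_body, thinking, budget_tokens, reasoning_effort) come from the text. Which setting applies to which model is not specified here, so treat this as a sketch under those assumptions.

```python
from typing import Any, Dict, Optional

# Effort levels named in the text; "none" is the value used to turn reasoning off.
ALLOWED_EFFORTS = {"none", "low", "medium", "high"}


def build_sampling_args(
    thinking_budget: Optional[int] = None,
    reasoning_effort: Optional[str] = None,
) -> Dict[str, Any]:
    """Hypothetical helper: assemble a sampling_args dict from the options above."""
    args: Dict[str, Any] = {}
    if thinking_budget is not None:
        # Token-budgeted thinking, passed through extra_body as in the text.
        args["extra_body"] = {
            "thinking": {"type": "enabled", "budget_tokens": thinking_budget}
        }
    if reasoning_effort is not None:
        if reasoning_effort not in ALLOWED_EFFORTS:
            raise ValueError(f"unsupported reasoning_effort: {reasoning_effort!r}")
        args["reasoning_effort"] = reasoning_effort
    return args


budgeted = build_sampling_args(thinking_budget=2000)
low_effort = build_sampling_args(reasoning_effort="low")
no_reasoning = build_sampling_args(reasoning_effort="none")
```

Each resulting dictionary would then be supplied as sampling_args wherever the evaluation is configured. Keeping the construction in one helper makes it easy to validate the effort level before a run starts.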