X-Cerebras-Version-Patch header and becomes the default on July 21, 2026.
Review what changed, then set
X-Cerebras-Version-Patch: 2 to test your integration before the July 21, 2026 cutover.What’s Guaranteed
Within a given API version, we guarantee:- Existing request parameters continue to work as documented
- Existing response fields remain present with the same types
- New optional request parameters may be added
- New fields may be added to responses
- Error message text may change (but not error types within a version)
Setting the API Version
Override the default API version per-request by passing theX-Cerebras-Version-Patch header:
Version 2 Rollout Timeline
| Date | What Happens |
|---|---|
| January 21, 2026 | Version 2 available for testing via header |
| July 21, 2026 | Version 2 becomes the default; older versions end-of-life |
What Changed in Version 2
Version 2 introduces stricter validation for structured outputs and tool calling, refines reasoning model behavior, and fixes edge cases. If your application uses JSON schemas, tool calls, or reasoning models, review these changes carefully.Structured Outputs: Stricter Schema Validation
When usingstrict: true, Version 2 requires explicit, strictly-typed schemas. additionalProperties: false is required at every level of nested objects:
Tool Calling: Stricter Message Validation
Version 2 validates the structure of multi-turn conversations involving tool calls:| Validation | What it checks |
|---|---|
| Tool response completeness | Every tool_call_id in an assistant message must have a corresponding tool message immediately following |
| Orphan tool messages | Tool messages cannot reference a tool_call_id that doesn’t exist in a prior assistant message |
| Tool choice consistency | tool_choice cannot be set unless tools is also provided |
| Unique tool call IDs | Duplicate tool_call.id values in the same request are rejected |
Reasoning Logprobs Added
For reasoning models, version 2 adds a separatereasoning_logprobs field that contains logprobs for the reasoning tokens. Previously, reasoning logprobs were included in the main logprobs field alongside content logprobs.
See Reasoning for details on reasoning format options.
Unicode Handling Fix
Logprobs now reflect partial Unicode tokens as they appear in the model’s vocabulary. Previously, the Unicode replacement character (\uFFFD) was not handled correctly in logprobs output. This fix is backward-compatible and requires no code changes.
Migration Checklist
Use this checklist to prepare for version 2:Test with the version header
Add
X-Cerebras-Version-Patch: 2 to your requests and run your test suite.Update JSON schemas
- Add all properties to the
requiredarray - Add
additionalProperties: falseto all object definitions - Ensure enum values match their property types
Update reasoning parsing (if using reasoning models)
If you parse reasoning content, update your code to read from the
reasoning field in the response instead of extracting it from content.Future Versions
We strive to maintain backward compatibility whenever possible. In rare cases where breaking changes are necessary, a new API version will be released and made available for testing via theX-Cerebras-Version-Patch header for a minimum of 6 months before taking effect. Cerebras Cloud users will be notified via email before each version transition.
