The Cerebras Code MCP server is currently in research preview and is open source here. We welcome contributions!
Set up your API key
You need a valid Cerebras API key. Please visit this link and sign up, then click on API Keys in the left navigation.
Optionally, create an OpenRouter key here to use as fallback if you hit Cerebras rate limits.
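As a rough sketch of how a client script might read these keys, the snippet below requires the Cerebras key and treats the OpenRouter key as optional. The environment variable names `CEREBRAS_API_KEY` and `OPENROUTER_API_KEY` and the helper `load_api_keys` are assumptions made for illustration. Check the server's own configuration instructions for the names it actually expects.

```python
import os
from typing import Dict, Mapping, Optional


def load_api_keys(env: Mapping[str, str]) -> Dict[str, Optional[str]]:
    """Read API keys from an environment-like mapping.

    The variable names used here are assumptions for this sketch,
    not names confirmed by the Cerebras Code MCP documentation.
    """
    # The Cerebras key is required, so fail early with a clear message.
    cerebras_key = env.get("CEREBRAS_API_KEY", "").strip()
    if not cerebras_key:
        raise RuntimeError("CEREBRAS_API_KEY is not set")

    # The OpenRouter key is an optional fallback; None means "no fallback".
    openrouter_key = env.get("OPENROUTER_API_KEY", "").strip() or None

    return {"cerebras": cerebras_key, "openrouter": openrouter_key}


if __name__ == "__main__":
    keys = load_api_keys(os.environ)
    # Report only which keys are present; never print the key values.
    print("Cerebras key found")
    print("OpenRouter fallback:", "configured" if keys["openrouter"] else "not configured")
```

Passing the environment in as a mapping rather than reading `os.environ` inside the function keeps the check easy to test with a plain dictionary.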
Available Models
The Cerebras Code MCP Server supports all Cerebras models:

| Model | Parameters | Best For |
|---|---|---|
| llama-3.3-70b | 70B | Best for complex reasoning, long-form content, and tasks requiring deep understanding |
| qwen-3-32b | 32B | Balanced performance for general-purpose applications |
| llama3.1-8b | 8B | Fastest option for simple tasks and high-throughput scenarios |
| gpt-oss-120b | 120B | Large model for demanding tasks |
| zai-glm-4.7 | 357B | Advanced 357B-parameter model with strong reasoning capabilities |
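
The model IDs in the table are not formatted uniformly (compare `llama-3.3-70b` with `llama3.1-8b`), so a typo is easy to make. The sketch below copies the IDs and parameter counts from the table into a lookup and checks a requested ID before it is used. The names `MODEL_PARAMS_B` and `validate_model` are hypothetical helpers for illustration, not part of the server.

```python
from typing import Dict

# Model IDs and parameter counts (in billions), copied from the table above.
MODEL_PARAMS_B: Dict[str, int] = {
    "llama-3.3-70b": 70,
    "qwen-3-32b": 32,
    "llama3.1-8b": 8,
    "gpt-oss-120b": 120,
    "zai-glm-4.7": 357,
}


def validate_model(model_id: str) -> str:
    """Return model_id unchanged if it is in the table, else raise ValueError."""
    if model_id not in MODEL_PARAMS_B:
        known = ", ".join(sorted(MODEL_PARAMS_B))
        raise ValueError(f"Unknown model {model_id!r}; expected one of: {known}")
    return model_id


if __name__ == "__main__":
    print(validate_model("llama3.1-8b"))
    # The smallest model by parameter count, which the table lists as the fastest.
    print(min(MODEL_PARAMS_B, key=MODEL_PARAMS_B.get))
```

A lookup like this goes stale when the supported model list changes, so it is best treated as a local convenience and not a source of truth.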

