This server plugs Cerebras's fast inference into your IDE through MCP, specifically targeting a hybrid workflow where you plan with Claude, Cline, or Cursor and then execute code changes with Cerebras's Qwen 3 Coder model. It exposes a single write tool that takes natural language prompts and context files, then generates code with visual Git-style diffs. The setup wizard handles configuration for Claude Code, Cline, Cursor, and VS Code, and you can optionally add OpenRouter as a fallback if you hit Cerebras rate limits. Reach for this when you want to offload repetitive code generation to a faster model while keeping your primary AI for architecture decisions.
This MCP server is designed for planning with Claude Code, Cline, or Cursor and making changes with Cerebras to maximize speed and intelligence while avoiding API limits. Use your preferred AI for planning and strategy, then leverage Cerebras for high-quality code generation.
It uses the Qwen 3 Coder model and can be embedded in tools such as Claude Code and Cline, with beta support for Cursor.
[Figure: write usage count among models]
npm install -g cerebras-code-mcp
Visit cloud.cerebras.ai and create an API key
[OPTIONAL] Add OpenRouter as a backup in case you hit your Cerebras rate limits
Visit OpenRouter and get a key to use as a fallback provider.
You can set this key in your MCP settings under OPENROUTER_API_KEY. The fallback then triggers automatically whenever a call to Cerebras fails.
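The fallback behavior described above can be pictured with a small sketch. This is not the server's actual implementation: the function name generate_with_fallback and the two injected callables are hypothetical stand-ins for the real provider clients, and only the OPENROUTER_API_KEY variable name comes from this document.

```python
import os
from typing import Callable, Mapping, Optional


def generate_with_fallback(
    prompt: str,
    call_cerebras: Callable[[str], str],    # hypothetical primary client
    call_openrouter: Callable[[str], str],  # hypothetical fallback client
    env: Optional[Mapping[str, str]] = None,
) -> str:
    """Try Cerebras first; use OpenRouter only if a fallback key is configured."""
    env = os.environ if env is None else env
    try:
        return call_cerebras(prompt)
    except Exception:
        # Without a fallback key there is nothing to fall back to,
        # so the original Cerebras error is re-raised unchanged.
        if not env.get("OPENROUTER_API_KEY"):
            raise
        return call_openrouter(prompt)
```

Passing the provider calls in as arguments keeps the sketch independent of any particular SDK, which is why no real client library appears here.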
cerebras-mcp --config
Use the setup wizard to configure the tool on your machine.
If you're using Cursor, it will ask you to copy and paste a prompt into your Cursor User Rules.
cerebras-mcp --remove
Use the removal wizard to clean up configurations for any IDE or perform a complete cleanup.
The MCP tool will appear as write in your tool list. It supports natural language prompts, context files, and Git-style diff output.
Example usage:
Create a REST API with Express.js that handles user authentication
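For readers curious what such a prompt looks like on the wire, here is a rough sketch of an MCP tools/call request for the write tool. The JSON-RPC envelope follows the MCP specification, but the argument names prompt and context_files are assumptions made for illustration; the tool's real input schema is not documented here and may differ.

```python
import json
from itertools import count
from typing import Iterable

_request_ids = count(1)  # simple incrementing JSON-RPC request ids


def build_write_request(prompt: str, context_files: Iterable[str] = ()) -> dict:
    """Build a JSON-RPC 2.0 'tools/call' message targeting the write tool."""
    return {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {
            "name": "write",
            # Hypothetical argument names; check the tool's published schema.
            "arguments": {
                "prompt": prompt,
                "context_files": list(context_files),
            },
        },
    }


if __name__ == "__main__":
    request = build_write_request(
        "Create a REST API with Express.js that handles user authentication"
    )
    print(json.dumps(request, indent=2))
```

In practice the IDE's MCP client builds this message for you; the sketch only shows where the natural language prompt ends up.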
CEREBRAS_API_KEY (required, secret): Your Cerebras API key from cloud.cerebras.ai
OPENROUTER_API_KEY (optional, secret): OpenRouter API key for fallback when Cerebras rate limits are hit
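The setup wizard normally writes these values for you. If you ever need to inspect or hand-edit the result, the sketch below builds the kind of entry many MCP clients expect. The mcpServers layout, the server name cerebras-code, and the assumption that running cerebras-mcp with no flags starts the server are illustrative guesses, not confirmed details of this package.

```python
import json
from typing import Optional


def build_mcp_entry(cerebras_key: str, openrouter_key: Optional[str] = None) -> dict:
    """Return a settings fragment with the two environment variables above."""
    env = {"CEREBRAS_API_KEY": cerebras_key}
    if openrouter_key:
        # Only include the fallback key when one is actually provided.
        env["OPENROUTER_API_KEY"] = openrouter_key
    return {
        "mcpServers": {
            "cerebras-code": {          # hypothetical server name
                "command": "cerebras-mcp",
                "env": env,
            }
        }
    }


if __name__ == "__main__":
    print(json.dumps(build_mcp_entry("<your-cerebras-key>"), indent=2))
```

Leaving OPENROUTER_API_KEY out entirely, rather than setting it to an empty string, keeps the optional fallback clearly disabled.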