Forge

5 tools · auth · STDIO · registry active

Summary

Connects Claude and other MCP clients to Forge, Voxell's hosted text-embedding API running the Qwen3-Embedding family. You get two tools: embed, which turns text into vectors (with input_type flags for query vs. document, Matryoshka dimension truncation, and three quality tiers), and list_models, which shows what's available. The ultra tier runs the 8B model that sits at #4 on MTEB English. Voxell doesn't store your text or vectors, just token counts for billing. Useful when you need semantic search, RAG retrieval, or similarity scoring without managing your own embedding infrastructure. Forge also speaks OpenAI's embeddings API, so you can swap it into existing code that already calls text-embedding-3-large.

Tools

Public tool metadata for what this MCP can expose to an agent.

5 tools

compare_ai_tools (3 params)

Compare 2 or more AI coding tools side-by-side. Free tier returns a summary comparison table. Pro tier ($9) returns full detailed analysis with recommendations, pricing breakdowns, and growth trends. Available tools: claude-code, cursor, windsurf, devin, openhands, github-copi...

Parameters (* = required)

tools (array)

List of tool IDs to compare (2-8). Options: claude-code, cursor, windsurf, devin, openhands, github-copilot, aider, cline

api_key (string)

Pro API key for full comparison with recommendations

aspects (array)

Comparison aspects (default: all)

get_tool_profile (1 param)

Get detailed profile for a single AI coding tool. Includes features, pricing, strengths/weaknesses, and use cases. Available: claude-code, cursor, windsurf, devin, openhands, github-copilot, aider, cline.

Parameters (* = required)

tool_id (string)

Tool ID; one of claude-code · cursor · windsurf · devin · openhands · github-copilot

recommend_tool (5 params)

Get an AI-powered recommendation for which tool best fits your use case. Analyzes your requirements and returns ranked suggestions with reasoning. [PRO ONLY: $9 one-time key]

Parameters (* = required)

budget (string)

Monthly budget range; one of free · under-20 · under-50 · unlimited

api_key (string)

Pro API key (required)

use_case (string)

Describe your use case (e.g., "full-stack web dev with React and Python")

experience (string)

Developer experience level; one of beginner · intermediate · expert

preferences (array)

Preferences (optional)

get_pricing_comparison (1 param)

Get a complete pricing comparison table for all AI coding tools. Shows free tiers, pro pricing, team pricing, and enterprise options. Always free; no API key needed.

Parameters (* = required)

sort_by (string)

Sort order (default: price_asc); one of price_asc · price_desc · popularity · name

purchase_pro_key (1 param)

Get instructions to purchase a Pro API key ($9 one-time) for full comparisons and AI recommendations. Unlock detailed analysis, growth trends, and personalized tool recommendations.

Parameters (* = required)

payment_method (string)

Payment method; one of paypal

@voxell/forge-mcp

An MCP server for Forge — Voxell's hosted text-embedding API. It exposes Forge to any MCP client (Claude, Cursor, Cline, Windsurf, VS Code, …) as two tools:

embed — turn text into vectors
list_models — list available models and their dimensions

You bring a Forge API key. The server is stateless, and Voxell does not store the text you send or the vectors it returns — only usage metadata (token counts) is recorded, for billing. It does embeddings only — no storage, no search, no RAG. Those are different products.

Quick install

One-click install in your editor (then replace your-key-here with a real key from dash.voxell.ai):

Claude Code — one command:

claude mcp add forge -e FORGE_API_KEY=your-key-here -- npx -y @voxell/forge-mcp

Any other client (Claude Desktop, Cline, Windsurf, Zed, …) uses the standard mcpServers block — see Use it below.

Why Forge

Quality you can dial. Forge runs the Qwen3-Embedding family. ultra is the 8B model: 75+ average task score on MTEB, currently #4 on MTEB (English), and the highest-ranked model you can actually deploy (the three ranked above it are research-only). turbo (0.6B) is the fast, cheap default. Pick your quality/cost point.
Matryoshka (MRL). Set dim to keep only the first N dimensions of each vector (re-normalized after truncation). Cutting to a quarter of the default dimensions, for example, gives ~4× smaller, cheaper vectors.
Low latency (Go + CUDA engine), zero-trust (per-key auth; mTLS available), and free to start (10M tokens, no card — dash.voxell.ai; more at voxell.ai/forge).
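
To make the Matryoshka point concrete, here is a minimal sketch of what "truncate, then re-normalize" means for a single vector. Forge does this for you when you set dim; the helper name `truncate_and_renormalize` and the toy vector are illustrative assumptions, not part of Forge or its client libraries.

```python
import math


def truncate_and_renormalize(vector, dim):
    """Keep the first `dim` components, then rescale the result to unit length.

    Hypothetical helper for illustration only; Forge applies the equivalent
    step server-side when the `dim` argument is set.
    """
    if not 0 < dim <= len(vector):
        raise ValueError("dim must be between 1 and len(vector)")
    head = vector[:dim]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        # An all-zero prefix cannot be normalized; return it unchanged.
        return list(head)
    return [x / norm for x in head]


# Toy 4-d unit vector cut down to 2 dimensions.
full = [0.5, 0.5, 0.5, 0.5]
short = truncate_and_renormalize(full, 2)
```

Re-normalizing matters because cosine and dot-product comparisons assume unit-length vectors; a truncated vector that is not rescaled would score systematically lower.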

What you can do with it

Add semantic search — embed your documents with input_type: "document" and each query with input_type: "query", then rank by cosine similarity.
Build RAG — embed a knowledge base, store the vectors, and retrieve the closest chunks to ground an LLM.
Find similar or duplicate text — embed two texts and compare their vectors.
Cluster or classify — embed a batch, then cluster or train a classifier on the vectors.
Shrink vector storage — set dim to truncate (Matryoshka) and trade a little accuracy for smaller, cheaper vectors.
Straight from your editor — ask your AI agent (Cursor, Claude, …) to embed a snippet, a batch, or a file via the embed tool — no separate script.
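
The semantic-search item above ends with "rank by cosine similarity". A minimal sketch of that ranking step, using only the standard library, might look like the following. The vectors here are made-up 3-d stand-ins for real embeddings, and the function names are illustrative rather than anything Forge ships.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_documents(query_vec, doc_vecs):
    """Return (document index, score) pairs, most similar first."""
    scored = [(i, cosine_similarity(query_vec, v)) for i, v in enumerate(doc_vecs)]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy stand-ins: in practice the query vector would come from embed with
# input_type "query" and the document vectors from input_type "document".
query = [1.0, 0.0, 0.0]
docs = [[0.0, 1.0, 0.0], [0.9, 0.1, 0.0], [0.5, 0.5, 0.0]]
ranking = rank_documents(query, docs)
```

For more than a few thousand documents you would normally hand this step to a vector database or an approximate-nearest-neighbor index rather than scoring every vector in a loop.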

Requirements

Node.js ≥ 18 (tested on 20)
A Forge API key — create one at https://dash.voxell.ai. New accounts start with 10M free tokens, no credit card.

Use it

Most MCP clients run it on demand with npx. Add this to your client's MCP config:

{
  "mcpServers": {
    "forge": {
      "command": "npx",
      "args": ["-y", "@voxell/forge-mcp"],
      "env": { "FORGE_API_KEY": "your-key-here" }
    }
  }
}

(Cursor, Claude Desktop, Cline, Windsurf, and VS Code all use this mcpServers shape.)

Tools

`embed`

| arg | type | default | notes |
| --- | --- | --- | --- |
| `input` | string or string[] | (none) | text(s) to embed (required) |
| `model` | string | `turbo` | `turbo` (1024-d), `pro` (2560-d), `ultra` (4096-d) |
| `dim` | number | model default | truncate to N dimensions (Matryoshka); works on every model |
| `input_type` | `"query"` \| `"document"` | `document` | use `query` for search queries |

Returns the vectors plus the model, dimension, and token count.

The default is turbo, which is the one you probably want. pro and ultra return more dimensions per vector, at the cost of larger vectors and slower responses.
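
Your MCP client normally builds the tool call for you, but it can help to see what an embed invocation looks like on the wire. The sketch below assembles a JSON-RPC `tools/call` request in the shape the MCP specification defines, using the argument names from the table above. The helper `build_embed_call`, the sample text, and the `dim` value of 256 are illustrative assumptions.

```python
import json


def build_embed_call(texts, model="turbo", dim=None, input_type="document", request_id=1):
    """Assemble an MCP tools/call request for the embed tool.

    Hypothetical helper: MCP clients construct this message themselves.
    """
    arguments = {"input": texts, "model": model, "input_type": input_type}
    if dim is not None:
        # Only send dim when truncation is wanted; otherwise the model default applies.
        arguments["dim"] = dim
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "embed", "arguments": arguments},
    }


# A search query embedded with the default tier, truncated to an example size.
request = build_embed_call(["how do I rotate an API key?"], input_type="query", dim=256)
print(json.dumps(request, indent=2))
```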

`list_models`

Lists the available models and their dimensions.

Configuration

| env | required | default |
| --- | --- | --- |
| `FORGE_API_KEY` | yes | (none) |
| `FORGE_BASE_URL` | no | `https://api.voxell.ai` |
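
If you call Forge from your own scripts, you may want to resolve the same two variables the same way. This is a rough sketch under that assumption; it is not the server's actual startup code, and `load_forge_config` is a hypothetical name.

```python
import os

DEFAULT_BASE_URL = "https://api.voxell.ai"


def load_forge_config(env=None):
    """Read FORGE_API_KEY (required) and FORGE_BASE_URL (optional) from a mapping.

    Hypothetical helper; pass a dict for testing, or omit `env` to read os.environ.
    """
    env = os.environ if env is None else env
    api_key = env.get("FORGE_API_KEY", "").strip()
    if not api_key:
        raise RuntimeError("FORGE_API_KEY is required")
    base_url = env.get("FORGE_BASE_URL", "").strip() or DEFAULT_BASE_URL
    # Strip a trailing slash so paths can be appended without doubling it.
    return {"api_key": api_key, "base_url": base_url.rstrip("/")}
```

Failing fast on a missing key gives a clearer error than letting the first API request come back unauthorized.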

Beyond MCP: OpenAI-compatible API

Forge speaks the OpenAI embeddings API. Point any OpenAI client at Forge by changing only the base URL and API key; your call sites stay the same, and your existing vector dimensions are preserved:

from openai import OpenAI

client = OpenAI(base_url="https://api.voxell.ai/v1", api_key="your-forge-key")
# the exact call you already make — now on a higher-ranked engine:
client.embeddings.create(model="text-embedding-3-large", input=["hello world"])  # -> 3072-d

Your OpenAI model names map to a Forge tier with matching dimensions (text-embedding-3-small / ada-002 → 1536-d, text-embedding-3-large → 3072-d), so existing vector stores slot in unchanged. You can also address Forge tiers directly: turbo, pro, or ultra. The API also supports dimensions (Matryoshka, re-normalized) and encoding_format: "base64".
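
When you request encoding_format: "base64" through a raw HTTP client, each embedding arrives as a string rather than a list of floats. The sketch below decodes such a string, assuming Forge follows OpenAI's layout of packed little-endian float32 values; the source only states that the format is supported, so treat the byte layout as an assumption to verify. The official OpenAI Python client performs this decoding itself, so you only need it for hand-rolled requests.

```python
import base64
import struct


def decode_base64_embedding(payload):
    """Decode a base64 string of packed little-endian float32 values.

    Assumes the OpenAI-style layout; hypothetical helper for illustration.
    """
    raw = base64.b64decode(payload)
    if len(raw) % 4 != 0:
        raise ValueError("payload length is not a multiple of 4 bytes")
    count = len(raw) // 4
    return list(struct.unpack(f"<{count}f", raw))


# Round-trip a tiny made-up vector to show the format.
sample = base64.b64encode(struct.pack("<3f", 0.25, -0.5, 1.0)).decode("ascii")
vector = decode_base64_embedding(sample)
```

Base64 responses are smaller on the wire than JSON arrays of floats, which is the usual reason to ask for them on large batches.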

It's an upgrade on every path. Forge's smallest tier (turbo, Qwen3-Embedding-0.6B) outranks OpenAI's largest embedding model (text-embedding-3-large) on MTEB — so there's no drop-in that lands worse. ultra (Qwen3-Embedding-8B, ~75+ average task score, #4 on MTEB English) is a different league.

Why re-embedding onto Forge is worth it. Embedding is a one-way door: whatever an encoder discards at write time is gone — no reranker, longer prompt, or bigger LLM downstream reconstructs what the vectors never captured. The model you embed with sets the ceiling on everything above it. Re-embed once onto a higher-ranked engine and that ceiling rises — permanently.

License

MIT © Voxell, Inc.

Configuration

FORGE_API_KEY (required, secret)

Your Forge API key — create one at https://dash.voxell.ai (free tokens to start, no card).
