Ask Ollama

lykhoyda/ask-llm
STDIO · registry active
Summary

Runs local Ollama models through the ask-ollama MCP tool, giving Claude a private, offline second opinion with no API costs and no data leaving your machine. It returns structured responses with provider metadata and supports multi-turn conversations via sessionId, which keeps context across prompts. The package is part of a broader ask-llm suite that also includes Gemini and Codex bridges, but this one has no cloud dependencies. Reach for it when you want code reviews, architecture feedback, or brainstorming from models like qwen2.5-coder running on your own hardware. It also includes get-usage-stats for tracking token counts and a diagnose tool that verifies your Ollama installation and model availability.


Ask LLM


| Package | Type | Version | Downloads |
| --- | --- | --- | --- |
| ask-gemini-mcp | MCP Server | npm | downloads |
| ask-codex-mcp | MCP Server | npm | downloads |
| ask-ollama-mcp | MCP Server | npm | downloads |
| ask-antigravity-mcp | MCP Server | npm | downloads |
| ask-llm-mcp | MCP Server | npm | downloads |
| @ask-llm/plugin | Claude Code Plugin | GitHub | /plugin install |

MCP servers + Claude Code plugin for AI-to-AI collaboration

Get a second opinion before you ship. Ask LLM lets your AI assistant — Claude Code, Cursor, Claude Desktop, or any of 40+ MCP clients — consult a second model to review your code, debate a plan, or catch a bug it might have missed. Pick the reviewer that fits: OpenAI Codex (GPT-5.5), Google Antigravity (agy), a local Ollama model, or Gemini (1M+ token context). Standard MCP, no prompt hacks.

⚠️ Gemini CLI goes enterprise-only on 2026-06-18: From that date Google restricts Gemini CLI to Gemini Code Assist Standard/Enterprise seats, and free, Google AI Pro, and Ultra accounts lose access. ask-gemini-mcp still installs, but a non-enterprise account then surfaces actionable guidance instead of output. Free/Pro users: switch to ask-antigravity (the Google-sanctioned successor, subscription-backed via Google AI Pro/Ultra), ask-codex, or ask-ollama. Announcement

Why a second opinion?

Your primary AI is confident — but confidence isn't correctness. A second model, with no stake in the first one's answer, catches what it missed.

  • Second opinion on code — before you commit to an approach, have another model review it independently.
  • Debate a plan — send an architecture proposal for critique, alternatives, and trade-off analysis.
  • Review a diff — have a different model analyze your changes to surface issues your primary AI glossed over.
  • Read more than fits — Gemini and Antigravity's large context windows ingest whole codebases at once.
  • Keep it local — run reviews through Ollama when nothing can leave your machine.

In action

You:    ask codex to review src/auth.ts for security issues
Codex:  ⚠ verifyToken() compares tokens with === — not timing-safe (line 42)
        ⚠ the session cookie is missing a SameSite attribute
Claude: Good catches — applying both fixes to src/auth.ts.

One prompt. A second model reviews independently; your assistant applies the fix — no copy-paste between tools.

Quick Start

Claude Code

# All-in-one — auto-detects installed providers
claude mcp add --scope user ask-llm -- npx -y ask-llm-mcp

# Or install providers individually
claude mcp add --scope user gemini -- npx -y ask-gemini-mcp
claude mcp add --scope user codex -- npx -y ask-codex-mcp
claude mcp add --scope user ollama -- npx -y ask-ollama-mcp
claude mcp add --scope user antigravity -- npx -y ask-antigravity-mcp

Claude Desktop

Add to claude_desktop_config.json:

{
  "mcpServers": {
    "ask-llm": {
      "command": "npx",
      "args": ["-y", "ask-llm-mcp"]
    }
  }
}

Or install providers individually:

{
  "mcpServers": {
    "gemini": {
      "command": "npx",
      "args": ["-y", "ask-gemini-mcp"]
    },
    "codex": {
      "command": "npx",
      "args": ["-y", "ask-codex-mcp"]
    },
    "ollama": {
      "command": "npx",
      "args": ["-y", "ask-ollama-mcp"]
    }
  }
}
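
If claude_desktop_config.json already lists other servers, the ask-llm entry has to be merged in rather than pasted over the file. The following is a minimal sketch of that merge, not part of the project: `add_mcp_server` is a hypothetical helper, and the `filesystem` entry with `some-other-server` is a made-up placeholder for whatever is already configured.

```python
import json
from copy import deepcopy


def add_mcp_server(config: dict, name: str, package: str) -> dict:
    """Return a copy of config with an npx-launched MCP server entry added.

    Hypothetical helper: it only reproduces the JSON shape shown above.
    """
    merged = deepcopy(config)
    servers = merged.setdefault("mcpServers", {})
    servers[name] = {"command": "npx", "args": ["-y", package]}
    return merged


# Placeholder for a config that already contains an unrelated server.
existing = {
    "mcpServers": {
        "filesystem": {"command": "npx", "args": ["-y", "some-other-server"]}
    }
}

updated = add_mcp_server(existing, "ask-llm", "ask-llm-mcp")
print(json.dumps(updated, indent=2))
```

The helper copies the input first, so the original dictionary is left untouched and the result can be reviewed before it is written back to disk.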
Cursor, Codex CLI, OpenCode, and other clients

Cursor (.cursor/mcp.json):

{
  "mcpServers": {
    "ask-llm": { "command": "npx", "args": ["-y", "ask-llm-mcp"] }
  }
}

Codex CLI (~/.codex/config.toml):

[mcp_servers.ask-llm]
command = "npx"
args = ["-y", "ask-llm-mcp"]

Any MCP Client (STDIO transport):

{ "command": "npx", "args": ["-y", "ask-llm-mcp"] }

Replace ask-llm-mcp with ask-codex-mcp, ask-antigravity-mcp, ask-ollama-mcp, or ask-gemini-mcp for a single provider.

Choose your reviewer

| Provider | Best for | Model (default → fallback) | Notes |
| --- | --- | --- | --- |
| Codex | Code reasoning, targeted reviews, architecture critique | gpt-5.5 → gpt-5.4-mini | Requires an OpenAI/Codex account |
| Antigravity | A subscription-backed second opinion; larger-context reads | Gemini 3.1 Pro (High) → Gemini 3.5 Flash (High) | Google AI Pro/Ultra plan; one-shot, experimental |
| Ollama | Private/local review, zero cost, offline | qwen3.6:27b (no auto-fallback) | Runs entirely on your machine |
| Gemini | Whole-codebase reads (1M+ tokens) | gemini-3.1-pro-preview → gemini-3.5-flash | ⚠️ Enterprise-gated from 2026-06-18 |
| Unified (ask-llm) | One install for all of the above; fan out in parallel | routes per call | Recommended |

Claude Code Plugin

The Ask LLM plugin adds multi-provider code review, brainstorming, and automated hooks directly into Claude Code:

/plugin marketplace add Lykhoyda/ask-llm
/plugin install ask-llm@ask-llm-plugins

What You Get

| Feature | Description |
| --- | --- |
| /multi-review | Parallel Antigravity + Codex review with 4-phase validation pipeline and consensus highlighting (gemini via /gemini-review) |
| /gemini-review | Gemini-only review with confidence filtering |
| /codex-review | Codex-only review with confidence filtering |
| /ollama-review | Local review; no data leaves your machine |
| /antigravity-review | Subscription-backed review via Google Antigravity (agy); experimental |
| /brainstorm | Multi-LLM brainstorm: Claude Opus researches the topic against real files in parallel with external providers (Gemini/Codex/Ollama), then synthesizes all findings with verified findings weighted higher |
| /compare | Side-by-side raw responses from multiple providers, no synthesis; for when you want to see how each provider phrases the same answer |
| codex-pair hook | Opt-in continuous review: runs Codex against every Edit/Write/MultiEdit when a .codex-pair/context.md marker is present in the project |

The review agents use a 4-phase pipeline inspired by Anthropic's code-review plugin: context gathering, prompt construction with explicit false-positive exclusions, synthesis, and source-level validation of each finding.

See the plugin docs for details.
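
The opt-in rule for the codex-pair hook in the table above (review only Edit/Write/MultiEdit calls, and only when the marker file exists) can be illustrated with a small gate function. This is a sketch under assumptions, not the plugin's code: `should_review` and `REVIEWED_TOOLS` are hypothetical names, and the real hook may apply further conditions.

```python
import tempfile
from pathlib import Path

# Tool names and marker path are taken from the feature table above.
REVIEWED_TOOLS = {"Edit", "Write", "MultiEdit"}
MARKER = Path(".codex-pair") / "context.md"


def should_review(tool_name: str, project_root: Path) -> bool:
    """Hypothetical gate: review only edit-type tools in opted-in projects."""
    return tool_name in REVIEWED_TOOLS and (project_root / MARKER).is_file()


# Demonstrate the gate in a throwaway directory.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    before = should_review("Edit", root)        # no marker yet
    (root / ".codex-pair").mkdir()
    (root / MARKER).write_text("pairing notes\n")
    after = should_review("Edit", root)         # marker present
    other_tool = should_review("Read", root)    # not an edit-type tool

print(before, after, other_tool)  # False True False
```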

Prerequisites

  • Node.js v20.0.0 or higher (LTS)
  • At least one provider:
    • Codex CLI — installed and authenticated
    • Antigravity CLI (agy) — installed and logged in once (Google AI Pro/Ultra)
    • Ollama — running locally with a model pulled (ollama pull qwen3.6:27b)
    • Gemini CLI — npm install -g @google/gemini-cli && gemini login (enterprise-gated from 2026-06-18)

MCP Tools

| Tool | Package | Purpose |
| --- | --- | --- |
| ask-gemini | ask-gemini-mcp | Send prompts to Gemini CLI with @ file syntax. 1M+ token context. Live progressive output via stream-json |
| ask-gemini-edit | ask-gemini-mcp | Get structured OLD/NEW code edit blocks from Gemini |
| fetch-chunk | ask-gemini-mcp | Retrieve chunks from cached large responses |
| ask-codex | ask-codex-mcp | Send prompts to Codex CLI. GPT-5.5 with mini fallback. Native session resume via sessionId |
| ask-ollama | ask-ollama-mcp | Send prompts to local Ollama. Fully private, zero cost. Server-side conversation replay via sessionId |
| ask-antigravity | ask-antigravity-mcp | Send a prompt to Google Antigravity (agy) for a subscription-backed second opinion. Experimental; one-shot |
| ask-llm | ask-llm-mcp | Unified orchestrator: pick provider per call. Fan out to all installed providers |
| multi-llm | ask-llm-mcp | Dispatch the same prompt to multiple providers in parallel; returns per-provider responses + usage in one call |
| get-usage-stats | all | Per-session token totals, fallback counts, breakdowns by provider/model; all in-memory, no persistence |
| diagnose | ask-llm-mcp | Self-diagnosis: Node version, PATH resolution, provider CLI presence + versions. Read-only |
| ping | all | Connection test: verify MCP setup |

All ask-* tools accept an optional sessionId parameter for multi-turn conversations and now return a structured AskResponse (provider, response, model, sessionId, usage) via MCP outputSchema alongside the human-readable text. The orchestrator (ask-llm-mcp) also exposes usage://current-session as an MCP Resource for live JSON snapshots.
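
On the client side, the structured result could be read roughly as follows. This is a sketch under assumptions: the five field names come from the paragraph above, but the shape of `usage` is not specified there, so it is kept opaque; the sample payload values and the `prompt` argument name are made up for illustration.

```python
import json
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class AskResponse:
    provider: str
    response: str
    model: str
    session_id: Optional[str]
    usage: Any  # shape not specified in the text above; left opaque


def parse_ask_response(payload: str) -> AskResponse:
    """Parse a JSON-encoded AskResponse (hypothetical client-side helper)."""
    data = json.loads(payload)
    return AskResponse(
        provider=data["provider"],
        response=data["response"],
        model=data["model"],
        session_id=data.get("sessionId"),
        usage=data.get("usage"),
    )


# Made-up sample payload; real values come from the MCP server.
sample = json.dumps({
    "provider": "ollama",
    "response": "Looks fine.",
    "model": "qwen3.6:27b",
    "sessionId": "abc-123",
    "usage": {},
})

first = parse_ask_response(sample)

# Reuse the returned sessionId to continue the same conversation.
# "prompt" is an assumed argument name, not confirmed by the text above.
follow_up_args = {"prompt": "And the error handling?", "sessionId": first.session_id}
print(follow_up_args)
```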

Usage Examples

ask codex to review the changes in src/auth.ts for security issues
ask antigravity to debate this architecture plan in docs/design.md
ask ollama to explain src/config.ts (runs locally, no data sent anywhere)
ask gemini to summarize @. the current directory (1M+ context, @ is Gemini-only)
use multi-llm to compare what codex and gemini think about this approach
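
The last prompt relies on fanning one prompt out to several providers at once. The pattern can be sketched with a thread pool; this illustrates the general technique only, not the multi-llm implementation, and the provider callables below are stubs rather than real CLI calls.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict


def fan_out(prompt: str, providers: Dict[str, Callable[[str], str]]) -> Dict[str, str]:
    """Send the same prompt to every provider concurrently.

    Returns one entry per provider; a failure is recorded as an error
    string so one bad provider does not hide the other answers.
    """
    results: Dict[str, str] = {}
    with ThreadPoolExecutor(max_workers=max(1, len(providers))) as pool:
        futures = {name: pool.submit(call, prompt) for name, call in providers.items()}
        for name, future in futures.items():
            try:
                results[name] = future.result()
            except Exception as exc:
                results[name] = f"error: {exc}"
    return results


def failing_provider(prompt: str) -> str:
    # Stub that simulates a provider running out of quota.
    raise RuntimeError("quota")


stub_providers = {
    "codex": lambda p: f"codex reviewed: {p}",
    "gemini": failing_provider,
}

results = fan_out("is this approach sound?", stub_providers)
print(results)
```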

CLI Subcommands

The orchestrator binary (ask-llm-mcp) supports two CLI modes alongside the default MCP server:

# Interactive multi-provider REPL — switch providers, persist sessions, see usage live
npx ask-llm-mcp repl

# Diagnose your setup — Node version, PATH, provider CLI versions, env vars
npx ask-llm-mcp doctor          # human-readable
npx ask-llm-mcp doctor --json   # machine-readable, exit 1 on error

The REPL keeps a separate session per provider (/provider gemini, /provider codex, /new, /sessions, /usage) and inherits all executor behavior: quota fallback, stream-json output for Gemini, and native session resume.
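
Because doctor --json is machine-readable and exits with 1 on error, it can gate a script or CI step. Below is a minimal sketch assuming only those two facts; the structure of the JSON report is not documented above, so it is parsed but not inspected, and `interpret_doctor` and `run_doctor` are hypothetical helper names.

```python
import json
import subprocess
from typing import Any, Tuple


def interpret_doctor(returncode: int, stdout: str) -> Tuple[bool, Any]:
    """Map the doctor exit code and stdout to (healthy, parsed_report).

    The report is returned as-is; its fields are not assumed here.
    """
    try:
        report = json.loads(stdout)
    except json.JSONDecodeError:
        report = None
    return returncode == 0, report


def run_doctor() -> Tuple[bool, Any]:
    """Run the documented command and interpret its result."""
    proc = subprocess.run(
        ["npx", "ask-llm-mcp", "doctor", "--json"],
        capture_output=True,
        text=True,
        check=False,  # a non-zero exit is an expected outcome, not an exception
    )
    return interpret_doctor(proc.returncode, proc.stdout)


# Usage (requires Node.js and network access for npx):
#   healthy, report = run_doctor()
#   if not healthy:
#       raise SystemExit("ask-llm setup has problems; see the doctor report")
```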

Models

| Provider | Default | Fallback |
| --- | --- | --- |
| Gemini | gemini-3.1-pro-preview | gemini-3.5-flash (on quota) |
| Codex | gpt-5.5 | gpt-5.4-mini (on quota) |
| Ollama | qwen3.6:27b | none (local; errors if the model isn't pulled) |

Gemini and Codex automatically fall back to a lighter model on quota errors. Ollama runs locally and never substitutes a model — if the requested model isn't pulled, it returns a clear ollama pull error.
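
The fallback rule can be sketched as a single retry on a quota error. This shows the behavior described above rather than the project's executor: `QuotaError`, `ask_with_fallback`, and the `run` callback are hypothetical names, and how each CLI actually reports a quota failure is not covered here.

```python
from typing import Callable, Dict, Optional, Tuple


class QuotaError(Exception):
    """Hypothetical stand-in for a provider's quota failure."""


# Default and fallback models copied from the Models table; None means no substitution.
MODELS: Dict[str, Tuple[str, Optional[str]]] = {
    "gemini": ("gemini-3.1-pro-preview", "gemini-3.5-flash"),
    "codex": ("gpt-5.5", "gpt-5.4-mini"),
    "ollama": ("qwen3.6:27b", None),
}


def ask_with_fallback(
    provider: str, prompt: str, run: Callable[[str, str], str]
) -> Tuple[str, str]:
    """Try the default model; on a quota error retry once with the fallback.

    Returns (model_used, response). Providers without a fallback re-raise.
    """
    default, fallback = MODELS[provider]
    try:
        return default, run(default, prompt)
    except QuotaError:
        if fallback is None:
            raise
        return fallback, run(fallback, prompt)


def fake_run(model: str, prompt: str) -> str:
    # Stub runner: pretend the default Codex model is out of quota.
    if model == "gpt-5.5":
        raise QuotaError("quota exceeded")
    return f"{model}: {prompt}"


used_model, answer = ask_with_fallback("codex", "review this", fake_run)
print(used_model, "|", answer)
```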

Documentation

  • Docs site: lykhoyda.github.io/ask-llm
  • AI-readable: llms.txt | llms-full.txt

Contributing

Contributions are welcome! See open issues for things to work on.

License

MIT License. See LICENSE for details.

Disclaimer: This is an unofficial, third-party tool and is not affiliated with, endorsed, or sponsored by Google or OpenAI.


Configuration

  • OLLAMA_HOST: Ollama server address (default: http://localhost:11434)
  • GMCPT_TIMEOUT_MS: timeout for Ollama execution in milliseconds (default: 300000, i.e. 5 minutes)
  • GMCPT_LOG_LEVEL: log verbosity, one of debug, info, warn, error (default: warn)
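
To make the defaults concrete, here is a sketch of how a wrapper script might resolve these three variables. The variable names and defaults come from the list above; `load_config` and `OllamaBridgeConfig` are hypothetical, and rejecting an unknown log level is an assumption about sensible behavior, not a description of what the server does.

```python
import os
from dataclasses import dataclass
from typing import Mapping

VALID_LEVELS = ("debug", "info", "warn", "error")


@dataclass(frozen=True)
class OllamaBridgeConfig:
    host: str
    timeout_ms: int
    log_level: str


def load_config(env: Mapping[str, str] = os.environ) -> OllamaBridgeConfig:
    """Resolve the documented variables, falling back to the documented defaults."""
    level = env.get("GMCPT_LOG_LEVEL", "warn")
    if level not in VALID_LEVELS:
        # Assumption: fail loudly instead of guessing a level.
        raise ValueError(f"unsupported GMCPT_LOG_LEVEL: {level!r}")
    return OllamaBridgeConfig(
        host=env.get("OLLAMA_HOST", "http://localhost:11434"),
        timeout_ms=int(env.get("GMCPT_TIMEOUT_MS", "300000")),
        log_level=level,
    )


defaults = load_config({})
custom = load_config({"GMCPT_TIMEOUT_MS": "60000", "GMCPT_LOG_LEVEL": "debug"})
print(defaults)
print(custom)
```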

Categories: AI & LLM Tools
Registry: active
Package: ask-ollama-mcp
Transport: STDIO
Updated: Jun 10, 2026
