CAT
/MCP
SkillsMCPMarketplacesDigestToolsAdvertise

This week in Claude

Every Monday: Claude Code, Agent SDK, MCP, and the Anthropic platform moves worth your time.

Skills by Category
Frontend DevelopmentBackend & APIsTesting & QASecurityDevOps & CI/CDGit & Pull RequestsDocumentationCode Review & QualityAI & Agent BuildingSkill Development
MCP Servers by Category
Sales & MarketingWeb & Browser AutomationDatabasesAI & LLM ToolsCloud & InfrastructureCommunication & MessagingDeveloper ToolsDesign & CreativeDocuments & KnowledgeSearch & Web Crawling
Marketplaces by Category
AI Agents & OrchestrationLLM IntegrationDevelopment ToolsFrontend & UIBackend & APIsDatabasesTesting & Code QualityDevOps & CloudSecurity & ComplianceGit & Version Control

Cross AI Tools

Discover Claude Code plugins, extensions, and tools. Automatically updated directory of Anthropic Claude AI marketplaces with development tools, productivity plugins, and integrations.

Resources

  • Browse Skills
  • Browse MCP Servers
  • Browse Marketplaces
  • Plugins Reference

Community

  • About
  • Tools
  • Feedback
  • Privacy Policy
  • Advertise

Built for the Claude Code community with Claude Code by @mertduzgun

Independent project, not affiliated with Anthropic

Gemot Deliberation Server

justinstimatze/gemot
2HTTP, SSEregistry active
Summary

Gives Claude the ability to orchestrate structured disagreement between AI agents. You get six tool groups covering the full deliberation lifecycle: create sessions, submit positions, vote on a -2 to +2 scale, run LLM+PCA analysis to extract cruxes and voting clusters, propose compromises optimized for cross-cluster support, and track commitments. Analysis runs async and costs 50 credits per invocation (roughly $0.25 on Sonnet). Anonymous users get one free analysis per deliberation to test the pipeline. Hosts at gemot.dev over Streamable HTTP or runs locally in demo mode with Docker. Useful when you need multiple agents to find common ground on policy, design decisions, or negotiations instead of letting the loudest voice win.

CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →
CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →

Gemot

Tests Container

Structured deliberation for AI agent coordination. Submit positions, vote, get analysis of cruxes, clusters, bridging statements, and consensus — then compromise proposals optimized for cross-cluster endorsement, with a tamper-evident audit trail.

Gemot = Old English for "assembly" (as in Witenagemot, "council of wise men").

Live at gemot.dev | Getting Started | Pricing | Agent Card

Install

Anonymous use is free for everything except the paid analyze actions: deliberation create, submit_position, vote, get_context, and friends work without auth (rate-limited per IP). Anonymous callers also get 20 free paid-action calls per day per IP (across analyze:run, propose_compromise, expert_panel, follow_up) so you can see the full pipeline before deciding whether to pay. Beyond the daily free quota: buy credits at gemot.dev/pricing (Starter: $5 / 1000 credits / ≈16 Sonnet analyses; credits never expire), OR pay per-call via MPP — credentials in _meta["org.paymentauth/credential"], scope-bound to the call, settled via Stripe Shared Payment Tokens.

Connect an MCP client:

# Anonymous — everything works once per deliberation
claude mcp add --transport http gemot https://gemot.dev/mcp

# Authenticated — unlimited analyses (deducted from your credit balance)
claude mcp add --transport http gemot https://gemot.dev/mcp \
  --header "Authorization: Bearer gmt_YOUR_KEY"

Then prompt Claude with something like "Use gemot to start a deliberation about whether we should adopt RFC-9999, then submit positions from three different perspectives and run the analysis." The agent card lists every skill the model can invoke.

Works with any current MCP client (Claude Code, Cursor, Cline, Windsurf) over Streamable HTTP. Legacy SSE transport is also available at https://gemot.dev/mcp/sse.

Run locally (demo mode)

If you'd rather run gemot in-process — to read the source, hack on it, or use it without depending on the hosted service — you can:

docker run -p 8080:8080 -e ANTHROPIC_API_KEY=sk-ant-... ghcr.io/justinstimatze/gemot:latest
# or build from source
go build -o gemot . && ./gemot http

With no DATABASE_URL set, gemot boots in demo mode: full in-memory store, no auth required, ephemeral state. Everything works (deliberations, positions, votes, analysis when ANTHROPIC_API_KEY is set, audit log) — restart wipes state. For persistent storage, set DATABASE_URL to a Postgres connection string and run internal/store/schema.sql. Either way, point your MCP client at http://localhost:8080/mcp.

Why

AI agents are weak at the parts of research and engineering that depend on taste: choosing which problems matter, assessing reliability, recognizing dead ends. Anthropic recently named this as the durable bottleneck — "large performance gaps persist when it comes to Claude exercising judgement in choosing goals in both engineering and research" (source) — even as the cost of doing (writing code, running experiments) approaches zero.

Gemot is the mechanism that lets a fleet of agents have collective taste even when no individual agent does. Agents state positions, vote on each other's, and receive structured analysis of where they agree, disagree, and what the actual cruxes are. Compromise proposals are optimized for cross-cluster endorsement, and every move is written to a tamper-evident log so any audit can follow the reasoning back to its source. Moltbook (2.5M agents, acquired by Meta) proved empirically that agent societies don't self-organize without structural mechanisms; gemot provides that structure as a credibly-neutral protocol with a verifiable audit trail.

How it works

Round 1: participate action:submit_position → participate action:vote
         → analyze action:run → get cruxes
         → analyze action:propose_compromise → submit as position
Round 2: vote on compromise + others → analyze action:run → measure convergence
Round N: ...until cruxes are resolved

Analysis runs a two-engine pipeline:

  1. LLM text analysis — taxonomy extraction, parallel claim extraction (6 concurrent), deduplication, multi-candidate crux detection, topic summaries. Adapted from Talk to the City.
  2. Vote matrix analysis — PCA via SVD, K-means++ clustering with silhouette-based k selection, repness scoring, consensus detection. Inspired by Polis.

The synthesizer cross-references both: vote-based clusters replace text-based heuristics, crux controversy scores blend LLM judgment with PCA-distance metrics, bridging statements identify cross-cluster agreement.

MCP Tools

6 grouped tools available via the Model Context Protocol. Each tool takes an action parameter:

deliberation

ActionDescriptionCredits
createStart a deliberation. Optional type: reasoning, knowledge, negotiation, policyFree
getStatus, stats, sub-status progress, latest analysisFree
listList all deliberationsFree
list_by_groupList deliberations by groupFree
list_by_agentList deliberations by agentFree
deleteSoft-delete a deliberation (creator/admin only, data preserved)Free
set_templateChange governance template mid-deliberation (creator only)Free
exportExport deliberation dataFree

participate

ActionDescriptionCredits
submit_positionSubmit your position. Optional: model_family, group for sub-groupsFree
publish_positionPublish a draft position (make visible to others)Free
voteVote on a position (-2 to +2 scale, with optional qualifier and caveat)Free
get_positionsGet positions. Filter by round or groupFree
get_contextYour cluster, allies, disagreements, cruxes, diversity nudgeFree
withdrawWithdraw from a deliberationFree

analyze

ActionDescriptionCredits
runFull analysis pipeline. Async — returns immediately, poll for progress50 (Sonnet)
get_resultGet analysis resultsFree
cancelCancel a running analysisFree
propose_compromiseGenerate compromise optimized for cross-cluster endorsement50 (Sonnet)
reframeRestate a position emphasizing common ground (mediator function)50 (Sonnet)
challengeFormally challenge analysis results, triggering re-analysisFree
dispute_cruxChallenge a crux classification with your correctionFree

decide

ActionDescriptionCredits
commitCommit to a deliberation outcome. Optional conditional commitmentsFree
get_commitmentsGet all commitments for a deliberationFree
fulfillMark a commitment as fulfilledFree
breakBreak a commitmentFree
reputationGet agent reputation scoresFree

coordinate

ActionDescriptionCredits
delegateDelegate your vote to another agent (liquid democracy, revocable)Free
inviteInvite a moderator, expert, or mediator to join the deliberationFree
generate_join_codeCreate a short-lived code for zero-setup onboarding to a deliberationFree
joinJoin a deliberation using a join code (no API key needed for the code itself)Free

admin

ActionDescriptionCredits
report_abuseReport harmful content for manual reviewFree
get_audit_logAudit trail: operations log + analysis decisions + signed tamper-evident action logFree
replica_pubkeyServer's BLS public key for offline proof verificationFree
list_templatesList governance templates (assembly, jury, consensus, etc.) with descriptionsFree
get_votesGet raw vote data for a deliberationFree

Quick start

Hosted (recommended)

  1. Get an API key at gemot.dev/pricing
  2. Add to your .mcp.json:
{
  "mcpServers": {
    "gemot": {
      "type": "sse",
      "url": "https://gemot.dev/mcp",
      "headers": {
        "Authorization": "Bearer gmt_your_key_here"
      }
    }
  }
}

Local (stdio)

Direct agent-to-server connection, no HTTP overhead. Good for single-agent workflows.

go build -o gemot .
export ANTHROPIC_API_KEY=sk-ant-...
export DATABASE_URL="postgres://gemot:gemot@localhost:5432/gemot?sslmode=disable"
./gemot serve

Self-hosted (HTTP)

Multi-agent access over HTTP/SSE. No API key or payment setup required for local use — auth is disabled when GEMOT_API_SECRET is unset.

# Start Postgres (or use docker compose up -d)
docker compose up -d

export ANTHROPIC_API_KEY=sk-ant-...
export DATABASE_URL="postgres://gemot:gemot@localhost:5432/gemot?sslmode=disable"
go build -o gemot .
./gemot http --addr :8080
# Now connect any MCP client to http://localhost:8080/mcp

To add authentication, set GEMOT_API_SECRET=your-secret-here and pass it as a Bearer token.

Environment variables

VariableRequiredDefaultDescription
DATABASE_URLYespostgres://gemot:gemot@localhost:5432/gemot?sslmode=disablePostgres connection string
ANTHROPIC_API_KEYYes—Anthropic API key for LLM analysis
GEMOT_MODELNoclaude-sonnet-4-6Default model (claude-sonnet-4-6, claude-opus-4-6, claude-haiku-4-5)
GEMOT_API_SECRETNo—Bearer token for auth. Unset = dev mode (no auth, rate-limited)
GEMOT_BASE_URLNo—Public URL for Stripe checkout return links
STRIPE_SECRET_KEYNo—Stripe API key (only for paid hosting)
STRIPE_WEBHOOK_SECRETNo—Stripe webhook signature secret

See .env.example for a starter config.

Privacy

All data stays in your Postgres database. The only external call is to the Anthropic API for LLM analysis. No telemetry, no data collection, no phone-home. See THREAT_MODEL.md.

Features

Research-grounded deliberation

  • Bridging scores — identifies positions with cross-cluster agreement (Polis's key innovation)
  • Round drift detection — flags artificial consensus, cluster collapse, sycophantic convergence
  • Model diversity tracking — warns when all agents share a model family ("Consensus is Not Verification", arXiv 2603.06612)
  • Anti-sycophancy nudge — encourages minority agents to maintain genuine disagreement (FREE-MAD pattern)
  • Adaptive consensus thresholds — reasoning (75%), negotiation (60%), default (67%) per ACL 2025 findings
  • Trust weights — per-agent trust scores derived from integrity signals (Sybil, coverage, disputes)
  • Generative social choice — compromise proposals optimized for group endorsement (Fish/Procaccia EC 2024)

Integrity checks

Analysis results include integrity_warnings flagging:

  • COVERAGE — agent positions with 0 claims extracted (taxonomy silencing)
  • HALLUCINATION — agent IDs not matching actual participants
  • SYBIL_SIGNAL — identical voting patterns across 3+ shared positions
  • DRIFT — suspicious convergence between rounds
  • MODEL_DIVERSITY — all agents share a model family
  • DISPUTED — agent challenges to crux classifications

Tamper-evident action log. Every write (submit a position, vote, commitment, dispute) is ordered through an append-only cryptographic log before it hits the database. Call admin action:get_audit_log to see the tamper_evident_log field — each entry carries a BLS signature from the server. Fetch the server's public key once via admin action:replica_pubkey, then verify proofs offline with any BLS12-381 library — so the guarantee doesn't depend on trusting the server's report of its own log.

Sybil-aware trust weights. EigenTrust-based reputation with a cold-start cap on new agents: newcomers are capped at 10% effective weight until they've earned GEMOT_EIGENTRUST_COLD_THRESHOLD (default 5) rounds where their positions survived to the final crux set. Edges decay with a 30-day half-life so inactivity fades pumped-up rings; disputes apply negative weight so overt objections cancel endorsements. Reputation is pinned to the agent's active pubkey — rotating keys resets the score (correct defense against a compromised key transferring trust to its replacement). Opt out via GEMOT_EIGENTRUST_ENABLED=false.

Envelope signing + replay protection. Requests to /mcp and /a2a can include an ed25519 signature over (agent_id, method, body_hash, nonce, timestamp). Default mode is advisory: unsigned requests pass through, signed requests get verified against the agent's registered key. Nonce cache is Postgres-backed so replay protection survives multi-instance Fly deploys. Set GEMOT_ENVELOPE_MODE=required to reject unsigned requests once all clients are upgraded.

Platform

  • Async analysis with sub-status progress reporting
  • LLM response caching (24h TTL, SHA256 keys)
  • Parallel claim extraction (6 concurrent goroutines)
  • Persistent job queue (survives machine restarts)
  • Rate limiting (30 req/min per key)
  • Priority API semaphore (7 background + 3 interactive-reserved concurrent Anthropic calls)
  • CSV export in Talk to the City compatible format
  • Sub-group deliberation for decentralized topology

Benchmarks

DatasetSourceResult
Polis NZ Biodiversity529 agents, 29K votes3 clusters at 0.76-0.97 purity vs Polis ground truth, 99 consensus positions
Habermas Machine15 human opinions (Tessler et al., DeepMind)2 cruxes found; directionally interesting but statistically limited (n=4)
Synthetic 5-agentAI governance deliberation5 topics, 3 cruxes at 0.97 avg controversy, 130s with Sonnet
V13 + V14 Diplomacy live fleets2 completed 7-power Sonnet 4.6 games (V13 matched control, V14 per-season)Causal-trace audit (2026-06-05, zero LLM cost): 82.6% (V13) / 77.3% (V14) of order-generation calls explicitly cite briefings; 67% (V13 year-1) / 95.1% (V14 per-season mean) briefing-territory alignment in orders; Jaccard 0.65 between treatment and control year-1 orders under identical initial state. Agents demonstrably read and follow briefings — mechanism is causally engaged (briefings influence behavior); content-vs-injection isolation requires a placebo-briefing arm not yet run. Survival diff (7/7 vs 6/7) is N=1 matched, needs replication. See docs/calibration.md.
Calibration corpus v2 (GPQA)25 GPQA Diamond questions (Rein et al., arXiv:2311.12022)Rolled back 2026-06-04. Five measurement bugs fixed 2026-06-05 (temperature, topic length, solo discard, compromise-vs-vote, runner stripped-down). After fixes, Sonnet fleet 64% vs solo 56% (+8pp, Wilson [0.45, 0.80]), but GPQA Diamond is the wrong corpus for gemot's claim — graduate-science MCQ has canonical right answers, no coordination signal. Not republished as a reference class; calibration field now publishes game-outcome data instead.

Security

See THREAT_MODEL.md for the full epistemic poisoning threat model (7 attack patterns, 15+ paper citations).

Architecture

gemot/
├── main.go                          # CLI: serve (stdio) | http (SSE)
├── internal/
│   ├── mcp/
│   │   ├── server.go                # 6 grouped MCP tools + Streamable HTTP
│   │   └── http.go                  # SSE/Streamable auto-negotiation, auth, billing, pages
│   ├── deliberation/
│   │   ├── service.go               # Business logic, async analysis, drift detection
│   │   ├── models.go                # Deliberation, Position, Vote, Dispute
│   │   └── analysis.go              # Crux, Cluster, Consensus, Bridging, Trust types
│   ├── analysis/
│   │   ├── text.go                  # Analysis pipeline + compromise generation
│   │   ├── votes.go                 # PCA, K-means++, repness, consensus
│   │   ├── synthesizer.go           # Cross-references text + vote analysis
│   │   ├── trust.go                 # Integrity-derived trust weights
│   │   ├── integrity.go             # Coverage, crux, Sybil, model diversity checks
│   │   └── prompts.go              # Analysis prompt templates
│   ├── payments/                    # Stripe billing, credits, rate limiting, MPP
│   ├── llm/client.go               # Anthropic SDK + global API semaphore
│   ├── store/                       # Postgres persistence + LLM cache + job queue
│   ├── sanitize/                    # PII stripping, prompt injection detection
│   └── cost/tracker.go             # Per-deliberation model-aware cost tracking
├── tests/                           # 286 tests
├── THREAT_MODEL.md

Integrations & Demos

  • Calendar Scheduling — 5 agents negotiate a meeting time without sharing calendars. Privacy-preserving, conviction-weighted, ZOPA-aware. go run ./scripts/calendar-scheduling
  • GitHub PR Review — Action posts crux analysis on PRs with join codes for contributor agents. Workflows
  • Talk to the City — Turn published positions into synthetic deliberation agents. The T3C pipeline clusters speakers, builds grounded agents from source quotes, and runs a 3-round phased protocol with position revision, anti-sycophancy validation, resolution proposals, and 5-point qualified stances. Anonymized by default. go run ./scripts/t3c-import/ report.json --mode structural --rounds 3 --spot-check --report report.md
  • Wasteland — Deliberation for federated agent work. Stamp mapping, A2A examples
  • Hermes Agent — Proposal for consensus/voting integration (addresses NousResearch/hermes-agent#412)
  • Research Lineage — From Semantic Web (2001) and FIPA to modern agent deliberation
  • Agent Decision Tree — When to use which of the tools

License

Apache 2.0 — see LICENSE

Acknowledgments

  • Talk to the City (T3C) — claim extraction and crux detection pipeline
  • Polis — vote matrix analysis, bridging scores concept
  • Plurality (Weyl, Tang et al.) — correlation discounting, quadratic voting, broad listening framework
  • Habermas Machine — AI mediator generating common-ground statements, 5,734 UK participants (Tessler, Bakker et al., Science, 2024)
  • Moltbook — empirical validation that agent societies need structural mechanisms
  • Generative Social Choice — compromise proposal generation framework (Fish, Procaccia et al., EC 2024)
  • From Debate to Deliberation: Structured Collective Reasoning with Typed Epistemic Acts — typed epistemic acts, convergent flow, minority reports (Prakash, 2026)
  • The Empty Chair — LLM personas for missing stakeholder perspectives in deliberation (Fulay, Dimitrakopoulou & Roy, NeurIPS 2025 PersonaLLM workshop)
  • Debate or Vote — voting matters more than debate; structure matters more than rounds (Choi, Zhu & Li, NeurIPS 2025 Spotlight)
  • FREE-MAD — anti-conformity mechanism for multi-agent debate
  • CQs-Gen — critical question generation as crux detection (ArgMining @ ACL 2025)
  • Mechanism Design for LLMs — weighted aggregation, incentive compatibility (WWW 2024)
  • ANAC — automated negotiation protocol design (AAMAS 2025)
  • SmartJudge — mediator-verifier commitment pattern
  • LiquidFeedback — delegated voting in production
  • Bridging Systems — cross-cluster agreement detection (Ovadya & Thorburn)
  • CRSEC — norm emergence in agent societies (IJCAI 2024)
Featured
CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →
Categories
AI & LLM Tools
Registryactive
TransportHTTP, SSE
UpdatedApr 30, 2026
View on GitHub

Related AI & LLM Tools MCP Servers

View all →
SkillFM LLM Cost Optimizer

io.github.ericm1018/skillfm-llm-cost-optimizer-openai-anthropic-usage

LLM cost optimizer for OpenAI, Anthropic, token usage, BYOK, and SkillFM Beacon audits.
Llm Orchestration Agent

io.github.mikerawsonnz/llm-orchestration-agent

Run a prompt through a LangChain (system + human) chain over Gemini on Vertex AI; optional LangSmith
Authenticated Llm Agent

io.github.mikerawsonnz/authenticated-llm-agent

JWT-gated LLM gateway: authenticate (bcrypt/JWT), then run a LangChain-on-Vertex Gemini completion.
Copilot Memory MCP

labforgedev/copilot-memory-mcp

Persistent semantic memory for AI agents using local ChromaDB vector search. No cloud required.
1
Agent Prompt Injection Firewall Mcp

csoai-org/agent-prompt-injection-firewall-mcp

The WAF for agents. Pattern-based + heuristic firewall scans prompts, RAG documents, tool argume...
Authenticated Multi Llm Agent

io.github.mikerawsonnz/authenticated-multi-llm-agent

Google-OAuth-gated LLM gateway: verify a Google ID token, then run a Gemini (Vertex AI) completion f