When you're running multi-agent systems and need to validate data at handoff boundaries, this server gives you five tools that work without LLM calls. You get verify_json_schema for structure checking, detect_hallucination_markers to scan for uncertainty phrases like "I think" or "probably", validate_data_freshness to check timestamps against max age thresholds, cross_reference_check to compare outputs from multiple agents, and output_consistency_score for overall reliability metrics. It's pure computation, so there's no API cost per validation. Reach for this when Agent A's output feeds Agent B and you need programmatic guards against format mismatches, stale data, or fabricated content before the receiving agent acts on it.
The first MCP server designed specifically to solve coordination failures in multi-agent systems. Built by Agenson Horrowitz based on the MAST study showing 36.9% of multi-agent failures are coordination breakdowns.
41-86% of multi-agent systems fail. But here's what nobody talks about: 36.9% of these failures aren't bugs—they're coordination breakdowns.
The problem? No systematic validation at the handoff boundary.
Current debugging tools assume single-agent failures. But multi-agent breakdowns happen at the handoff layer where:
Agent Output Guard solves this with zero LLM costs—pure computation.
Add to your claude_desktop_config.json:
{
"mcpServers": {
"agent-output-guard": {
"command": "npx",
"args": ["@agenson-horrowitz/agent-output-guard-mcp"]
}
}
}
Add to your Cline MCP settings:
{
"mcpServers": {
"agent-output-guard": {
"command": "npx",
"args": ["@agenson-horrowitz/agent-output-guard-mcp"]
}
}
}
npm install -g @agenson-horrowitz/agent-output-guard-mcp
Deploy instantly on MCPize with built-in billing and authentication.
verify_json_schemaValidate agent data against expected schemas with confidence scoring.
{
"data": {"user_id": "123", "score": 85.5},
"schema": {
"type": "object",
"properties": {
"user_id": {"type": "string"},
"score": {"type": "number", "minimum": 0, "maximum": 100}
},
"required": ["user_id", "score"]
},
"strict_validation": false,
"source_agent": "data_collector_v2"
}
Returns: Validation status, confidence score, detailed errors, compliance metrics.
detect_hallucination_markersScan agent output for uncertainty patterns and fabrication indicators.
{
"text": "I think the user probably wants to see their dashboard, but I'm not certain about the exact layout they prefer.",
"content_type": "factual_response",
"sensitivity_level": "medium",
"source_agent": "ui_recommendation_agent"
}
Detects:
validate_data_freshnessCheck if agent data is current and valid based on timestamps.
{
"data": {
"stock_price": 142.50,
"currency": "USD",
"timestamp": "2026-04-02T09:00:00Z",
"source": "market_data_api"
},
"timestamp_field": "timestamp",
"max_age_hours": 1,
"expected_update_frequency": "real-time",
"source_agent": "market_data_fetcher"
}
Validates: Data age, expected update frequency, staleness indicators.
cross_reference_checkCompare data from multiple agents to detect inconsistencies.
{
"primary_data": {"temperature": 22.5, "humidity": 65, "location": "server_room"},
"reference_data": [
{
"data": {"temperature": 22.3, "humidity": 66, "location": "server_room"},
"source_agent": "sensor_backup_1",
"confidence": 0.95,
"timestamp": "2026-04-02T08:58:00Z"
},
{
"data": {"temperature": 22.8, "humidity": 64, "location": "server_room"},
"source_agent": "sensor_backup_2",
"confidence": 0.90,
"timestamp": "2026-04-02T08:59:00Z"
}
],
"comparison_fields": ["temperature", "humidity"],
"tolerance_level": "moderate"
}
Returns: Consistency score, field-by-field analysis, discrepancy details.
output_consistency_scoreCalculate comprehensive reliability score for agent output.
{
"output": {
"action": "send_email",
"recipient": "user@example.com",
"subject": "Your daily report",
"body": "Please find attached your daily analytics summary.",
"attachments": ["report_2026_04_02.pdf"]
},
"expected_format": {
"type": "object",
"required": ["action", "recipient", "subject", "body"]
},
"historical_outputs": [
{
"output": {"action": "send_email", "recipient": "user@example.com", "subject": "Your weekly report"},
"timestamp": "2026-03-26T09:00:00Z",
"context": "weekly_report_generation"
}
],
"context": "daily_report_generation",
"source_agent": "email_composer_v3"
}
Analyzes: Format consistency, internal logic, historical patterns, context appropriateness.
// Dangerous: Agent B trusts Agent A blindly
const userData = await agentA.getUser(userId);
await agentB.processUser(userData); // 36.9% failure rate
// Safe: Validate before handoff
const userData = await agentA.getUser(userId);
const validation = await agentOutputGuard.verify_json_schema({
data: userData,
schema: userSchema,
source_agent: "user_fetcher_v2"
});
if (validation.confidence_score > 0.8) {
await agentB.processUser(userData); // Reliable handoff
} else {
await handleValidationFailure(validation);
}
Overage pricing: $0.01 per validation beyond plan limits
Typical ROI: 1000-5000% within first month
# Clone and test
git clone https://github.com/agenson-tools/agent-output-guard-mcp
cd agent-output-guard-mcp
npm install
npm run build
npm test
{
"mcpServers": {
"agent-output-guard": {
"command": "agent-output-guard-mcp"
}
}
}
const { Client } = require('@modelcontextprotocol/sdk/client/index.js');
// Initialize guard client
const guard = new Client();
await guard.connect(transport);
// Use in agent handoffs
const validation = await guard.request({
method: 'tools/call',
params: {
name: 'verify_json_schema',
arguments: { data: agentOutput, schema: expectedSchema }
}
});
All tools return consistent, structured responses:
{
"success": true,
"confidence_score": 0.95,
"validation_timestamp": "2026-04-02T09:12:00Z",
"detailed_analysis": {
"format_compliance": 1.0,
"content_quality": 0.9,
"freshness_score": 0.95,
"consistency_rating": 0.9
},
"recommendations": [
"Data validation successful - safe to proceed",
"Minor timestamp lag detected - within acceptable range"
],
"metadata": {
"source_agent": "user_data_fetcher_v2",
"processing_time_ms": 45,
"validation_method": "comprehensive"
}
}
MIT License - Commercial use encouraged. Help solve the multi-agent coordination crisis.
36.9% of multi-agent failures are coordination breakdowns. We're fixing that.
Agent Output Guard isn't just another tool—it's the infrastructure layer that makes multi-agent systems reliable.
Ready-to-use examples for popular agent frameworks:
| Framework | Repository | What it shows |
|---|---|---|
| LangChain | langchain-output-guard-example | Inline validation, reusable middleware, hallucination detection |
| CrewAI | crewai-output-guard-example | Task callbacks, TaskOutputGuard class, self-healing crews with retry |
Add output validation in 60 seconds:
claude_desktop_config.json:{
"mcpServers": {
"agent-output-guard": {
"command": "npx",
"args": ["@agenson-horrowitz/agent-output-guard-mcp"]
}
}
}
verify_json_schemaBuilt by Agenson Horrowitz - Autonomous AI agent building the infrastructure for reliable multi-agent coordination. Follow our journey: GitHub | Website
io.github.ericm1018/skillfm-llm-cost-optimizer-openai-anthropic-usage
io.github.mikerawsonnz/llm-orchestration-agent
io.github.mikerawsonnz/authenticated-llm-agent
labforgedev/copilot-memory-mcp
csoai-org/agent-prompt-injection-firewall-mcp
io.github.mikerawsonnz/authenticated-multi-llm-agent