CAT
/MCP
SkillsMCPMarketplacesDigestToolsAdvertise

This week in Claude

Every Monday: Claude Code, Agent SDK, MCP, and the Anthropic platform moves worth your time.

Skills by Category
Frontend DevelopmentBackend & APIsTesting & QASecurityDevOps & CI/CDGit & Pull RequestsDocumentationCode Review & QualityAI & Agent BuildingSkill Development
MCP Servers by Category
Sales & MarketingWeb & Browser AutomationDatabasesAI & LLM ToolsCloud & InfrastructureCommunication & MessagingDeveloper ToolsDesign & CreativeDocuments & KnowledgeSearch & Web Crawling
Marketplaces by Category
AI Agents & OrchestrationLLM IntegrationDevelopment ToolsFrontend & UIBackend & APIsDatabasesTesting & Code QualityDevOps & CloudSecurity & ComplianceGit & Version Control

Cross AI Tools

Discover Claude Code plugins, extensions, and tools. Automatically updated directory of Anthropic Claude AI marketplaces with development tools, productivity plugins, and integrations.

Resources

  • Browse Skills
  • Browse MCP Servers
  • Browse Marketplaces
  • Plugins Reference

Community

  • About
  • Tools
  • Feedback
  • Privacy Policy
  • Advertise

Built for the Claude Code community with Claude Code by @mertduzgun

Independent project, not affiliated with Anthropic

Superlocalmemory

varun369/superlocalmemoryv2
STDIOregistry active
Summary

A local-first memory layer that runs entirely on CPU without requiring Docker, graph databases, or API keys. It exposes 17+ MCP tools for storing and retrieving memories using hybrid search across SQLite, CozoDB, and LanceDB, plus a knowledge graph backend for relationship tracking. The math is notable: differential geometry and algebraic topology replace the LLM-based extraction most memory systems depend on, scoring 74.8% on LoCoMo benchmarks in zero-LLM mode versus Mem0's 64.2% baseline. Memories decay naturally based on usage, compress adaptively to save space, and consolidate into reusable patterns over time. Reach for this when you need Claude to remember context across sessions without sending data to the cloud, or when you're building EU AI Act compliant workflows that can't touch hosted services.

CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →
CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →

SuperLocalMemory

SuperLocalMemory V3.6.18

Cache. Compress. Remember. Three surfaces — proxy, MCP tools, or skill. Every setup covered.
To the best of our knowledge, the only zero-cloud agent memory that beats Mem0's zero-LLM score on LoCoMo. Mode A: 74.8% vs Mem0 64.2% — no GPU, no API key, on CPU.

v3.6.18 — Plugin-native. Profile-aware. Distributed-ready.
Proxy: slm wrap claude  ·  MCP: add slm_compress to your config  ·  Skill: zero-config

3 published research papers (arXiv preprints + Zenodo-archived) · arXiv:2603.02240 · arXiv:2603.14588 · arXiv:2604.04514

arXiv Paper Three Surfaces: Proxy, MCP Tools, Skill PyPI npm AGPL v3 EU AI Act Design Compliant Website MCP Native CLI Agent-Native Multilingual 30+ Languages


Why SuperLocalMemory?

Every hosted AI memory platform — Mem0 Cloud, Zep Cloud, Letta Cloud, EverMemOS Cloud — sends your data to cloud LLMs by default. Self-hosted variants exist but require Docker, a separate graph DB, or Ollama config, and most default to OpenAI until you flip env vars. After August 2, 2026, any of those cloud paths becomes a compliance question under the EU AI Act.

SuperLocalMemory V3 uses mathematics instead of cloud compute — differential geometry, algebraic topology, and stochastic analysis replace the work other systems need LLMs to do. Local-first out of the box. No Docker. No graph DB. No API keys. CPU-only.

Benchmark results (evaluated on LoCoMo, the standard long-conversation memory benchmark, published April 2026):

SystemScoreConfigCloud LLM required?Open SourceSource
EverMemOS93.05%Cloud (proprietary)YesCore onlyevermind.ai (Feb 2026)
Hindsight (LoComo10)92.0%CloudYesNobenchmarks.hindsight.vectorize.io (Apr 2026)
Mem0 (token-efficient)91.6%Hybrid (Cohere/OpenAI)YesPartialmem0.ai blog (Apr 16 2026)
SLM V3 Mode C87.7%Local + optional LLMOptional (Ollama OK)Yes (AGPL-3.0)In-house, arXiv:2603.14588
Zep v3 Cloud85.2%CloudYesCommunity deprecatedgetzep.com
SLM V3 Mode A74.8%Local, CPU-only, zero-LLMNoYes (AGPL-3.0)In-house, arXiv:2603.14588
Mem0 (zero-retrieval-LLM)64.2%Local baselineNoPartialMem0 paper, zero-LLM row

How to read this table. Scores from different papers use different LoCoMo splits, judge models, and prompt variants. We do NOT claim these numbers are apples-to-apples across rows. Rows marked "In-house" were run by us; cited rows link to the vendor's public source and date. The only apples-to-apples comparison is Mode A 74.8% vs Mem0 zero-retrieval-LLM 64.2% (+10.6pp) — both are zero-LLM configurations. Mem0's 91.6% and EverMemOS's 93.05% use cloud LLMs; Mode C uses a local LLM (Ollama).

What Mode A is: CPU-only, SQLite-only, zero-LLM retrieval on published LoCoMo questions. To the best of our knowledge it is the only publicly-released local-first memory that clears Mem0's zero-LLM baseline on this benchmark. If another fully-local system hits similar numbers, please open an issue so we can update this table.

Mathematical layers contribute +12.7 percentage points average across 6 conversations (n=832 questions), with up to +19.9pp on the most challenging dialogues.


Quick Start

# npm (recommended)
npm install -g superlocalmemory
slm setup       # Choose mode (A/B/C)
slm doctor      # Verify everything is working
# pip
pip install superlocalmemory
slm setup
slm doctor
# First use
slm remember "Alice works at Google as a Staff Engineer"
slm recall "What does Alice do?"
slm status
# Wrap your agent — starts proxy + sets environment + launches agent
slm wrap claude
# Your first repeat prompt → CACHE HIT → $0.00
# See savings: slm optimize savings --since 1

Upgrading: pip install -U superlocalmemory && slm restart && slm doctor — migration is automatic, no data loss.


Three Pillars

Memory

Five-channel hybrid retrieval: Semantic (Fisher-Rao geodesic distance) + BM25 + Entity Graph + Temporal + Hopfield (associative/partial-query completion). RRF fusion, cross-encoder reranking, adaptive LightGBM ranking. All data stays local — SQLite + optional LanceDB/CozoDB.

Three mathematical contributions replace cloud LLM dependency:

  1. Fisher-Rao Retrieval Metric — similarity scoring from the Fisher information structure of diagonal Gaussian families. To the best of our knowledge, the first public application of information geometry to agent memory retrieval.
  2. Sheaf Cohomology for Consistency — algebraic topology detects contradictions via coboundary norms on the knowledge graph.
  3. Riemannian Langevin Lifecycle — memory positions evolve on the Poincare ball; neglected memories self-archive, no hardcoded thresholds.

Auto-capture hooks (slm hooks install) fire only on real signals — topic pivot, web call, file edit — never on a timer. Fail-open, <10ms p99 hot path.

Multi-scope memory (v3.6.15, opt-in): keep memories personal (default), shared with named profiles, or global across the machine. Off by default — recall only ever returns your own facts until you turn sharing on, per call or in config. See docs/shared-memory.md.

Multilingual: plug in any OpenAI-compatible embedding endpoint — Ollama, vLLM, LiteLLM, bge-m3, multilingual-e5, Qwen3-Embedding. The math layer is language-agnostic; 30+ languages work at full retrieval quality. No cloud dependency, no code changes.

Cache + Compress

One engine, three ways in — choose the surface that fits your setup:

SurfaceHow you use itRequires proxy?Window effectCache scope
A — Proxyslm wrap claude or ANTHROPIC_BASE_URL=http://127.0.0.1:8765YesShrinksFull-turn cache — every call
B — MCP toolsAdd 5 tools to MCP config; call slm_compress, slm_cache_set/getNoPreserved (1M)Results you explicitly route through SLM
C — SkillCopy skills/slm-optimize/SKILL.md → ~/.claude/skills/NoPreserved (1M)Auto-applied by the agent per skill rules

The hard constraint: The primary Claude conversation turn cannot be cached without a proxy. The MCP/skill path caches results you explicitly route through SLM (tool outputs, file reads, sub-model calls) — without a proxy the main conversation turn is not intercepted.

How to choose:

  • Metered API (pay-per-token), want every call cached → Proxy (A)
  • Pro/Max/Team subscription or any plan where you won't run a proxy → MCP tools (B) or Skill (C)
  • Zero configuration → Skill (C): install once, auto-compresses CLAUDE.md and large outputs
  • Agent-controlled caching of repeated file reads → MCP tools (B)

Cache: exact-match SQLite lookup (SHA-256, zero false hits) + vCache-gated semantic (opt-in). 100% cost saved on a hit (input + output tokens).

Compress: safe mode = lossless normalization (JSON/code/tool outputs, 60-95% fewer tokens); aggressive mode = LLMLingua-2 prose only (opt-in). CCR stores originals for byte-exact reversal. Anthropic 90% / OpenAI 50% prefix-cache discount alignment included. [CITATION-NEEDED-ONLINE: live provider prefix-cache discount rates]

Savings dashboard: slm optimize savings --since 7 — live USD/INR/tokens saved. Hot-reload config, fail-open.

Mesh

Run SLM on multiple machines and have agents coordinate as one team — no external broker, no Docker. HTTP-based sync every 30s, mDNS discovery (SLM_MESH_DISCOVERY=on), graceful offline queue.

# Machine A (broker)
export SLM_MESH_HOST=192.168.1.100
export SLM_MESH_SHARED_SECRET=my-secret-key
slm init

# Machine B (client)
export SLM_MESH_PEER_URL=http://192.168.1.100:8765
export SLM_MESH_SHARED_SECRET=my-secret-key
slm init

8 mesh MCP tools: mesh_peers, mesh_send, mesh_broadcast, mesh_project, mesh_inbox, mesh_pending, mesh_state, mesh_lock.

Full docs: docs/multi-machine.md · docs/distributed-deployment.md


Install Paths

PathCommandWhen
npm (recommended)npm install -g superlocalmemoryNode 14+, installs Python deps automatically
pippip install superlocalmemoryPython 3.11+, direct install
Claude Code Plugin (WP-06)/plugin marketplace add qualixar/superlocalmemory then /plugin install superlocalmemory@qualixarSelf-bootstraps venv, isolated SLM_DATA_DIR, additive — 14-tool core. Ships the skills/agents/hooks/commands
Portable / IDE connect (WP-08)slm connect <ide> [--here]Wire any IDE without reinstalling; slm connect claude-code → plugin pointer

After any install path: slm setup → slm doctor → slm warmup (optional, pre-downloads ~500MB embedding model).

ComponentSizeWhen
Core libraries (numpy, scipy, networkx)~50MBDuring install
Dashboard & MCP server (fastapi, uvicorn)~20MBDuring install
Learning engine (lightgbm)~10MBDuring install
Search engine (sentence-transformers, torch)~200MBDuring install
Embedding model (nomic-embed-text-v1.5, 768d)~500MBFirst use or slm warmup
Mode B requires Ollama + a model (ollama pull llama3.2)~2GBManual

MCP + Profiles

SLM supports two MCP transports:

HTTP (recommended, v3.6.7+):

{ "mcpServers": { "superlocalmemory": { "type": "http", "url": "http://127.0.0.1:8765/mcp/" } } }

Or: claude mcp add --transport http superlocalmemory http://127.0.0.1:8765/mcp/

stdio (universal fallback):

{ "mcpServers": { "superlocalmemory": { "command": "slm", "args": ["mcp"] } } }

MCP Profiles (WP-01)

Control tool surface via SLM_MCP_PROFILE:

ProfileToolsUse case
core14 (default)14Memory core — remember, recall, forget, session_init, + mesh
mesh88Mesh-only — multi-machine coordination
full3838Core + optimize + evolution + trust
power5050Full38 + admin + ingestion + compliance
whole8181Every tool (SLM_MCP_ALL_TOOLS=1)

Precedence: ALL > TOOLS > PROFILE > default

export SLM_MCP_PROFILE=full38   # or core14 / mesh8 / power50 / whole81
slm mcp

Per-IDE configs available for Claude Code, Cursor, Windsurf, VS Code Copilot, Continue, Gemini CLI, JetBrains, Zed, and more (15 configs in ide/configs/). See docs/ide-setup.md.


Claude Code Plugin

Install directly in Claude Code — no system-level npm/pip needed. This is how you get the skills, agents, hooks, commands, and rules (the MCP server is bootstrapped automatically). It is a two-step flow — add the marketplace once, then install:

# 1. Add the Qualixar marketplace (one-time — the repo IS the marketplace)
/plugin marketplace add qualixar/superlocalmemory

# 2. Install the plugin
/plugin install superlocalmemory@qualixar
  • Self-bootstraps a Python venv, installs all deps in an isolated SLM_DATA_DIR
  • Registers the 14-tool core MCP surface (core14 profile by default)
  • Ships the SLM skills / agents / hooks / commands / rules
  • Additive — does not replace an existing SLM install
  • slm connect claude-code detects an existing plugin install and links them

Plugin vs pip/npm: pip install superlocalmemory / npm i -g superlocalmemory give you the slm CLI + the MCP server (the tools). The skills/agents/hooks/ commands come only through the plugin above. Use the plugin for Claude Code; use pip/npm for the CLI or other IDEs.

To update later: /plugin marketplace update qualixar then /plugin install superlocalmemory@qualixar.


Modes + EU AI Act

ModeWhatCloud?EU AI ActBest For
ALocal GuardianNoneCompliantPrivacy-first, air-gapped, enterprise
BSmart LocalLocal only (Ollama)CompliantBetter answers, data stays local
CFull PowerCloud LLMPartialMaximum accuracy, research
slm mode a   # Zero-cloud (default)
slm mode b   # Local Ollama
slm mode c   # Cloud LLM

Mode A is, to the best of our knowledge, the only publicly-released agent memory that runs with zero cloud calls while clearing Mem0's published LoCoMo score. All data stays on your device. No API keys. No GPU. Runs on 2 vCPUs + 4GB RAM.

The EU AI Act (Regulation 2024/1689) takes full effect August 2, 2026.

RequirementMode AMode BMode C
Data sovereignty (Art. 10)PassPassRequires DPA
Right to erasure (GDPR Art. 17)PassPassPass
Transparency (Art. 13)PassPassPass
No network calls during memory opsYesYesNo

To the best of our knowledge, no existing agent memory system addresses EU AI Act compliance by architectural design. Modes A and B pass all checks — no personal data leaves the device during any memory operation.

Built-in compliance tools: GDPR Article 15/17 export + complete erasure, tamper-proof SHA-256 audit chain, data provenance tracking, ABAC policy enforcement. See docs/compliance.md.


Advanced

TopicLink
Full optimize docsdocs/optimize-overview.md · docs/optimize-cli.md · docs/optimize-config.md
Distributed deploymentdocs/distributed-deployment.md
Multi-machine meshdocs/multi-machine.md
Auto-memory hooksdocs/auto-memory.md
Architecture + mathdocs/ARCHITECTURE.md
CLI referencedocs/cli-reference.md
MCP tools referencedocs/mcp-tools.md
Getting starteddocs/getting-started.md
IDE setup (15 configs)docs/ide-setup.md
Skill evolutiondocs/skill-evolution.md
V2 migrationdocs/migration-from-v2.md
Configurationdocs/configuration.md
Wikigithub.com/qualixar/superlocalmemory/wiki

Web dashboard:

slm dashboard    # Opens at http://localhost:8765

17-tab sidebar with Knowledge Graph (Sigma.js WebGL, community detection), Health Monitor, Entity Explorer, Mesh Peers, Ingestion Status, Privacy blur mode. Cross-platform: macOS + Windows + Linux.

Release history:

VersionCodenameKey Features
v3.6.17Community8 contributor PRs (observability events, marker-bounded adapter writes, daemon port discovery, anthropic api_base, OpenMP workers, atomic-write rehash, _jl sentinel, LFS pointer); dashboard-feedback fix (#53/#59); env-tunable SQLite knobs + idle backoff; remote LLM test-probe (#40)
v3.6.16DocsCorrected Claude Code plugin install — adds the required /plugin marketplace add step; clarifies plugin vs pip/npm delivery
v3.6.15Multi-scopeOpt-in shared memory (personal/shared/global, off by default), default-deny scope at every read path, recall scope-race fix, contributor PRs #42/#43/#44, fixes #46–#49
v3.6.14Plugin-nativeClaude Code Plugin (WP-06), MCP profiles (WP-01), IDE connect (WP-08), asset consolidation, UI polish (WP-12)
v3.6.xOptimize Everywhere / Distributed-readyThree surfaces (proxy/MCP/skill), SLM_REMOTE=1 LAN mode, remote dashboard, custom LLM endpoints
v3.5.0Scale-ReadyCozoDB/LanceDB, 6-channel recall <1s, Core Memory Block, context injection v2, score normalization
v3.4.xScale-Ready (foundation)Tiered storage, graph pruning, Hopfield channel, LightGBM ranking, mDNS mesh discovery
v3.3.xFoundationBM25Plus, Fisher-Rao, sqlite-vec, RRF fusion, cross-encoder rerank. 3 published papers

Research Papers

SuperLocalMemory is backed by three published research papers (arXiv preprints + Zenodo DOIs). These are preprints — not conference-accepted or journal-published yet.

Paper 3: The Living Brain (V3.3)

SuperLocalMemory V3.3: The Living Brain — Biologically-Inspired Forgetting, Cognitive Quantization, and Multi-Channel Retrieval for Zero-LLM Agent Memory Systems Varun Pratap Bhardwaj (2026) arXiv:2604.04514 · Zenodo DOI: 10.5281/zenodo.19435120

Paper 2: Information-Geometric Foundations (V3)

SuperLocalMemory V3: Information-Geometric Foundations for Zero-LLM Enterprise Agent Memory Varun Pratap Bhardwaj (2026) arXiv:2603.14588 · Zenodo DOI: 10.5281/zenodo.19038659

Paper 1: Trust & Behavioral Foundations (V2)

SuperLocalMemory: A Structured Local Memory Architecture for Persistent AI Agent Context Varun Pratap Bhardwaj (2026) arXiv:2603.02240 · Zenodo DOI: 10.5281/zenodo.18709670

Cite This Work

@article{bhardwaj2026slmv33,
  title={SuperLocalMemory V3.3: The Living Brain — Biologically-Inspired
         Forgetting, Cognitive Quantization, and Multi-Channel Retrieval
         for Zero-LLM Agent Memory Systems},
  author={Bhardwaj, Varun Pratap},
  journal={arXiv preprint arXiv:2604.04514},
  year={2026},
  url={https://arxiv.org/abs/2604.04514}
}

@article{bhardwaj2026slmv3,
  title={Information-Geometric Foundations for Zero-LLM Enterprise Agent Memory},
  author={Bhardwaj, Varun Pratap},
  journal={arXiv preprint arXiv:2603.14588},
  year={2026}
}

@article{bhardwaj2026slm,
  title={A Structured Local Memory Architecture for Persistent AI Agent Context},
  author={Bhardwaj, Varun Pratap},
  journal={arXiv preprint arXiv:2603.02240},
  year={2026}
}

Support / License / Qualixar

See CONTRIBUTING.md for guidelines. Wiki for detailed documentation.

GNU Affero General Public License v3.0 (AGPL-3.0). See LICENSE.

For commercial licensing (closed-source, proprietary, or hosted use), see COMMERCIAL-LICENSE.md or contact varun.pratap.bhardwaj@gmail.com.

Copyright (c) 2026 Varun Pratap Bhardwaj / Qualixar.

Part of Qualixar · Author: Varun Pratap Bhardwaj

Acknowledgments

  • Everything Claude Code (ECC) — SLM's skill observation patterns were inspired by ECC's continuous learning architecture. SLM supports direct ingestion of ECC observations via slm ingest --source ecc. We recommend ECC for Claude Code users who want the deepest learning experience alongside SLM.
  • HKUDS/OpenSpace — The skill evolution research in SLM draws from the EvoSkills co-evolutionary verification concepts (arXiv:2604.01687). We adopted their 3-trigger evolution system and anti-loop guard patterns.

Qualixar AI Agent Reliability Platform

Qualixar is building the open-source infrastructure for AI agent reliability engineering. Seven products, one coherent platform:

ProductPurposeInstall
SuperLocalMemoryPersistent memory + learningnpm install -g superlocalmemory
Qualixar OSUniversal agent runtimenpx qualixar-os
SLM MeshP2P coordination across sessionsnpm i slm-mesh
SLM MCP HubFederate 430+ MCP toolspip install slm-mcp-hub
AgentAssayToken-efficient agent testingpip install agentassay
AgentAssertBehavioral contracts + drift detectionpip install agentassert-abc
SkillFortifyFormal verification for agent skillspip install skillfortify

Zero cloud dependency. Local-first. EU AI Act compliant.

Start here → qualixar.com · All papers on Qualixar HuggingFace


Built with mathematical rigor. Not in the race — here to help everyone build better AI memory systems.


Star This Project

If this project solves a real problem for you, please star the repo — it helps other developers discover Qualixar and signals that the AI agent reliability community is growing.

Star History Chart

Featured
CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →
Categories
AI & LLM ToolsDocuments & KnowledgeSearch & Web Crawling
Registryactive
Packagesuperlocalmemory
TransportSTDIO
UpdatedFeb 17, 2026
View on GitHub

Related AI & LLM Tools MCP Servers

View all →
SkillFM LLM Cost Optimizer

io.github.ericm1018/skillfm-llm-cost-optimizer-openai-anthropic-usage

LLM cost optimizer for OpenAI, Anthropic, token usage, BYOK, and SkillFM Beacon audits.
Llm Orchestration Agent

io.github.mikerawsonnz/llm-orchestration-agent

Run a prompt through a LangChain (system + human) chain over Gemini on Vertex AI; optional LangSmith
Authenticated Llm Agent

io.github.mikerawsonnz/authenticated-llm-agent

JWT-gated LLM gateway: authenticate (bcrypt/JWT), then run a LangChain-on-Vertex Gemini completion.
Copilot Memory MCP

labforgedev/copilot-memory-mcp

Persistent semantic memory for AI agents using local ChromaDB vector search. No cloud required.
1
Agent Prompt Injection Firewall Mcp

csoai-org/agent-prompt-injection-firewall-mcp

The WAF for agents. Pattern-based + heuristic firewall scans prompts, RAG documents, tool argume...
Authenticated Multi Llm Agent

io.github.mikerawsonnz/authenticated-multi-llm-agent

Google-OAuth-gated LLM gateway: verify a Google ID token, then run a Gemini (Vertex AI) completion f