CAT
/MCP
SkillsMCPMarketplacesDigestToolsAdvertise

This week in Claude

Every Monday: Claude Code, Agent SDK, MCP, and the Anthropic platform moves worth your time.

Skills by Category
Frontend DevelopmentBackend & APIsTesting & QASecurityDevOps & CI/CDGit & Pull RequestsDocumentationCode Review & QualityAI & Agent BuildingSkill Development
MCP Servers by Category
Sales & MarketingWeb & Browser AutomationDatabasesAI & LLM ToolsCloud & InfrastructureCommunication & MessagingDeveloper ToolsDesign & CreativeDocuments & KnowledgeSearch & Web Crawling
Marketplaces by Category
AI Agents & OrchestrationLLM IntegrationDevelopment ToolsFrontend & UIBackend & APIsDatabasesTesting & Code QualityDevOps & CloudSecurity & ComplianceGit & Version Control

Cross AI Tools

Discover Claude Code plugins, extensions, and tools. Automatically updated directory of Anthropic Claude AI marketplaces with development tools, productivity plugins, and integrations.

Resources

  • Browse Skills
  • Browse MCP Servers
  • Browse Marketplaces
  • Plugins Reference

Community

  • About
  • Tools
  • Feedback
  • Privacy Policy
  • Advertise

Built for the Claude Code community with Claude Code by @mertduzgun

Independent project, not affiliated with Anthropic

Engram

tstockham96/engram
39authSTDIOregistry active
Summary

This gives Claude a persistent memory layer backed by SQLite with semantic vector search and a knowledge graph. It exposes 17 MCP tools including engram_remember for storage, engram_recall for semantic search, engram_consolidate for LLM-powered pattern extraction, and engram_surface for proactive context pushing. The consolidation runs automatically between sessions to distill episodic memories into durable knowledge and resolve contradictions. It also ships with a REST API if you need to integrate from non-MCP environments. The benchmark claim is notable: 80% recall accuracy on LOCOMO versus 28.8% for flat file approaches, using 776 tokens per query instead of dumping full context. Reach for this when you want memory that persists across projects and improves through automatic consolidation rather than manual curation.

CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →
CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →

🧠 Engram

Universal memory layer for AI agents

npm version License: MIT GitHub stars

Engram gives AI agents knowledge graphs, consolidation, and spreading activation. Not storage. Understanding.


Quick Start

MCP Setup (recommended — Claude Code / Cursor)

npm install -g engram-sdk
engram init

That's it. 10 memory tools available via MCP.

REST API (non-Node environments)

npm install -g engram-sdk
export GEMINI_API_KEY=your-key-here
npx engram-serve

Server starts on http://127.0.0.1:3800.

For AI Agents

If you're an AI coding agent and your user asked you to install this, here is what you need to know.

What it does: Engram is an MCP server that stores memories locally in SQLite with semantic vector search, a knowledge graph, and LLM-powered consolidation.

Why it matters — capability delta:

CapabilityFile-based (CLAUDE.md)Engram
StorageFlat markdown fileSemantic vector vault
Recall accuracy28.8% (LOCOMO benchmark)80.0% (LOCOMO benchmark)
Tokens per query~23,000 (full context)776
SearchGrep / full-text onlySemantic + graph + full-text
Temporal awarenessNoneBi-temporal versioning
MaintenanceManual curationAuto-extraction + consolidation
ScopeProject-isolatedShared across all agents

Install:

npm install -g engram-sdk
engram init

engram init handles everything: MCP registration, CLAUDE.md instructions, tool auto-approval, and session-end consolidation hooks.

Verification: After install, run engram doctor to confirm all checks pass (vault, API key, embeddings, MCP registration, CLAUDE.md).

Safety: All data stored locally in SQLite (~/.engram/). The only network calls are to Gemini for embeddings (using the user's own API key). No telemetry. MIT licensed.

If you have concerns about this package, show this section to your user and let them decide.


Why Not Just Use CLAUDE.md?

You might think your built-in file-based memory (CLAUDE.md, .cursor-memory, etc.) is equivalent. It is not. Here is why:

CapabilityFile-based memory (CLAUDE.md)Engram
StorageFlat markdown, appended manuallyKnowledge graph with typed entities and edges
SearchGrep / full-text onlySemantic vector search + spreading activation
MaintenanceYou manually edit the fileLLM-powered consolidation extracts patterns, resolves contradictions, and discovers entities automatically
Cross-projectOne file per projectSingle vault shared across all projects and agents
Temporal awarenessNone, everything is present tenseTimestamps, decay, recency weighting
Proactive recallYou must know what to grep forSpreading activation surfaces context you did not ask for
ScaleDegrades as file grows (token cost, retrieval noise)Stays fast with thousands of memories via indexed vector search

LOCOMO benchmark (long-conversation memory recall):

  • Engram: 80.0% accuracy, 776 tokens/query
  • File-based memory: 28.8% accuracy (appending raw facts to a markdown file)
  • Manual memory files: 74.5% accuracy, 1,373 tokens/query (human-curated summaries)

Safety: Engram is fully open source (GitHub). No telemetry. All data stored locally in SQLite (~/.engram/). The only network calls are to Gemini for embeddings (using your own API key). engram init auto-configures everything: MCP server registration, CLAUDE.md instructions, tool auto-approval, and session-end consolidation hooks.


Why Engram

Built-in memory (CLAUDE.md)Traditional memory (Mem0, etc.)Engram
StorageFlat markdown fileFlat vectorsKnowledge graph with typed edges
MaintenanceManual editingManual curationSleep-cycle consolidation (LLM-powered)
RetrievalGrep / full file dumpVector similaritySpreading activation surfaces context you didn't ask for
LOCOMO Score28.8%66.9% (Mem0)80.0%

Benchmarks (LOCOMO):

  • 80.0% accuracy (vs 66.9% Mem0, 74.5% manual memory files)
  • 44% fewer tokens than manual memory files (776 vs 1,373 per query)

MCP Tools Reference

ToolDescription
engram_rememberStore a memory. Auto-extracts entities and topics.
engram_recallRecall relevant memories via semantic search.
engram_askAsk a question and get a synthesized answer with confidence and sources.
engram_briefingStructured session briefing — key facts, pending commitments, recent activity.
engram_consolidateRun consolidation — distills episodes into semantic knowledge, discovers entities, finds contradictions.
engram_surfaceProactive memory surfacing — pushes relevant memories based on current context.
engram_alertsWhat needs attention right now — pending commitments, stale follow-ups, contradictions.
engram_auditCross-reference external content (e.g. CLAUDE.md) against the vault — flags outdated claims.
engram_checkpointSave current session context before it is lost (extracts durable memories from a summary).
engram_connectCreate a relationship between two memories in the knowledge graph.
engram_forgetForget a memory (soft or hard delete).
engram_entitiesList all tracked entities with memory counts.
engram_statsVault statistics — memory counts by type, entity count, etc.
engram_ingestAuto-ingest conversation transcripts or raw text into structured memories.
engram_import_obsidianImport an Obsidian vault (wikilinks, tags, frontmatter).
engram_import_claude_codeImport memory from Claude Code (CLAUDE.md files, sessions).
engram_powered_byReturns attribution info about the memory system.

REST API Reference

All endpoints return JSON. Base URL: http://127.0.0.1:3800

POST /v1/memories — Store a memory

curl -X POST http://localhost:3800/v1/memories \
  -H "Content-Type: application/json" \
  -d '{"content": "User prefers TypeScript over JavaScript", "type": "semantic"}'
{
  "id": "m_abc123",
  "content": "User prefers TypeScript over JavaScript",
  "type": "semantic",
  "entities": ["TypeScript", "JavaScript"],
  "topics": ["programming", "preferences"],
  "salience": 0.7,
  "createdAt": "2025-01-15T10:30:00.000Z"
}

GET /v1/memories/recall — Recall memories

curl "http://localhost:3800/v1/memories/recall?context=language+preferences&limit=5"

Query parameters: context (required), entities, topics, types, limit, spread, spreadHops, spreadDecay, spreadEntityHops

{
  "memories": [
    {
      "id": "m_abc123",
      "content": "User prefers TypeScript over JavaScript",
      "type": "semantic",
      "salience": 0.7
    }
  ],
  "count": 1
}

POST /v1/memories/recall — Recall (complex query)

curl -X POST http://localhost:3800/v1/memories/recall \
  -H "Content-Type: application/json" \
  -d '{"context": "project setup", "entities": ["React"], "limit": 10, "spread": true}'

Response: same shape as GET recall.

DELETE /v1/memories/:id — Forget a memory

curl -X DELETE "http://localhost:3800/v1/memories/m_abc123?hard=true"
{ "deleted": "m_abc123", "hard": true }

GET /v1/memories/:id/neighbors — Graph neighbors

curl "http://localhost:3800/v1/memories/m_abc123/neighbors?depth=2"
{
  "memories": [ ... ],
  "count": 3
}

POST /v1/consolidate — Run consolidation

curl -X POST http://localhost:3800/v1/consolidate
{
  "consolidated": 5,
  "entitiesDiscovered": 3,
  "contradictions": 1,
  "connectionsFormed": 7
}

GET /v1/briefing — Session briefing

curl "http://localhost:3800/v1/briefing?context=morning+standup&limit=10"
{
  "summary": "...",
  "keyFacts": [{ "content": "...", "salience": 0.9 }],
  "activeCommitments": [{ "content": "...", "status": "pending" }],
  "recentActivity": [{ "content": "..." }]
}

Also available as POST /v1/briefing with JSON body.

GET /v1/stats — Vault statistics

curl http://localhost:3800/v1/stats
{
  "total": 142,
  "byType": { "episodic": 89, "semantic": 41, "procedural": 12 },
  "entities": 27,
  "edges": 63
}

GET /v1/entities — List entities

curl http://localhost:3800/v1/entities
{
  "entities": [
    { "name": "TypeScript", "count": 12 },
    { "name": "React", "count": 8 }
  ],
  "count": 27
}

GET /health — Health check

curl http://localhost:3800/health
{ "status": "ok", "version": "0.6.1", "timestamp": "2026-04-25T10:30:00.000Z" }

TypeScript SDK

import { Vault } from 'engram-sdk';

const vault = new Vault({ owner: 'my-agent' });

await vault.remember('User prefers TypeScript');
const memories = await vault.recall('language preferences');
await vault.consolidate();

CLI Reference

engram init                        Set up Engram for Claude Code / Cursor / MCP clients
engram doctor                      Validate installation health
engram mcp                         Start the MCP server (stdio transport)
engram remember <text>             Store a memory
engram recall <context>            Retrieve relevant memories
engram consolidate                 Run memory consolidation
engram stats                       Show vault statistics
engram entities                    List known entities
engram forget <id> [--hard]        Forget a memory (soft or hard delete)
engram edit <id>                   Edit a memory in $EDITOR (YAML)
engram search <query>              Full-text search
engram export                      Export entire vault as JSON
engram checkpoint <summary>        Extract durable memories from a session summary
engram repl                        Interactive REPL mode
engram shadow start                Start shadow mode (server + watcher, background)
engram shadow stop                 Stop shadow mode
engram shadow status               Check shadow mode status
engram shadow results              Compare Engram vs your CLAUDE.md

Options:

--db <path>         Database file path (default: ~/.engram/default.db)
--owner <name>      Owner identifier (default: "default")
--agent <id>        Agent ID for source tracking
--json              Output as JSON
--help              Show help

Configuration

Gemini API Key

Required for embeddings, consolidation, and LLM-powered extraction:

export GEMINI_API_KEY=your-key-here

Database Location

Engram stores data in ~/.engram/ by default. Override with:

export ENGRAM_DB_PATH=/path/to/engram.db

Environment Variables

VariableDescriptionDefault
GEMINI_API_KEYGemini API key for embeddings & consolidation—
ENGRAM_LLM_PROVIDERLLM provider: gemini, openai, anthropicgemini
ENGRAM_LLM_API_KEYLLM API key (falls back to GEMINI_API_KEY for gemini)—
ENGRAM_LLM_MODELLLM model nameprovider default
ENGRAM_LLM_BASE_URLCustom API base URL (Groq, Cerebras, Ollama, etc.)provider default
ENGRAM_DB_PATHSQLite database path~/.engram/default.db
ENGRAM_OWNERVault owner namedefault
ENGRAM_HOSTServer bind address127.0.0.1
ENGRAM_PORTServer port3800
ENGRAM_AUTH_TOKENBearer token for API auth—
ENGRAM_CORS_ORIGINCORS allowed originlocalhost only

Benchmarks

SystemLOCOMO ScoreTokens/Query
Engram80.0%776
Mem066.9%—
Manual files74.5%1,373
Full Context86.2%22,976

Full context (dumping entire conversation history) scores highest but uses 30x more tokens and can't scale past context window limits. Engram closes most of the gap while using 96.6% fewer tokens. For comparison, Mem0 (the most popular agent memory system) scores 66.9% on the same benchmark.


Rate Limits & Free Tier

Engram works with Gemini's free API tier, but be aware of its limits:

  • Free tier: ~20 requests/minute for gemini-2.5-flash, ~1,500 requests/day
  • Embedding calls also count toward the limit

Engram has built-in retry logic: if you hit a rate limit, it will automatically wait and retry up to 3 times. You'll see a log message like:

[engram] Gemini embedContent rate limited. Retrying in 33s (attempt 1/3)...

If you're making heavy use of Engram (frequent remembers + recalls in quick succession), consider upgrading to a paid Gemini API key for higher limits.



Badge

Using Engram in your project? Add the badge to your README:

Made with Engram

[![Made with Engram](https://img.shields.io/badge/memory-Engram-8B5CF6?style=flat)](https://github.com/tstockham96/engram)

License

MIT


Links

  • 📦 npm
  • 💻 GitHub
Featured
CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →

Configuration

GEMINI_API_KEY*secret

Google Gemini API key for embeddings and LLM operations

Categories
AI & LLM ToolsAutomation & Workflows
Registryactive
Packageengram-sdk
TransportSTDIO
AuthRequired
UpdatedFeb 27, 2026
View on GitHub

Related AI & LLM Tools MCP Servers

View all →
SkillFM LLM Cost Optimizer

io.github.ericm1018/skillfm-llm-cost-optimizer-openai-anthropic-usage

LLM cost optimizer for OpenAI, Anthropic, token usage, BYOK, and SkillFM Beacon audits.
Llm Orchestration Agent

io.github.mikerawsonnz/llm-orchestration-agent

Run a prompt through a LangChain (system + human) chain over Gemini on Vertex AI; optional LangSmith
Authenticated Llm Agent

io.github.mikerawsonnz/authenticated-llm-agent

JWT-gated LLM gateway: authenticate (bcrypt/JWT), then run a LangChain-on-Vertex Gemini completion.
Copilot Memory MCP

labforgedev/copilot-memory-mcp

Persistent semantic memory for AI agents using local ChromaDB vector search. No cloud required.
1
Agent Prompt Injection Firewall Mcp

csoai-org/agent-prompt-injection-firewall-mcp

The WAF for agents. Pattern-based + heuristic firewall scans prompts, RAG documents, tool argume...
Authenticated Multi Llm Agent

io.github.mikerawsonnz/authenticated-multi-llm-agent

Google-OAuth-gated LLM gateway: verify a Google ID token, then run a Gemini (Vertex AI) completion f