Remembra — AI Memory Layer

145 toolsSTDIOregistry active

Summary

This is a self-hosted memory layer that gives Claude persistent recall across conversations through 11 MCP tools. It automatically extracts entities from what you tell it, resolves that "Sarah" and "Sarah from Acme" are the same person, and builds a temporal knowledge graph you can query later. You get hybrid semantic and keyword search, temporal decay patterns, and conflict detection when memories contradict each other. The quickstart script spins up Remembra with Qdrant and Ollama locally in one command, no API keys required. It includes a dashboard for browsing entities and relationships, plus Python and TypeScript SDKs if you want to integrate it into your own tools. Useful when you need Claude to remember user preferences, past decisions, or relationship context beyond the current chat window.

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

Tools

Public tool metadata for what this MCP can expose to an agent.

5 tools

store_memoryStore information in persistent memory. Automatically extracts entities (people, organizations, locations) and facts from the content.3 params

Store information in persistent memory. Automatically extracts entities (people, organizations, locations) and facts from the content.

Parameters* required

ttlstring

Optional time-to-live. Examples: "24h", "7d", "30d", "1y", or omit for permanent storage.

contentstring

The text content to memorize. Can be facts, context, decisions, meeting notes, preferences, etc.

metadataobject

Optional key-value metadata to attach (e.g., {"source": "meeting", "date": "2024-01-15"})

recall_memoriesSearch persistent memory for relevant information using hybrid search (semantic + keyword). Use this BEFORE answering questions about past decisions, context, people, or projects.3 params

Search persistent memory for relevant information using hybrid search (semantic + keyword). Use this BEFORE answering questions about past decisions, context, people, or projects.

Parameters* required

limitinteger

Maximum number of memories to return (1-50, default: 5)default: 5

querystring

Natural language query. Can be a question or keywords.

thresholdnumber

Minimum relevance score 0.0-1.0 (default: 0.4)default: 0.4

forget_memoriesDelete memories from persistent storage. GDPR-compliant deletion. Provide exactly one of: memory_id, entity, or all_memories.3 params

Delete memories from persistent storage. GDPR-compliant deletion. Provide exactly one of: memory_id, entity, or all_memories.

Parameters* required

entitystring

Delete all memories about a specific entity (person, org, etc)

memory_idstring

Delete a specific memory by its ID

all_memoriesboolean

Set to true to delete ALL memories. Use with extreme caution!

health_checkCheck Remembra server health and connection status. Returns server version, status, and component health.

Check Remembra server health and connection status. Returns server version, status, and component health.

No parameter schema in public metadata yet.

ingest_conversationIngest a conversation and automatically extract memories. Processes messages and intelligently extracts facts worth remembering long-term.5 params

Ingest a conversation and automatically extract memories. Processes messages and intelligently extracts facts worth remembering long-term.

Parameters* required

storeboolean

If false, returns extraction results without storing (dry run)default: true

messagesarray

List of message dicts with 'role' and 'content'. Optional: 'name', 'timestamp'.

session_idstring

Optional session ID for grouping related conversations

extract_fromstring

Which messages to extract from: 'user', 'assistant', or 'both'one of user · assistant · bothdefault: both

min_importancenumber

Minimum importance threshold (0.0-1.0, default: 0.5)default: 0.5

Remembra

The memory layer for AI that actually works.
Persistent memory with entity resolution, temporal decay, and graph-aware recall.
Self-host in minutes. No vendor lock-in.

Documentation • Website • Quick Start • Why Remembra? • Twitter • Discord

🚀 What's New in v0.13.0

Dashboard v2.0

🔐 Two-Factor Authentication — TOTP-based 2FA with authenticator apps
👥 Team Collaboration — Shared memory spaces with role-based access
🛠️ Admin Dashboard — Full user management (delete/deactivate/reset)
📊 Activity Log — Security audit trail with JSON export
🕵️ Entity Browser — Visual exploration of people, places, concepts
⏰ Timeline Fix — Proper timezone handling with local time display

Core API

📦 npm Package — npm install remembra with full TypeScript support
🔒 Security Fixes — RBAC enforcement, SSRF protection, error sanitization

Supported Agents (6+)

Claude Desktop • Claude Code • Codex CLI • Cursor • Windsurf • Gemini

Previous (v0.12.x)

👤 User Profiles API with activity metrics
🧠 Smart Auto-Forgetting (35+ temporal patterns)
⏰ Event-driven expiry with expires_at
🌐 Browser Extension for AI chat interfaces

The Problem

Every AI app needs memory. Your chatbot forgets users between sessions. Your agent can't recall decisions from yesterday. Your assistant asks the same questions over and over.

Existing solutions have tradeoffs:

Mem0: Graph features require $249/mo plan; limited self-hosting documentation
Zep: Academic approach, complex deployment
Letta: Research-grade, not production-ready
LangChain Memory: Too basic, no persistence

The Solution

from remembra import Memory

memory = Memory(user_id="user_123")

# Store — entities and facts extracted automatically
memory.store("Had a meeting with Sarah from Acme Corp. She prefers email over Slack.")

# Recall — semantic search finds relevant memories
result = memory.recall("How should I contact Sarah?")
print(result.context)
# → "Sarah from Acme Corp prefers email over Slack."

# It knows "Sarah" and "Acme Corp" are entities. It builds relationships.
# It persists across sessions, reboots, context windows. Forever.

⚡ Quick Start (2 Minutes)

One Command Install

curl -sSL https://raw.githubusercontent.com/remembra-ai/remembra/main/quickstart.sh | bash

That's it. Remembra + Qdrant + Ollama start locally. No API keys needed.

Or with Docker Compose directly:

git clone https://github.com/remembra-ai/remembra && cd remembra
docker compose -f docker-compose.quickstart.yml up -d

Try it:

# Store a memory
curl -X POST http://localhost:8787/api/v1/memories \
  -H "Content-Type: application/json" \
  -d '{"content": "Alice is CEO of Acme Corp", "user_id": "demo"}'

# Recall it
curl -X POST http://localhost:8787/api/v1/memories/recall \
  -H "Content-Type: application/json" \
  -d '{"query": "Who runs Acme?", "user_id": "demo"}'

Connect ALL Your AI Agents (NEW in v0.10.0)

One command configures everything:

pip install remembra
remembra-install --all --url http://localhost:8787

This auto-detects and configures: Claude Desktop, Claude Code, Codex CLI, Cursor, Windsurf, Gemini.

Verify setup:

remembra-doctor all

Manual MCP Config (if needed)

Claude Desktop — add to ~/Library/Application Support/Claude/claude_desktop_config.json:

{
  "mcpServers": {
    "remembra": {
      "command": "remembra-mcp",
      "env": {
        "REMEMBRA_URL": "http://localhost:8787",
        "REMEMBRA_USER_ID": "default"
      }
    }
  }
}

Claude Code:

claude mcp add remembra -e REMEMBRA_URL=http://localhost:8787 -- remembra-mcp

Cursor — add to .cursor/mcp.json:

{
  "mcpServers": {
    "remembra": {
      "command": "remembra-mcp",
      "env": {
        "REMEMBRA_URL": "http://localhost:8787"
      }
    }
  }
}

Now ask Claude: "Remember that Alice is CEO of Acme Corp" — then later: "Who runs Acme?"

Python SDK

pip install remembra

from remembra import Memory

memory = Memory(user_id="user_123")
memory.store("Had a meeting with Sarah from Acme Corp. She prefers email over Slack.")
result = memory.recall("How should I contact Sarah?")
print(result.context)  # "Sarah from Acme Corp prefers email over Slack."

TypeScript SDK

npm install remembra

import { Remembra } from 'remembra';

const memory = new Remembra({ url: 'http://localhost:8787' });
await memory.store('User prefers dark mode');
const result = await memory.recall('preferences');

🔥 Why Remembra?

Feature Comparison

Feature	Remembra	Mem0	Zep/Graphiti	Letta	Engram
One-Command Install	✅ `curl \| bash`	✅ pip	✅ pip	⚠️ Complex	✅ brew
Bi-Temporal Relationships	✅ Point-in-time	❌	⚠️ Basic	❌	❌
Entity Resolution	✅ Free	💰 $249/mo	✅	❌	❌
Conflict Detection	✅ Auto-supersede	❌	❌	❌	❌
PII Detection	✅ Built-in	❌	❌	❌	❌
Hybrid Search	✅ BM25+Vector	❌	✅	❌	❌
6 Embedding Providers	✅ Hot-swap	❌ (1-2)	❌ (1)	❌	❌
Plugin System	✅	❌	❌	✅	❌
Sleep-Time Compute	✅	❌	❌	✅	❌
Self-Host + Billing	✅ Stripe	❌	❌	❌	❌
Memory Spaces	✅ Multi-tenant	❌	❌	❌	❌
MCP Server	✅ 11 Tools	✅	❌	❌	✅
Pricing	Free / $49 / $199	$19 → $249	$25+	Free	Free
License	MIT	Apache 2.0	Apache 2.0	Apache 2.0	MIT

Core Features

🧠 Smart Extraction — LLM-powered fact extraction from raw text

👥 Entity Resolution — "Adam", "Mr. Smith", "my husband" → same person

⏱️ Temporal Memory — TTL, decay curves, historical queries

🔍 Hybrid Search — Semantic + keyword for accurate recall

🔒 Security — PII detection, anomaly monitoring, audit logs

📊 Dashboard — Visual memory browser, entity graphs, analytics

📊 Benchmark Results

Tested on the LoCoMo benchmark (Snap Research, ACL 2024) — the standard academic benchmark for AI memory systems.

Category	Accuracy	Questions
Single-hop (direct recall)	100%	37
Multi-hop (cross-session reasoning)	100%	32
Temporal (time-based queries)	100%	13
Open-domain (world knowledge + memory)	100%	70
Overall (memory categories)	100%	152

Scored with LLM judge (GPT-4o-mini). Adversarial detection not yet implemented. Run your own: python benchmarks/locomo_runner.py --data /tmp/locomo/data/locomo10.json

📖 Documentation

Resource	Description
Quick Start	Get running in minutes
Python SDK	Full Python reference
TypeScript SDK	JavaScript/TypeScript guide
MCP Server	Tool reference + setup guides for 11 tools
REST API	API reference
Self-Hosting	Docker deployment guide

🛠️ MCP Server

Give any AI coding tool persistent memory with one command. Works with Claude Code, Cursor, VS Code + Copilot, Windsurf, JetBrains, Zed, OpenAI Codex, and any MCP-compatible client.

pip install remembra[mcp]
claude mcp add remembra -e REMEMBRA_URL=http://localhost:8787 -- remembra-mcp

Available Tools (11 total):

Tool	Description
`store_memory`	Save facts, decisions, context
`recall_memories`	Semantic search across memories
`update_memory`	Update content without delete+recreate
`forget_memories`	GDPR-compliant deletion
`list_memories`	Browse stored memories
`search_entities`	Search the entity graph
`share_memory`	Cross-agent memory sharing via Spaces
`timeline`	Temporal browsing by entity and date
`relationships_at`	Point-in-time relationship queries
`ingest_conversation`	Auto-extract from chat history
`health_check`	Verify connection

🏗️ Architecture

┌─────────────────────────────────────────────────────────────┐
│                    Your Application                          │
├──────────┬──────────────┬───────────────────────────────────┤
│ Python   │ TypeScript   │ MCP Server (Claude/Cursor)        │
│ SDK      │ SDK          │ remembra-mcp                      │
├──────────┴──────────────┴───────────────────────────────────┤
│                   Remembra REST API                          │
├──────────────┬──────────────┬───────────────┬───────────────┤
│  Extraction  │   Entities   │   Retrieval   │   Security    │
│  (LLM)       │  (Graph)     │ (Hybrid)      │  (PII/Audit)  │
├──────────────┴──────────────┴───────────────┴───────────────┤
│                    Storage Layer                             │
│         Qdrant (vectors) + SQLite (metadata/graph)          │
└─────────────────────────────────────────────────────────────┘

🤝 Contributing

We welcome contributions! See CONTRIBUTING.md for guidelines.

# Clone
git clone https://github.com/remembra-ai/remembra
cd remembra

# Install dev dependencies
pip install -e ".[dev]"

# Run tests
pytest

# Start dev server
remembra-server --reload

📄 License

MIT License — Use it however you want.

⭐ Star History

If Remembra helps you, please star the repo! It helps others discover the project.

Built with ❤️ by DolphyTech
remembra.dev • docs • twitter • discord

Featured

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

Remembra

The memory layer for AI that actually works.
Persistent memory with entity resolution, temporal decay, and graph-aware recall.
Self-host in minutes. No vendor lock-in.

Documentation • Website • Quick Start • Why Remembra? • Twitter • Discord

🚀 What's New in v0.13.0

Dashboard v2.0

🔐 Two-Factor Authentication — TOTP-based 2FA with authenticator apps
👥 Team Collaboration — Shared memory spaces with role-based access
🛠️ Admin Dashboard — Full user management (delete/deactivate/reset)
📊 Activity Log — Security audit trail with JSON export
🕵️ Entity Browser — Visual exploration of people, places, concepts
⏰ Timeline Fix — Proper timezone handling with local time display

Core API

📦 npm Package — npm install remembra with full TypeScript support
🔒 Security Fixes — RBAC enforcement, SSRF protection, error sanitization

Supported Agents (6+)

Claude Desktop • Claude Code • Codex CLI • Cursor • Windsurf • Gemini

Previous (v0.12.x)

👤 User Profiles API with activity metrics
🧠 Smart Auto-Forgetting (35+ temporal patterns)
⏰ Event-driven expiry with expires_at
🌐 Browser Extension for AI chat interfaces

The Problem

Every AI app needs memory. Your chatbot forgets users between sessions. Your agent can't recall decisions from yesterday. Your assistant asks the same questions over and over.

Existing solutions have tradeoffs:

Mem0: Graph features require $249/mo plan; limited self-hosting documentation
Zep: Academic approach, complex deployment
Letta: Research-grade, not production-ready
LangChain Memory: Too basic, no persistence

The Solution

from remembra import Memory

memory = Memory(user_id="user_123")

# Store — entities and facts extracted automatically
memory.store("Had a meeting with Sarah from Acme Corp. She prefers email over Slack.")

# Recall — semantic search finds relevant memories
result = memory.recall("How should I contact Sarah?")
print(result.context)
# → "Sarah from Acme Corp prefers email over Slack."

# It knows "Sarah" and "Acme Corp" are entities. It builds relationships.
# It persists across sessions, reboots, context windows. Forever.

⚡ Quick Start (2 Minutes)

One Command Install

curl -sSL https://raw.githubusercontent.com/remembra-ai/remembra/main/quickstart.sh | bash

That's it. Remembra + Qdrant + Ollama start locally. No API keys needed.

Or with Docker Compose directly:

git clone https://github.com/remembra-ai/remembra && cd remembra
docker compose -f docker-compose.quickstart.yml up -d

Try it:

# Store a memory
curl -X POST http://localhost:8787/api/v1/memories \
  -H "Content-Type: application/json" \
  -d '{"content": "Alice is CEO of Acme Corp", "user_id": "demo"}'

# Recall it
curl -X POST http://localhost:8787/api/v1/memories/recall \
  -H "Content-Type: application/json" \
  -d '{"query": "Who runs Acme?", "user_id": "demo"}'

Connect ALL Your AI Agents (NEW in v0.10.0)

One command configures everything:

pip install remembra
remembra-install --all --url http://localhost:8787

This auto-detects and configures: Claude Desktop, Claude Code, Codex CLI, Cursor, Windsurf, Gemini.

Verify setup:

remembra-doctor all

Manual MCP Config (if needed)

Claude Desktop — add to ~/Library/Application Support/Claude/claude_desktop_config.json:

{
  "mcpServers": {
    "remembra": {
      "command": "remembra-mcp",
      "env": {
        "REMEMBRA_URL": "http://localhost:8787",
        "REMEMBRA_USER_ID": "default"
      }
    }
  }
}

Claude Code:

claude mcp add remembra -e REMEMBRA_URL=http://localhost:8787 -- remembra-mcp

Cursor — add to .cursor/mcp.json:

{
  "mcpServers": {
    "remembra": {
      "command": "remembra-mcp",
      "env": {
        "REMEMBRA_URL": "http://localhost:8787"
      }
    }
  }
}

Now ask Claude: "Remember that Alice is CEO of Acme Corp" — then later: "Who runs Acme?"

Python SDK

pip install remembra

from remembra import Memory

memory = Memory(user_id="user_123")
memory.store("Had a meeting with Sarah from Acme Corp. She prefers email over Slack.")
result = memory.recall("How should I contact Sarah?")
print(result.context)  # "Sarah from Acme Corp prefers email over Slack."

TypeScript SDK

npm install remembra

import { Remembra } from 'remembra';

const memory = new Remembra({ url: 'http://localhost:8787' });
await memory.store('User prefers dark mode');
const result = await memory.recall('preferences');

🔥 Why Remembra?

Feature Comparison

Feature	Remembra	Mem0	Zep/Graphiti	Letta	Engram
One-Command Install	✅ `curl \| bash`	✅ pip	✅ pip	⚠️ Complex	✅ brew
Bi-Temporal Relationships	✅ Point-in-time	❌	⚠️ Basic	❌	❌
Entity Resolution	✅ Free	💰 $249/mo	✅	❌	❌
Conflict Detection	✅ Auto-supersede	❌	❌	❌	❌
PII Detection	✅ Built-in	❌	❌	❌	❌
Hybrid Search	✅ BM25+Vector	❌	✅	❌	❌
6 Embedding Providers	✅ Hot-swap	❌ (1-2)	❌ (1)	❌	❌
Plugin System	✅	❌	❌	✅	❌
Sleep-Time Compute	✅	❌	❌	✅	❌
Self-Host + Billing	✅ Stripe	❌	❌	❌	❌
Memory Spaces	✅ Multi-tenant	❌	❌	❌	❌
MCP Server	✅ 11 Tools	✅	❌	❌	✅
Pricing	Free / $49 / $199	$19 → $249	$25+	Free	Free
License	MIT	Apache 2.0	Apache 2.0	Apache 2.0	MIT

Core Features

🧠 Smart Extraction — LLM-powered fact extraction from raw text

👥 Entity Resolution — "Adam", "Mr. Smith", "my husband" → same person

⏱️ Temporal Memory — TTL, decay curves, historical queries

🔍 Hybrid Search — Semantic + keyword for accurate recall

🔒 Security — PII detection, anomaly monitoring, audit logs

📊 Dashboard — Visual memory browser, entity graphs, analytics

📊 Benchmark Results

Tested on the LoCoMo benchmark (Snap Research, ACL 2024) — the standard academic benchmark for AI memory systems.

Category	Accuracy	Questions
Single-hop (direct recall)	100%	37
Multi-hop (cross-session reasoning)	100%	32
Temporal (time-based queries)	100%	13
Open-domain (world knowledge + memory)	100%	70
Overall (memory categories)	100%	152

Scored with LLM judge (GPT-4o-mini). Adversarial detection not yet implemented. Run your own: python benchmarks/locomo_runner.py --data /tmp/locomo/data/locomo10.json

📖 Documentation

Resource	Description
Quick Start	Get running in minutes
Python SDK	Full Python reference
TypeScript SDK	JavaScript/TypeScript guide
MCP Server	Tool reference + setup guides for 11 tools
REST API	API reference
Self-Hosting	Docker deployment guide

🛠️ MCP Server

pip install remembra[mcp]
claude mcp add remembra -e REMEMBRA_URL=http://localhost:8787 -- remembra-mcp

Available Tools (11 total):

Tool	Description
`store_memory`	Save facts, decisions, context
`recall_memories`	Semantic search across memories
`update_memory`	Update content without delete+recreate
`forget_memories`	GDPR-compliant deletion
`list_memories`	Browse stored memories
`search_entities`	Search the entity graph
`share_memory`	Cross-agent memory sharing via Spaces
`timeline`	Temporal browsing by entity and date
`relationships_at`	Point-in-time relationship queries
`ingest_conversation`	Auto-extract from chat history
`health_check`	Verify connection

🏗️ Architecture

┌─────────────────────────────────────────────────────────────┐
│                    Your Application                          │
├──────────┬──────────────┬───────────────────────────────────┤
│ Python   │ TypeScript   │ MCP Server (Claude/Cursor)        │
│ SDK      │ SDK          │ remembra-mcp                      │
├──────────┴──────────────┴───────────────────────────────────┤
│                   Remembra REST API                          │
├──────────────┬──────────────┬───────────────┬───────────────┤
│  Extraction  │   Entities   │   Retrieval   │   Security    │
│  (LLM)       │  (Graph)     │ (Hybrid)      │  (PII/Audit)  │
├──────────────┴──────────────┴───────────────┴───────────────┤
│                    Storage Layer                             │
│         Qdrant (vectors) + SQLite (metadata/graph)          │
└─────────────────────────────────────────────────────────────┘

🤝 Contributing

We welcome contributions! See CONTRIBUTING.md for guidelines.

# Clone
git clone https://github.com/remembra-ai/remembra
cd remembra

# Install dev dependencies
pip install -e ".[dev]"

# Run tests
pytest

# Start dev server
remembra-server --reload

📄 License

MIT License — Use it however you want.

⭐ Star History

If Remembra helps you, please star the repo! It helps others discover the project.

Built with ❤️ by DolphyTech
remembra.dev • docs • twitter • discord

Remembra — AI Memory Layer

Tools

Remembra

🚀 What's New in v0.13.0

Dashboard v2.0

Core API

Supported Agents (6+)

Previous (v0.12.x)

The Problem

The Solution

⚡ Quick Start (2 Minutes)

One Command Install

Connect ALL Your AI Agents (NEW in v0.10.0)

Python SDK

TypeScript SDK

🔥 Why Remembra?

Feature Comparison

Core Features

📊 Benchmark Results

📖 Documentation

🛠️ MCP Server

🏗️ Architecture

🤝 Contributing

📄 License

⭐ Star History

Remembra — AI Memory Layer

Tools

Remembra

🚀 What's New in v0.13.0

Dashboard v2.0

Core API

Supported Agents (6+)

Previous (v0.12.x)

The Problem

The Solution

⚡ Quick Start (2 Minutes)

One Command Install

Connect ALL Your AI Agents (NEW in v0.10.0)

Python SDK

TypeScript SDK

🔥 Why Remembra?

Feature Comparison

Core Features

📊 Benchmark Results

📖 Documentation

🛠️ MCP Server

🏗️ Architecture

🤝 Contributing

📄 License

⭐ Star History

Related Documents & Knowledge MCP Servers

Related Documents & Knowledge MCP Servers