Built for the Claude Code community with Claude Code by @mertduzgun

Independent project, not affiliated with Anthropic

Vidlens Mcp

thatsrajan/vidlens-mcp
25 · auth · STDIO · registry active
Summary

Turns YouTube into a queryable research database with 46 tools covering search, transcript analysis, visual search via frame extraction, and playlist indexing. Works immediately without API keys through a three-tier fallback system (YouTube API, yt-dlp, page scraping). The explore pipeline handles full research workflows in one prompt: it searches videos, ranks them by relevance, reads transcripts, extracts benchmark data, and generates comparison charts. Semantic search uses Gemini embeddings to query across entire playlists by meaning. Visual search pulls keyframes, runs OCR on slides, and lets you search by what appears on screen rather than just spoken content. Includes sentiment analysis, content gap detection, and hook pattern analysis. The zero-config setup wizard auto-detects Claude Desktop and Claude Code.


VidLens — video as a queryable local asset for AI agents


Most tools can read what was said in a video. VidLens can see what was shown.


🔍 What is VidLens?

Stop watching 10 videos to answer one question. VidLens searches YouTube, reads the transcripts, and synthesizes what creators actually said — across multiple videos, with timestamps, benchmark charts, and sources.

VidLens is a Model Context Protocol server that gives AI agents deep, reliable access to YouTube. Not just transcripts — full intelligence: search, analysis, visual search, and auto-generated comparison charts.

It is also growing into a universal video asset layer: direct video URLs from X/Twitter, Instagram, TikTok, generic yt-dlp-supported pages, and local video files can be imported into the same local media store for frame extraction, Apple Vision OCR/similarity, and visual search. Claude Desktop, Claude Code, Codex CLI, and the Codex desktop plugin all use the same MCP server.

No API key required to start. Every tool has a three-tier fallback chain (YouTube API → yt-dlp → page extraction) so nothing breaks when quota runs out or keys aren't configured.

One prompt → full research pipeline with benchmark comparison

Try it — paste any of these into Claude:

"I'm thinking about buying the M5 Max MacBook Pro. Search YouTube for top tech reviewers and tell me what they're saying. Is it worth the upgrade from M3/M4?"

VidLens finds 10+ reviews, reads the transcripts, extracts benchmark scores, and presents comparison charts — all from one prompt.

"I want to understand how AI agents work. Search YouTube for the best videos for a beginner and summarize what I need to know."

Discovers videos across creators, ranks by learning value, and prepares transcripts for follow-up questions.

"Search YouTube for reviews comparing the iPhone 17 Pro vs Samsung S26 Ultra. What do reviewers agree on? Where do they disagree?"

Searches, reads transcripts from multiple reviewers, and synthesizes consensus vs disagreements with sources.


🎯 Core Capabilities

🔍 Explore — One Prompt, Full Pipeline

Ask a question about YouTube and VidLens does the rest: searches, ranks by creator match and freshness, reads transcripts, extracts benchmark data, and presents comparison charts automatically. Works for product research, learning, competitive analysis — anything on YouTube.

🔎 Semantic Search Across Playlists

Import entire playlists or video sets, index every transcript with Gemini embeddings, and search across hundreds of hours of content by meaning — not just keywords.
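A minimal sketch of how embedding-based search can rank transcript chunks by meaning, assuming toy 3-dimensional vectors in place of real Gemini embeddings. The chunk records, the vectors, and the function names `cosine` and `search` are hypothetical illustrations, not VidLens internals.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def search(query_vec, chunks, top_k=2):
    """Return the top_k chunks ranked by similarity to the query vector."""
    scored = [(cosine(query_vec, c["vector"]), c) for c in chunks]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [c for _, c in scored[:top_k]]

# Toy index: made-up transcript chunks with made-up 3-dim vectors
CHUNKS = [
    {"video": "vid_a", "start": 12.0, "text": "battery life test", "vector": [0.9, 0.1, 0.0]},
    {"video": "vid_b", "start": 95.5, "text": "gpu benchmark", "vector": [0.1, 0.9, 0.1]},
    {"video": "vid_c", "start": 40.0, "text": "thermals under load", "vector": [0.2, 0.7, 0.6]},
]

# A query vector that points mostly along the second dimension
results = search([0.0, 1.0, 0.2], CHUNKS)
for hit in results:
    print(hit["video"], hit["start"], hit["text"])
```

In a real index the query text would be embedded with the same model as the transcripts, so that both live in the same vector space.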

👁️ Visual Search — See What's In Videos

Extract keyframes, describe them with Gemini Vision, run OCR on slides and whiteboards, and search by what you see — not just what's said.

Visual search — find benchmark charts inside videos by searching

📊 Intelligence Layer — Not Just Data

Sentiment analysis, niche trend discovery, content gap detection, hook pattern analysis, upload timing recommendations. The LLM does the thinking — VidLens gives it the right data.

Video intelligence — stats, structure analysis, and comment sentiment

⚡ Zero Config, Always Works

No API key needed to start. Three-tier fallback chain on every tool. Nothing breaks when quota runs out. Keys are optional power-ups.

🎬 Full Media Pipeline

Download videos/audio/thumbnails. Extract keyframes. Index comments for semantic search. Build a local knowledge base from any YouTube content.


⚡ Why VidLens?

| | VidLens | Other YouTube MCP servers |
| --- | --- | --- |
| 🔑 Setup | ✅ Works immediately - no keys needed | ❌ Most require YouTube API key upfront |
| 🛡️ Reliability | ✅ Three-tier fallback on every tool | ❌ Single point of failure - API down = broken |
| 🧠 Intelligence | ✅ Sentiment, trends, content gaps, hooks | ❌ Raw data dumps - you do the analysis |
| 📦 Token efficiency | ✅ 75-87% smaller responses | ❌ Verbose JSON with thumbnails, etags, junk |
| 🔬 Depth | ✅ 46 tools across 11 modules | ⚠️ 1-5 tools, mostly transcripts only |
| 🖼️ Visual evidence | ✅ Returns actual frame paths + timestamps, not just text hits | ⚠️ Usually transcript-only or raw frame dumps |
| ⚖️ Trademark | ✅ Compliant naming | ⚠️ Most violate YouTube trademark |

🚀 Quick Start

1. Install

npx vidlens-mcp setup

This auto-detects your MCP clients (Claude Desktop, Claude Code, Codex when present), downloads yt-dlp if needed, and walks you through optional API keys, speech-to-text, web search, and cookies. No manual config editing required. For Claude Code, setup registers VidLens in the user MCP registry and verifies the result with claude mcp list; when API keys or cookie settings are present, it writes the registry file directly so secrets are not passed through command arguments.

If you install globally with npm install -g vidlens-mcp, npm prints the next command to run. The install step itself does not collect secrets; vidlens-mcp setup is the interactive configuration wizard that writes the MCP env blocks for your clients.

From a local checkout, npm install does not put this package's own binary on your shell PATH. Use npm run setup from the checkout, or run npm install -g . / npm link if you want the bare vidlens-mcp command while developing.

2. Or configure manually

Claude Desktop — add to ~/Library/Application Support/Claude/claude_desktop_config.json:

{
  "mcpServers": {
    "vidlens-mcp": {
      "command": "npx",
      "args": ["-y", "vidlens-mcp", "serve"]
    }
  }
}

Claude Code — prefer the setup wizard. It registers VidLens in Claude Code's user MCP registry and checks that Claude Code can see it:

npx vidlens-mcp setup --client claude_code
claude mcp list

If you must configure it manually, add the same mcpServers.vidlens-mcp entry to the Claude Code user registry file at ~/.claude.json.
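If you do edit the registry by hand, the merge could be scripted roughly as follows. This is a sketch under assumptions: it treats the file as plain JSON with a top-level `mcpServers` object (as in the Claude Desktop snippet above), the helper name `add_vidlens` is hypothetical, and the demo writes to a temporary file rather than your real ~/.claude.json.

```python
import json
import tempfile
from pathlib import Path

# Same entry as the Claude Desktop example above
VIDLENS_ENTRY = {"command": "npx", "args": ["-y", "vidlens-mcp", "serve"]}

def add_vidlens(config_path):
    """Merge the vidlens-mcp entry into a JSON config, keeping other keys."""
    path = Path(config_path)
    config = json.loads(path.read_text()) if path.exists() else {}
    config.setdefault("mcpServers", {})["vidlens-mcp"] = VIDLENS_ENTRY
    path.write_text(json.dumps(config, indent=2) + "\n")
    return config

# Demo against a throwaway file holding one unrelated, made-up setting
with tempfile.TemporaryDirectory() as tmp:
    demo = Path(tmp) / "claude.json"
    demo.write_text(json.dumps({"theme": "dark"}))
    merged = add_vidlens(demo)
    print(json.dumps(merged, indent=2))
```

Back up the real file before pointing a script like this at it, and confirm the result with claude mcp list.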

3. Restart your MCP client

Fully quit and reopen Claude Desktop (⌘Q). For Claude Code, start a new session or run /mcp again after setup.

4. Try it

Start with "Search YouTube" to activate VidLens:

"Search YouTube for the top M5 Max MacBook Pro reviews and tell me if it's worth upgrading from M4."

"Search YouTube for the best videos about agentic AI for a beginner."

"Import this playlist and search across all videos for mentions of machine learning."

"Search this video's frames for the benchmark comparison chart."

"What's trending in the AI coding niche right now?"


🧰 Tools - 46 across 11 modules

🔍 Explore - YouTube Discovery & Research

The front door — one prompt, full pipeline

| Tool | What it does |
| --- | --- |
| exploreYouTube | Intent-aware search with multi-query ranking, parallel enrichment, transcript summaries, structured benchmark data, and background indexing. One call replaces 5-8 individual tool calls. |

📺 Core - Video & Channel Intelligence

Always available, no API key needed

| Tool | What it does |
| --- | --- |
| findVideos | Search YouTube by query with metadata |
| inspectVideo | Deep metadata - tags, engagement, language, category |
| inspectChannel | Channel stats, description, recent uploads |
| listChannelCatalog | Browse a channel's full video library |
| readTranscript | Full transcript with timestamps and chapters |
| readComments | Top comments with likes and engagement |
| expandPlaylist | List all videos in any playlist |
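For readers wiring up their own client, here is a rough sketch of what a call to one of these tools looks like on the wire. MCP tool invocations are JSON-RPC 2.0 requests with the method `tools/call`; the helper name `tool_call` is hypothetical, and the `query` argument name is a guess, so check the input schema the server reports for each tool.

```python
import json

def tool_call(request_id, name, arguments):
    """Build an MCP tools/call request as a JSON-RPC 2.0 message."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }

# "query" is an assumed argument name, not confirmed by the README
request = tool_call(1, "findVideos", {"query": "M5 Max MacBook Pro review"})

# Over the stdio transport, each message is written as one line of JSON
line = json.dumps(request)
print(line)
```

Claude Desktop and Claude Code build these messages for you; the sketch only matters if you are scripting against the server directly.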

🔎 Knowledge Base - Semantic Search

Index transcripts and search across them with natural language

| Tool | What it does |
| --- | --- |
| importPlaylist | Index an entire playlist's transcripts |
| importVideos | Index specific videos by URL/ID |
| searchTranscripts | Natural language search across indexed content |
| listCollections | Browse your indexed collections |
| setActiveCollection | Scope searches to one collection |
| clearActiveCollection | Search across all collections |
| removeCollection | Delete a collection and its index |

💬 Sentiment & Analysis

Understand what audiences think and feel

| Tool | What it does |
| --- | --- |
| measureAudienceSentiment | Comment sentiment with themes and risk signals |
| analyzeVideoSet | Compare performance across multiple videos |
| analyzePlaylist | Playlist-level engagement analytics |
| buildVideoDossier | Complete single-video deep analysis |

🎯 Creator Intelligence

Insights for content strategy

| Tool | What it does |
| --- | --- |
| scoreHookPatterns | Analyze what makes video openings work |
| researchTagsAndTitles | Tag and title optimization insights |
| compareShortsVsLong | Short-form vs long-form performance |
| recommendUploadWindows | Best times to publish for engagement |

📈 Discovery & Trends

Find what's working in any niche

| Tool | What it does |
| --- | --- |
| discoverNicheTrends | Momentum, saturation, content gaps in any topic |
| exploreNicheCompetitors | Channel landscape and top performers |

🌐 Universal Video Sources

Resolve, search, and import video sources beyond YouTube

| Tool | What it does |
| --- | --- |
| inspectVideoSource | Resolve YouTube, X/Twitter, Instagram, TikTok, generic URLs, and local files into source metadata and capability flags |
| searchVideoSources | Search native YouTube and local assets, with ScrapeCreators support for TikTok/Instagram plus configurable Brave/SerpAPI/DuckDuckGo fallback |
| searchSocialTrends | Search social platforms through ScrapeCreators and return a playlist-like ranked list with engagement metrics and importable URLs where available |
| importVideoSources | Import URLs or local files into the local media store, optionally building a visual index or transcript |
| transcribeVideoSource | Transcribe YouTube, social/generic URLs, and local files into the transcript knowledge base via native captions or configured STT |

🎬 Media Assets

Download and manage video files locally

| Tool | What it does |
| --- | --- |
| downloadAsset | Download or ingest video, audio, or thumbnails from YouTube/social URLs/generic URLs/local files |
| listMediaAssets | Browse stored media files |
| removeMediaAsset | Clean up downloaded assets |
| extractKeyframes | Extract key frames from videos |
| mediaStoreHealth | Storage usage and diagnostics |

🖼️ Visual Search

Three-layer visual intelligence. Not transcript reuse.

| Tool | What it does |
| --- | --- |
| indexVisualContent | Extract frames, run Apple Vision OCR + feature prints, Gemini frame descriptions, and Gemini semantic embeddings |
| searchVisualContent | Search visual frames using semantic embeddings + lexical matching. Returns actual image paths + timestamps as evidence |
| findSimilarFrames | Image-to-image frame similarity using Apple Vision feature prints |

Three layers, all real:

  1. Apple Vision feature prints — image-to-image similarity (find frames that look alike)
  2. Gemini 2.5 Flash frame descriptions — natural language scene understanding per frame
  3. Gemini semantic embeddings — 768-dim embedding retrieval over OCR + description text for true text→visual search

What you always get back: frame path on disk, timestamp, source video URL/title, match explanation, OCR text, visual description.

What is NOT happening: no transcript embeddings are reused for visual search. This is a separate visual index.
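To make the "semantic embeddings + lexical matching" idea concrete, here is a small sketch of one way such a hybrid score could be computed over frame records. Everything in it is an assumption for illustration: the frame paths, the 2-dimensional vectors, the `alpha` weight, and the function names are hypothetical, and the README does not say how VidLens actually blends the two signals.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def lexical_overlap(query, text):
    """Fraction of query tokens that also appear in the text."""
    q = set(query.lower().split())
    t = set(text.lower().split())
    return len(q & t) / len(q) if q else 0.0

def score_frame(query, query_vec, frame, alpha=0.7):
    """Blend semantic and lexical scores; alpha is an assumed weight."""
    text = frame["ocr"] + " " + frame["description"]
    semantic = cosine(query_vec, frame["vector"])
    return alpha * semantic + (1 - alpha) * lexical_overlap(query, text)

# Made-up frame records: path, timestamp, OCR text, description, toy vector
FRAMES = [
    {"path": "frames/vid_a/0001.jpg", "t": 15.0, "ocr": "Geekbench 6 multi-core",
     "description": "bar chart of benchmark scores", "vector": [0.8, 0.6]},
    {"path": "frames/vid_a/0002.jpg", "t": 62.0, "ocr": "",
     "description": "presenter holding a laptop", "vector": [0.1, 0.99]},
]

query = "benchmark chart"
query_vec = [1.0, 0.5]  # stand-in for the embedded query
best = max(FRAMES, key=lambda f: score_frame(query, query_vec, f))
print(best["path"], best["t"])
```

The result carries a frame path and timestamp, mirroring the evidence fields listed above.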

💭 Comment Knowledge Base

Index and semantically search YouTube comments

| Tool | What it does |
| --- | --- |
| importComments | Index a video's comments for search |
| searchComments | Natural language search over comment corpus |
| listCommentCollections | Browse comment collections |
| setActiveCommentCollection | Scope comment searches |
| clearActiveCommentCollection | Search all comment collections |
| removeCommentCollection | Delete a comment collection |

🏥 Diagnostics

Health checks and pre-flight validation

| Tool | What it does |
| --- | --- |
| checkSystemHealth | Full system diagnostic report |
| checkImportReadiness | Validate before importing content |

🔑 API Keys (Optional)

VidLens works without any API keys. Add them to unlock more capabilities:

| Key | What it unlocks | Free? | How to get it |
| --- | --- | --- | --- |
| YOUTUBE_API_KEY | Better metadata, comment API, search via YouTube API | ✅ Free tier (10,000 units/day) | Google Cloud Console → APIs → Enable YouTube Data API v3 → Credentials → Create API Key |
| GEMINI_API_KEY | Higher-quality embeddings for semantic search (768d vs 384d) | ✅ Free tier | Google AI Studio → Get API Key |
| OPENAI_API_KEY | Optional STT provider for transcribeVideoSource | Paid/free trial varies | OpenAI Platform |
| SCRAPECREATORS_API_KEY | Direct social search/trending for TikTok, Instagram, Threads, Pinterest, Reddit, and supported endpoints | 100 free credits, paid after | ScrapeCreators |
| BRAVE_API_KEY / SERPAPI_KEY | Optional structured web search for social/generic URL discovery | Varies | Brave Search API or SerpAPI |

⚠️ The YouTube and Gemini keys come from separate Google services. A Gemini key will NOT work for YouTube API calls and vice versa. Create them independently.

# Configure via setup wizard. It prompts for YouTube, Gemini, OpenAI,
# ScrapeCreators, Brave/SerpAPI, STT, browser cookies, and platform cookies.
npx vidlens-mcp setup

# Or provide everything non-interactively.
npx vidlens-mcp setup \
  --youtube-api-key YOUR_YOUTUBE_KEY \
  --gemini-api-key YOUR_GEMINI_KEY \
  --openai-api-key YOUR_OPENAI_KEY \
  --scrapecreators-api-key YOUR_SCRAPECREATORS_KEY \
  --brave-api-key YOUR_BRAVE_KEY \
  --stt-provider auto \
  --cookies-from-browser chrome

# Or via environment variables
export YOUTUBE_API_KEY=your_youtube_key
export GEMINI_API_KEY=your_gemini_key
export OPENAI_API_KEY=your_openai_key
export SCRAPECREATORS_API_KEY=your_scrapecreators_key
export BRAVE_API_KEY=your_brave_key
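Before starting a session, it can help to see which of the optional keys are visible in the current environment. The following is a small sketch for that, assuming the variable names listed above; the helper name `key_report` is hypothetical and the script does not validate the keys, it only checks that they are set and non-empty.

```python
import os

# Capability notes paraphrased from the key table above
OPTIONAL_KEYS = {
    "YOUTUBE_API_KEY": "better metadata, comment API, search via YouTube API",
    "GEMINI_API_KEY": "higher-quality embeddings for semantic search",
    "OPENAI_API_KEY": "optional STT provider",
    "SCRAPECREATORS_API_KEY": "direct social search/trending",
    "BRAVE_API_KEY": "structured web search",
}

def key_report(env):
    """Map each optional key name to True if it is set and non-empty."""
    return {name: bool(env.get(name, "").strip()) for name in OPTIONAL_KEYS}

for name, present in key_report(os.environ).items():
    status = "set" if present else "not set"
    print(f"{name}: {status} ({OPTIONAL_KEYS[name]})")
```

For a real validity check, npx vidlens-mcp doctor covers API key validation.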

Cookies, STT, and Codex

For platforms that rate-limit anonymous access, the setup wizard can persist cookies by browser profile or file path into Claude/Codex config:

npx vidlens-mcp setup --cookies-from-browser chrome --cookies-profile Default
npx vidlens-mcp setup --x-cookies-file /path/to/x-cookies.txt

Recommended wizard answers for most users:

| Prompt | Recommended answer | Why |
| --- | --- | --- |
| STT provider | Press Enter for auto | Setup checks local whisper.cpp, then Gemini, then OpenAI after you answer |
| Default STT language hint | en if most videos are English; otherwise press Enter | Helps STT quality without locking you in |
| whisper.cpp model path | Press Enter unless you already have a local model file | Gemini/OpenAI fallback is simpler |
| Web search provider | Press Enter for auto | Setup checks Brave/SerpAPI keys, then DuckDuckGo-lite |
| Use browser cookies | Your logged-in browser, e.g. chrome | Lets yt-dlp read social-video cookies during import |
| Browser profile name | Press Enter unless you use a named profile | Most users do not need this |
| Platform-specific cookie files | n unless you exported Netscape cookie files | Browser cookies are easier |

You can also configure them directly in your shell:

export VIDLENS_X_COOKIES_FILE=/path/to/x-cookies.txt
export VIDLENS_INSTAGRAM_COOKIES_FILE=/path/to/instagram-cookies.txt
export VIDLENS_TIKTOK_COOKIES_FILE=/path/to/tiktok-cookies.txt
export VIDLENS_COOKIES_FROM_BROWSER=chrome

STT selection is automatic: local whisper.cpp first, then Gemini, then OpenAI. Override with VIDLENS_STT_PROVIDER=whisper-cpp|gemini|openai|none|auto.
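The selection order described above can be sketched as a small resolver. This is an illustration only: the function name `pick_stt_provider` and the `available` set are hypothetical, how VidLens detects availability is not stated here, and treating an explicit override as final even when that provider is unavailable is an assumption.

```python
import os

AUTO_ORDER = ["whisper-cpp", "gemini", "openai"]
VALID = set(AUTO_ORDER) | {"none", "auto"}

def pick_stt_provider(available, env=None):
    """Return the provider name to use, or None when STT is off or unavailable.

    available: set of provider names usable on this machine (assumed input).
    """
    env = os.environ if env is None else env
    choice = env.get("VIDLENS_STT_PROVIDER", "auto")
    if choice not in VALID:
        raise ValueError(f"unknown STT provider: {choice}")
    if choice == "none":
        return None
    if choice != "auto":
        return choice  # explicit override wins (assumed behavior)
    for name in AUTO_ORDER:
        if name in available:
            return name  # first usable provider in the documented order
    return None

print(pick_stt_provider({"gemini", "openai"}, env={}))
```

With no local whisper.cpp and both cloud keys present, the auto order falls through to Gemini before OpenAI.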

Codex setup:

vidlens-mcp setup --client codex --print-only
vidlens-mcp doctor --no-live
vidlens-mcp update-deps

💻 CLI

npx vidlens-mcp               # Start MCP server (stdio)
npx vidlens-mcp serve         # Start MCP server (explicit)
npx vidlens-mcp setup         # Auto-configure Claude Desktop, Claude Code, Codex, keys, STT, and cookies
npx vidlens-mcp doctor        # Run diagnostics
npx vidlens-mcp update-deps   # Refresh managed yt-dlp and Deno helpers
npx vidlens-mcp version       # Print version
npx vidlens-mcp help          # Usage guide

Doctor - diagnose issues

npx vidlens-mcp doctor --no-live

Checks: Node.js version, yt-dlp freshness, JS runtime, STT and web-search providers, platform readiness, API key validation, data directory health, MCP client registration (Claude Desktop, Claude Code, Codex), and whether claude mcp list can see the Claude Code registration.


📱 Works Everywhere — Desktop, Cowork, Phone

VidLens works across the full Claude ecosystem. Set it up once, use it everywhere.

Claude Desktop — Chat

The classic experience. Ask a question, get charts and analysis inline. Best for interactive research sessions.

Claude Desktop — Cowork Projects (March 2026)

Create a persistent research project with VidLens connected. Claude remembers context across sessions — last week's competitive research informs this week's analysis. Set up scheduled tasks that run automatically:

"Every Monday, search YouTube for new AI agent framework videos and compare to last week's findings."

Claude Dispatch — From Your Phone (March 2026)

Trigger any VidLens research from the Claude mobile app. Ask from your phone, Claude Desktop runs the tools locally, results come back to your pocket:

"Run my competitive research project — what new M5 Max content dropped this weekend?"

Claude Code — Remote Control

Start a Claude Code session with claude --remote-control, then continue from any browser or your phone at claude.ai/code. Full tool access, full context.

Note: Your Mac must be awake with Claude Desktop open for Cowork, Dispatch, and scheduled tasks to execute.


🏗️ Architecture

System Overview

[Diagram: VidLens System Overview]

How the Fallback Chain Works

Every tool that touches YouTube data uses the same resilience pattern:

[Diagram: VidLens Fallback Chain]

Every response includes a provenance field telling you exactly which tier served the data and whether anything was partial. No silent degradation — you always know what happened.
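The pattern described here can be sketched in a few lines. This is a simplified illustration, not the VidLens implementation: the three fetchers are stubs, the function name `with_fallback` is hypothetical, and the exact shape of the provenance object (`tier`, `partial`, `skipped`) is an assumption beyond what the text states.

```python
def with_fallback(tiers):
    """Try each (name, fetch) tier in order and record which one served the data."""
    errors = []
    for name, fetch in tiers:
        try:
            data = fetch()
        except Exception as exc:
            errors.append(f"{name}: {exc}")  # remember why this tier was skipped
            continue
        provenance = {"tier": name, "partial": False, "skipped": errors}
        return {"data": data, "provenance": provenance}
    raise RuntimeError("all tiers failed: " + "; ".join(errors))

def youtube_api():
    raise RuntimeError("quota exceeded")  # simulate an exhausted quota

def yt_dlp():
    return {"title": "Example video"}

def page_extract():
    return {"title": "Example video (scraped)"}

result = with_fallback([
    ("youtube_api", youtube_api),
    ("yt_dlp", yt_dlp),
    ("page_extract", page_extract),
])
print(result["provenance"])
```

Because the skipped tiers are reported alongside the data, a caller can tell a clean first-tier answer from one that degraded to a later tier.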

Visual Search Pipeline

Visual search is not transcript reuse. It's a dedicated three-layer index:

[Diagram: VidLens Visual Search Pipeline]

Three layers, all real:

  1. Apple Vision feature prints — image-to-image similarity (find frames that look alike)
  2. Gemini Vision frame descriptions — natural language scene understanding per frame
  3. Gemini semantic embeddings — 768-dim retrieval over OCR + description text

Data Storage

Everything lives in a single directory. No external databases, no Docker, no infrastructure.

[Diagram: VidLens Data Storage]

One directory. Portable. Back it up by copying. Delete it to start fresh.


📋 Requirements

| Requirement | Status | Notes |
| --- | --- | --- |
| Node.js ≥ 22 | Required | Uses node:sqlite; run node --version to check |
| yt-dlp | Auto-installed | Downloaded automatically during npx vidlens-mcp setup |
| ffmpeg + ffprobe | Recommended for universal video | Needed for Instagram/TikTok/X reels, local video files, STT audio chunking, frame extraction, and visual indexing. Setup/doctor detects it and suggests brew install ffmpeg on macOS |
| YouTube API key | Optional | Unlocks comments, better metadata |
| Gemini API key | Optional | Upgrades transcript embeddings and frame descriptions for visual search |
| macOS Apple Vision | Automatic on macOS | Powers native OCR and image similarity for visual search |

🔧 Troubleshooting

"Tool not found" in Claude Desktop

Fully quit Claude Desktop (⌘Q, not just close window) and reopen. MCP servers only load on startup.

"YOUTUBE_API_KEY not configured" warning

This is informational, not an error. VidLens works without it. Add a key only if you need comments/sentiment features.

"API_KEY_SERVICE_BLOCKED" error

Your API key has restrictions. Create a new unrestricted key in Google Cloud Console, or remove the API restriction from the existing key.

Gemini key doesn't work for YouTube API

These are separate services. You need a YouTube API key from Google Cloud Console AND a Gemini key from Google AI Studio. They are not interchangeable.

Build errors

npx vidlens-mcp doctor     # Run diagnostics
npx vidlens-mcp doctor --no-live  # Skip network checks

Instagram/TikTok/X reel downloads but visual analysis fails

Install ffmpeg/ffprobe, then rerun setup or doctor:

brew install ffmpeg
vidlens-mcp setup
vidlens-mcp doctor --no-live

downloadAsset can often fetch the video without ffmpeg, but indexVisualContent, extractKeyframes, local-file ingestion, and STT chunking need ffmpeg/ffprobe.
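A quick way to confirm both binaries are on your PATH before retrying is a check like the one below. It is a standalone sketch, not part of VidLens; the function name `missing_tools` is hypothetical, and the `which` parameter exists only so the lookup can be swapped out.

```python
import shutil

def missing_tools(tools=("ffmpeg", "ffprobe"), which=shutil.which):
    """Return the tools from the list that are not found on PATH."""
    return [tool for tool in tools if which(tool) is None]

missing = missing_tools()
if missing:
    print("Missing:", ", ".join(missing))
else:
    print("ffmpeg and ffprobe found")
```

vidlens-mcp doctor performs its own detection; this only answers whether the shell you are in can see the tools.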


📄 License

MIT


GitHub · npm · Model Context Protocol


Configuration

YOUTUBE_API_KEY (secret)

YouTube Data API v3 key (optional; works without it via fallback scraping)

GEMINI_API_KEY (secret)

Google Gemini API key for AI-powered visual descriptions (optional)

Categories: Databases, Media & Entertainment
Registry: active
Package: vidlens-mcp
Transport: STDIO
Auth: Required
Updated: Apr 8, 2026