This server wraps Apify scrapers to search Bluesky posts, Substack newsletters, and Hacker News stories from a single interface. Each endpoint returns structured JSON with engagement metrics like points, comments, and post counts. It supports both traditional API key auth and x402 micropayments at $0.05 per call on Base, so you can pay per request without setting up an account. Useful when you're building agents that need to pull discussions or articles from multiple platforms without managing separate API clients. The server exposes standard discovery endpoints for MCP, A2A agent cards, and OpenAPI specs.
Web scraping APIs for Bluesky, Substack, and Hacker News with x402 micropayment support. Built with FastAPI.
Live: https://frog03-20494.wykr.es
| Method | Endpoint | Description | Price |
|---|---|---|---|
| POST | /api/bluesky/search | Search Bluesky posts by keyword | $0.05 |
| POST | /api/substack/search | Scrape Substack newsletter articles | $0.05 |
| POST | /api/hn/search | Search Hacker News stories | $0.05 |
# Clone
git clone https://github.com/MarcinDudekDev/the-data-collector.git
cd the-data-collector
# Install
python3 -m venv venv && source venv/bin/activate
pip install -r requirements.txt
# Configure
cp .env.example .env
# Edit .env with your APIFY_TOKEN and API_KEY
# Run
uvicorn server:app --host 0.0.0.0 --port 8001
docker build -t the-data-collector .
docker run -p 8001:8001 --env-file .env the-data-collector
| Variable | Required | Description |
|---|---|---|
APIFY_TOKEN | Yes | Apify API token for running scrapers |
API_KEY | No | API key for authenticated access (X-API-Key header) |
BASE_URL | No | Public URL of the server (default: https://frog03-20494.wykr.es) |
PAY_TO | No | Wallet address for x402 payments |
PRICE_ATOMIC | No | Price per call in USDC atomic units (default: 50000 = $0.05) |
Send a POST request without credentials. You'll receive a 402 response with payment requirements. Pay $0.05 USDC on Base — settlement is instant.
# First call returns 402 with payment details
curl -X POST https://frog03-20494.wykr.es/api/hn/search \
-H "Content-Type: application/json" \
-d '{"searchTerms": ["AI agents"]}'
curl -X POST https://frog03-20494.wykr.es/api/hn/search \
-H "Content-Type: application/json" \
-H "X-API-Key: your-key" \
-d '{"searchTerms": ["AI agents"], "maxResults": 5}'
| Endpoint | Protocol |
|---|---|
/.well-known/mcp.json | MCP (Model Context Protocol) |
/.well-known/agent-card.json | A2A (Agent-to-Agent) |
/.well-known/x402 | x402 payment discovery |
/.well-known/openapi.json | OpenAPI 3.1 spec |
/health | Health check |
{
"mcpServers": {
"the-data-collector": {
"url": "https://frog03-20494.wykr.es/.well-known/mcp.json"
}
}
}
MIT
com.mcparmory/google-search
io.github.pipeworx-io/brave-search
marcopesani/mcp-server-serper
brave/brave-search-mcp-server
com.mcparmory/google-search-console
acamolese/google-search-console-mcp