AWT (AI Watch Tester)

ksgisang/awt-skill
auth · STDIO · registry active
Summary

Connects Claude to a Playwright-based testing engine that can navigate pages, find elements via OCR and image matching, and execute declarative YAML test scenarios. Exposes operations for generating test specs from natural language, running browser automation with humanized mouse movements, taking screenshots, and analyzing failures. The DevQA Loop feature auto-fixes failing tests by reading error logs and source code, then re-running scenarios until they pass. Stores successful element matches in SQLite for faster subsequent runs. Useful when you want Claude to write end-to-end tests that actually click through your UI and self-heal when things break, rather than just generating test code you have to run separately.


AWT — Eyes and Hands for Your AI Coding Tool

Your AI coding tool is smart. But it can't see or click a web page. AWT gives it a real browser — so it can test, find bugs, and fix them without you lifting a finger.

License: AGPL-3.0 · Agent Skills

Works without an AI API: write YAML scenarios by hand and run aat run. No API key is needed. AI coding tools make this easier but are not required.


What is AWT?

AWT is the execution engine for AI-powered E2E testing. Your AI coding tool designs the tests; AWT runs them in a real browser with Playwright.

How it works in Skill Mode (no extra AI API key needed):

You: "Test the login flow on https://mysite.com"

Your AI coding tool:
  → Writes 5 YAML scenarios (30 steps)
  → Runs: aat run scenarios/
  → Reads failure: "Step 4: Dashboard text not visible"
  → Reads screenshot + source code
  → Fixes src/pages/login.js:23
  → Re-runs: aat run scenarios/
  → All 5 scenarios pass ✓
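The run, read, fix, re-run cycle above can be sketched as a small retry loop. This is a minimal illustration, not AWT's implementation: `run_until_green`, `fix`, and `max_attempts` are hypothetical names, and it assumes that `aat run` exits with a non-zero status when a scenario fails, which the text above does not state.

```python
import subprocess
from typing import Callable, Sequence


def run_until_green(
    command: Sequence[str],
    fix: Callable[[str], None],
    max_attempts: int = 3,
    runner: Callable[..., subprocess.CompletedProcess] = subprocess.run,
) -> bool:
    """Run `command` up to `max_attempts` times, calling `fix` after each failure.

    `fix` receives the captured output so it can decide what to change.
    Returns True as soon as one run succeeds, False if every attempt fails.
    """
    for _ in range(max_attempts):
        result = runner(command, capture_output=True, text=True)
        # Assumption: exit code 0 means every scenario passed.
        if result.returncode == 0:
            return True
        fix(result.stdout + result.stderr)
    return False
```

A caller would pass something like `["aat", "run", "scenarios/"]` as `command`. In Skill Mode the `fix` step is performed by the AI coding tool itself, so the callback here only marks where that work would happen.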

Key capabilities:

  • Execute tests in a real browser with human-like mouse/keyboard interaction
  • See — take screenshots, detect elements via OCR and image matching
  • Report — step-by-step pass/fail with error details and screenshots
  • Self-heal (CLI mode) — aat loop auto-fixes failures with its own AI
  • Learn — successful matches stored in SQLite, getting faster over time
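To make the "Learn" capability concrete, here is a minimal sketch of how successful element matches could be stored in SQLite and looked up on later runs. The table layout and the function names (`open_store`, `remember`, `recall`) are assumptions made for illustration; AWT's actual schema is not described here.

```python
import sqlite3
from typing import Optional, Tuple


def open_store(path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) a match store. The schema is hypothetical."""
    conn = sqlite3.connect(path)
    conn.execute(
        """CREATE TABLE IF NOT EXISTS matches (
               page_url    TEXT NOT NULL,
               target_text TEXT NOT NULL,
               x           INTEGER NOT NULL,
               y           INTEGER NOT NULL,
               hits        INTEGER NOT NULL DEFAULT 1,
               PRIMARY KEY (page_url, target_text)
           )"""
    )
    return conn


def remember(conn: sqlite3.Connection, page_url: str, target_text: str, x: int, y: int) -> None:
    """Record a successful match, updating the position if one is already stored."""
    conn.execute(
        """INSERT INTO matches (page_url, target_text, x, y) VALUES (?, ?, ?, ?)
           ON CONFLICT(page_url, target_text)
           DO UPDATE SET x = excluded.x, y = excluded.y, hits = hits + 1""",
        (page_url, target_text, x, y),
    )
    conn.commit()


def recall(conn: sqlite3.Connection, page_url: str, target_text: str) -> Optional[Tuple[int, int]]:
    """Return the last known position for a target, or None if it was never matched."""
    row = conn.execute(
        "SELECT x, y FROM matches WHERE page_url = ? AND target_text = ?",
        (page_url, target_text),
    ).fetchone()
    return (row[0], row[1]) if row else None
```

A runner could call `recall` before falling back to OCR or image matching, which is one plausible reading of "getting faster over time".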

Installation

One-line install (recommended)

npx skills add ksgisang/awt-skill --skill awt -g

Manual Installation

git clone https://github.com/ksgisang/awt-skill.git /tmp/awt-skill
cp -r /tmp/awt-skill/awt ~/.claude/skills/awt
rm -rf /tmp/awt-skill

Per-Project Installation

git clone https://github.com/ksgisang/awt-skill.git /tmp/awt-skill
cp -r /tmp/awt-skill/awt .claude/skills/awt
rm -rf /tmp/awt-skill

After installation, the skill auto-triggers on testing-related prompts.

Updating

# Re-run the same install command to get the latest version
npx skills add ksgisang/awt-skill --skill awt -g -y

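For manual installations: remove ~/.claude/skills/awt and repeat the manual installation steps. The copied directory is not a git checkout, so git pull does not work there.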

Comparison with Other Testing Skills

| Feature | webapp-testing | playwright-skill | AWT |
|---|---|---|---|
| Scenario format | Python scripts | Python scripts | Declarative YAML |
| No-code test creation | No | No | Yes |
| Natural language → test | No | No | Yes |
| Self-healing loop | No | No | Yes (DevQA Loop) |
| Auto-fix on failure | No | No | Yes (AI → PR) |
| Visual matching (OpenCV + OCR) | No | No | Yes (OpenCV + OCR) |
| Pattern learning | No | No | Yes (SQLite) |
| Server lifecycle mgmt | Yes | Yes | Yes (aat dashboard) |
| Multiple AI providers | No | No | Yes (5 providers) |
| Human-like interaction | No | No | Yes (Bezier mouse) |
| Approval modes | No | No | Yes (manual/branch/auto) |
| Cost optimization | N/A | N/A | $0.02–0.05/test |
| Live screenshots | No | Screenshot only | Yes (WebSocket stream) |
| Multi-language OCR | No | No | Yes (10+ languages) |
| Offline support | No | No | Yes (Ollama) |
| Cost tracking | No | No | Yes (per-call logging + aat cost) |
| Source code analysis | No | No | Yes (Skill-exclusive) |
| Scenario caching | No | No | Yes (same spec = no re-call) |
| Browser test overlay | No | No | Yes (live step progress) |
| Dependency ordering | No | No | Yes (depends_on field) |
| Skill Mode (no extra AI cost) | No | No | Yes |
| Canvas/Flutter OCR fallback | No | No | Yes (auto) |
| Platform auto-detection | No | No | Yes (7 frameworks) |
| Structured failure diagnosis | No | No | Yes (AI-independent) |
| Multi-document YAML | No | No | Yes (--- separator) |
| Strict validation | No | No | Yes (aat validate --strict) |

When to use each:

  • webapp-testing — Quick one-off Playwright scripts for simple page checks
  • playwright-skill — When you need fine-grained Playwright API control
  • AWT Skill Mode — Your AI coding tool designs tests, AWT executes them. No extra AI API key needed.
  • AWT CLI Mode — Standalone automated testing with aat generate + aat loop for CI/CD

Supported AI Coding Tools

AWT follows the Agent Skills open standard and works with:

| Tool | Status |
|---|---|
| Claude Code | Supported |
| Cursor | Compatible |
| Codex | Compatible |
| Gemini CLI | Compatible |
| Amp | Compatible |
| Cline | Compatible |
| Aider | Compatible |
| Windsurf | Compatible |
| Roo Code | Compatible |
| PearAI | Compatible |
| Antigravity | Supported |

Quick Example

YAML Scenario

id: "SC-001"
name: "User Login"
tags: ["auth", "login"]
depends_on: ["SC-000"]
steps:
  - step: 1
    action: navigate
    value: "{{url}}/login"
    description: "Go to login page"

  - step: 2
    action: find_and_type
    target:
      text: "Email"
      match_method: ocr
    value: "test@example.com"
    humanize: true
    description: "Enter email"

  - step: 3
    action: find_and_click
    target:
      text: "Login"
    description: "Click login button"

  - step: 4
    action: assert
    assert_type: text_visible
    expected:
      - type: text_visible
        value: "Welcome back"
    description: "Verify login success"
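The scenario above uses a `{{url}}` placeholder in the navigate step. How AWT resolves it is not shown here, so the following is only a sketch of one common approach: replacing `{{name}}` tokens from a mapping of variables. The function name `render` and the strict failure on unknown names are assumptions.

```python
import re
from typing import Mapping

# Matches {{name}} with optional spaces inside the braces.
_PLACEHOLDER = re.compile(r"\{\{\s*(\w+)\s*\}\}")


def render(value: str, variables: Mapping[str, str]) -> str:
    """Replace every {{name}} in `value` with variables[name]."""

    def substitute(match: "re.Match[str]") -> str:
        name = match.group(1)
        if name not in variables:
            # Fail loudly rather than navigating to a half-rendered URL.
            raise KeyError(f"no value supplied for placeholder {name!r}")
        return variables[name]

    return _PLACEHOLDER.sub(substitute, value)
```

For example, `render("{{url}}/login", {"url": "https://mysite.com"})` yields `https://mysite.com/login`.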

Natural Language (in Claude Code)

"Login to mysite.com with test@example.com, then check if the dashboard loads"

AWT converts this to a YAML scenario and executes it automatically.
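The `depends_on` field in the YAML scenario suggests that scenarios are ordered so that prerequisites run first. A minimal sketch of that idea, using a topological sort from Python's standard library, is shown below. The `execution_order` helper is hypothetical, and the scenarios are plain dictionaries standing in for parsed YAML.

```python
from graphlib import CycleError, TopologicalSorter  # Python 3.9+
from typing import Dict, List, Set


def execution_order(scenarios: List[dict]) -> List[str]:
    """Return scenario ids ordered so each one runs after everything it depends on.

    Raises graphlib.CycleError if the dependencies form a loop.
    """
    graph: Dict[str, Set[str]] = {
        scenario["id"]: set(scenario.get("depends_on", [])) for scenario in scenarios
    }
    return list(TopologicalSorter(graph).static_order())
```

With `SC-001` depending on `SC-000`, as in the example, `SC-000` is placed first. A circular dependency is reported as an error instead of producing an arbitrary order.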

5-Step DevQA Workflow

Scan → Plan → Review → Execute → Heal
  ↑                                 |
  └─── Loop back on failure ────────┘
  1. Scan — Crawl site, analyze DOM, detect features
  2. Plan — AI generates YAML test scenarios
  3. Review — User reviews/edits before execution
  4. Execute — Playwright runs with live screenshots
  5. Heal — AI analyzes failure → suggests fix → re-tests
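One way to read the diagram above is as a small state machine: the stages run in order, a failed Execute leads to Heal, and Heal loops back to the start. The sketch below encodes that reading. The names `Stage`, `next_stage`, and `trace` are hypothetical, and the choice to loop back to Scan follows the arrow in the diagram rather than any documented behavior.

```python
from enum import Enum
from typing import Iterable, List, Optional


class Stage(Enum):
    SCAN = "Scan"
    PLAN = "Plan"
    REVIEW = "Review"
    EXECUTE = "Execute"
    HEAL = "Heal"


_LINEAR = [Stage.SCAN, Stage.PLAN, Stage.REVIEW, Stage.EXECUTE]


def next_stage(current: Stage, passed: bool = True) -> Optional[Stage]:
    """Return the stage that follows `current`, or None when the workflow is done."""
    if current is Stage.EXECUTE:
        return None if passed else Stage.HEAL
    if current is Stage.HEAL:
        return Stage.SCAN  # assumption: the loop re-enters at Scan, as drawn
    return _LINEAR[_LINEAR.index(current) + 1]


def trace(execute_results: Iterable[bool]) -> List[str]:
    """List the stages visited, given the pass/fail outcome of each Execute run."""
    results = iter(execute_results)
    stage: Optional[Stage] = Stage.SCAN
    visited: List[str] = []
    while stage is not None:
        visited.append(stage.value)
        # If the outcomes run out, treat the run as passing so the loop ends.
        passed = next(results, True) if stage is Stage.EXECUTE else True
        stage = next_stage(stage, passed)
    return visited
```

Calling `trace([False, True])` walks the five stages once, then repeats Scan through Execute after the heal step.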

AI Providers

| Provider | Vision | Cost | Offline |
|---|---|---|---|
| Claude (default) | Yes | Medium | No |
| OpenAI (GPT-4o) | Yes | Higher | No |
| Gemini (default: free) | Yes | Free tier | Yes |
| DeepSeek | No | Low | No |
| Ollama | No | Free | Yes |

Project Structure

awt-skill/
├── awt/                        # ← Skill content (installed by npx skills)
│   ├── SKILL.md                # Main skill definition
│   ├── references/
│   │   ├── scenario-schema.md  # Full YAML schema reference
│   │   ├── cli-reference.md    # CLI command reference
│   │   └── config-reference.md # Configuration options
│   └── templates/
│       ├── scenario-template.yaml
│       └── config-template.yaml
├── README.md
├── LICENSE
├── CONTRIBUTING.md
└── package.json

Key CLI Commands

| Command | Description |
|---|---|
| aat doctor | Check environment (Python, Playwright, Tesseract, AI) |
| aat init | Initialize project + AI setup + environment check |
| aat setup | Configure AI provider and API key |
| aat generate | AI-generate scenarios (with cost estimate + caching) |
| aat run --learn | Execute tests + learn from fixes (always use --learn) |
| aat loop | Self-healing DevQA loop |
| aat cost | View AI API usage costs |
| aat validate --strict | Validate YAML + quality checks |
| aat learn platform -p <key> -t <tip> | Add platform-specific tip |
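The caching mentioned for `aat generate` ("same spec = no re-call" in the comparison table) can be illustrated with a small content-addressed cache. This is a sketch under assumptions: the `ScenarioCache` class, the SHA-256 key, and the in-memory store are all hypothetical, and the real tool presumably persists its cache somewhere.

```python
import hashlib
from typing import Callable, Dict


class ScenarioCache:
    """Call `generate` once per distinct spec and reuse the result afterwards."""

    def __init__(self, generate: Callable[[str], str]) -> None:
        self._generate = generate
        self._store: Dict[str, str] = {}
        self.calls = 0  # number of times the (possibly paid) generator ran

    def get(self, spec: str) -> str:
        # Key on the spec text, ignoring leading and trailing whitespace.
        key = hashlib.sha256(spec.strip().encode("utf-8")).hexdigest()
        if key not in self._store:
            self.calls += 1
            self._store[key] = self._generate(spec)
        return self._store[key]
```

Because the key depends only on the spec text, asking for the same spec twice triggers a single generator call, which is what keeps repeated runs from incurring new AI costs.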

System Dependencies

macOS:

brew install python@3.12 tesseract
pipx install aat-devqa
playwright install chromium

Linux (Ubuntu/Debian):

sudo apt install python3.12 python3.12-venv tesseract-ocr
pipx install aat-devqa
playwright install chromium

Windows:

winget install Python.Python.3.12
choco install tesseract
pip install aat-devqa
playwright install chromium

After install, run aat doctor to verify everything works.
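For a rough idea of what such an environment check involves, the sketch below looks for the command-line tools named in the install steps. It is not how `aat doctor` works internally. The `preflight` function is hypothetical, and the Python 3.12 minimum is an assumption taken from the install commands above, not a documented requirement.

```python
import shutil
import sys
from typing import Dict, Sequence, Tuple


def preflight(
    tools: Sequence[str] = ("tesseract", "playwright", "aat"),
    min_python: Tuple[int, int] = (3, 12),  # assumption, see lead-in
) -> Dict[str, bool]:
    """Report whether the interpreter is recent enough and each tool is on PATH."""
    report: Dict[str, bool] = {"python": sys.version_info >= min_python}
    for tool in tools:
        report[tool] = shutil.which(tool) is not None
    return report


if __name__ == "__main__":
    for name, ok in preflight().items():
        print(f"{name}: {'ok' if ok else 'missing'}")
```

This only confirms that the executables can be found. It does not verify that the Chromium browser was downloaded or that an AI provider is configured, which `aat doctor` is described as covering.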

Links

  • Main Repository: github.com/ksgisang/AI-Watch-Tester
  • Cloud Demo: ai-watch-tester.vercel.app
  • Agent Skills Standard: agentskills.io

License

AGPL-3.0 — see LICENSE file for full text.

Built by AILoopLab.


Configuration

ANTHROPIC_API_KEY (secret)

Anthropic API key (optional; required only when using provider=claude)

OPENAI_API_KEY (secret)

OpenAI API key (optional; required only when using provider=openai)
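The rule above (a key is needed only for the matching provider) can be checked before starting a run. The sketch below is a minimal illustration: `missing_key` is a hypothetical helper, and the mapping covers only the two variables listed here. Other providers are treated as having no listed requirement.

```python
import os
from typing import Mapping, Optional

# Only the two variables documented in this section.
REQUIRED_KEYS = {
    "claude": "ANTHROPIC_API_KEY",
    "openai": "OPENAI_API_KEY",
}


def missing_key(provider: str, env: Mapping[str, str] = os.environ) -> Optional[str]:
    """Return the name of the unset variable for `provider`, or None if nothing is missing."""
    name = REQUIRED_KEYS.get(provider)
    if name is None:
        return None  # no requirement listed in this section for this provider
    return None if env.get(name) else name
```

For example, `missing_key("claude", {})` returns `"ANTHROPIC_API_KEY"`, while `missing_key("ollama", {})` returns `None`.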

Registry: active
Package: aat-devqa
Transport: STDIO
Auth: Required
Updated: May 28, 2026