OpenClaw Skill

captcha-relay

Human-in-the-loop CAPTCHA solving with two modes: screenshot (default, zero infrastructure) and token relay (requires network access). Screenshot mode captures the page with a grid overlay, sends it to the human, and injects clicks based on their reply. Token relay mode detects CAPTCHA type + sitekey, serves the real widget on a relay page for native solving, and injects the token via CDP.

Install

$npx clawhub@latest install captcha-relay
All-time installs4
Active installs4
Stars0

CAPTCHA Relay v2

Solve CAPTCHAs by relaying them to a human. Two modes available.

Modes

Screenshot Mode (default) — No infrastructure needed

Grid overlay screenshot → send image to human via Telegram → human replies with cell numbers → inject clicks.

  • Zero setup beyond the skill itself. No Tailscale, no tunnels, no relay server.
  • Works for any CAPTCHA type (reCAPTCHA, hCaptcha, sliders, text, etc.)
  • Uses sharp for image processing + CDP for screenshots and click injection.
bash
node index.js                       # screenshot mode (default)
node index.js --mode screenshot     # explicit
node index.js --screenshot          # legacy alias
js
const { solveCaptchaScreenshot } = require('./index');
const capture = await solveCaptchaScreenshot({ cdpPort: 18800 });
// capture.imagePath — annotated screenshot to send to human
// capture.prompt — text prompt for the human

Token Relay Mode — Requires network access

Detects CAPTCHA type + sitekey → serves real widget on relay page → human solves natively → token injected via CDP.

  • Requires Tailscale or a tunnel (localtunnel/cloudflared) so the human's device can reach the relay server.
  • Produces a proper CAPTCHA token — more reliable for reCAPTCHA v2, hCaptcha, Turnstile.
  • Best when you have Tailscale already set up.
bash
node index.js --mode relay              # with localtunnel
node index.js --mode relay --no-tunnel  # with Tailscale/LAN
js
const { solveCaptcha } = require('./index');
const result = await solveCaptcha({ cdpPort: 18800, useTunnel: false });
// result.relayUrl — URL to send to human
// result.token — solved CAPTCHA token

When to Use Each

ScenarioMode
Quick & easy, no setupscreenshot
Any CAPTCHA type (sliders, text, etc.)screenshot
Known CAPTCHA with sitekey (reCAPTCHA, hCaptcha, Turnstile)relay
Tailscale already configuredrelay
No network access to hostscreenshot

CLI Flags

FlagDefaultDescription
--mode screenshot|relayscreenshotSelect solving mode
--screenshotAlias for --mode screenshot
--no-injectinjectReturn token without injecting into browser
--no-tunneltunnelSkip tunnel, use local/Tailscale IP (relay mode)
--timeout N120Timeout in seconds
--cdp-port N18800Chrome DevTools Protocol port

Agent Workflow

Screenshot mode (simplest)

  1. Call solveCaptchaScreenshot({ cdpPort })
  2. Send capture.imagePath to human via message tool with capture.prompt
  3. Human replies with cell numbers (e.g. "1,3,5,7")
  4. Call injectGridClicks(cdpPort, capture, selectedCells) to click those cells

Relay mode

  1. Call solveCaptcha({ useTunnel: false }) (Tailscale) or solveCaptcha() (tunnel)
  2. Send result.relayUrl to human via message tool
  3. Wait — resolves when human completes the CAPTCHA
  4. Token is auto-injected; continue automation

Requirements

  • Chrome/Chromium with --remote-debugging-port=18800
  • Node.js 18+ and npm install (deps: ws, sharp)
  • Relay mode only: Tailscale or internet for tunnel

Persistent memory

Give your OpenClaw agent a memory layer

Mem0 remembers users and context across sessions so you send fewer tokens and get better answers.

Try Mem0Mem0 + OpenClaw guide