BROWSER AGENT

Your QA team is an AI.

Goal in English. Real Chromium. Screenshot + DOM trace per turn.

$ npm i -g @tangle-network/browser-agent-driver Copied

Get Started Star on GitHub

your-app.com/dashboard

200 · 312ms

turn 2 · click "Sign up"

turn 1 · goto turn 2 · click turn 3 · verify ✓ goal · 4.7s

plain-English goals | vision-enabled, any site | screenshot + DOM evidence per turn | MIT licensed · npm i -g

HOW IT RUNS

CLI for one-offs. SDK for CI.

Same engine, two surfaces. bad run drives an attached Chrome for explore-and-debug; the SDK runs headless in CI with deterministic seeds.

$ bad run --goal "Sign up, verify dashboard loads" \ --url https://your-app.com --model claude-sonnet-4-6 # turn 1 · navigate → goto https://your-app.com · 312ms · 200 OK # turn 2 · plan → click "Sign up" · header CTA → type "[email protected]" · #email → type "••••••••" · #password → click "Create account" # turn 3 · verify → wait_for 'h1:has-text("Dashboard")' · visible in 814ms → screenshot · artifacts/turn-3.png ✓ Goal achieved · 3 turns · 4.7s · 1 screenshot · DOM trace

WHERE TEAMS RUN IT

From PR smoke to release-blocker audits.

Describe what matters. The agent finds the bug, keeps the evidence.

PR SMOKE

Every PR runs 30+ goals.

Run on every preview URL. Fails the build on a miss, posts evidence to the PR.

DESIGN AUDITS

WCAG + brand in one pass.

Per-page punchlist: contrast ratios, spacing drift, broken components.

COMPETITIVE RECON

Watch what competitors ship.

Diff copy, pricing, flows over time. Pipe to Slack.

ONE COMMAND

Describe the test. Get evidence.

CLI for one-offs. TypeScript SDK for CI.

BROWSER AGENT WORKFLOWS

Watch a real run, turn by turn.

Goal in. Goto, click, type, verify. Screenshot + DOM evidence per turn. Real @tangle-network/browser-agent-driver API.

$ bad run \ --goal "Sign up, verify email, check dashboard" \ --url https://your-app.com \ --model claude-sonnet-4-6 --vision # turn 1 · navigate → goto /signup · 312ms · 200 OK → type "[email protected]" · #email → click "Create account" → wait_for 'h1:has-text("Dashboard")' · 814ms → screenshot · artifacts/turn-3.png ✓ Goal achieved · 6 turns · 12.1s · video + DOM trace

Capabilities

One package: E2E, regression, audit, web3.

E2E Testing

Whole flows in one sentence.

Signup, checkout, onboarding: one prompt, one run.

navigate

→

type email

→

type password

→

click "Sign Up"

→

verify dashboard

Visual Regression

Catch design drift.

Vision-diff against baselines, per page.

98%

0 visual regressions detected

Self-Healing

Recover without human eyes.

Stuck detection, auto-retry, overlay dismiss.

stuck detected

recovered → continue

Design Audit

Score any site's UI in one pass.

Extract tokens, flag drift, return a structured report.

health score

⚠ Typography: 3 inconsistent sizes

✓ Colors: consistent palette

✓ Contrast: WCAG AA compliant

Parallel Execution

1 to 100 browsers.

Concurrent, dependency-aware.

Trajectory Memory

Reliability climbs with every run.

Case-based reasoning over prior runs.

Reliability 98%

Learn more

Frequently Asked Questions

What is Browser Agent?

A browser automation agent. Describe a goal in English. It launches Chromium, navigates, clicks, types, verifies, and saves screenshots and DOM per turn.

How does it work?

Goal + URL in. Vision-capable model decides each action, Playwright executes it. Every turn produces a screenshot, DOM snapshot, and reasoning trace.

What models does it support?

Claude, GPT, and other vision-capable models. Choose per run with --model.

Is it open source?

Yes. MIT. npm i -g @tangle-network/browser-agent-driver. Source: github.com/tangle-network/browser-agent-driver.

Your QA team is an AI.

npm install. Describe a goal. Open source, MIT.

Launch Browser Agent