tot-agent¶

Autonomous browser agent for scripted GUI testing.

tot-agent drives a real Playwright browser with Claude vision + tool-use to execute natural-language test scenarios against any web application. It was originally built to seed and test the This-or-That A/B book-cover testing platform, but its core components are generic enough to target any web GUI.

How it works¶

sequenceDiagram
    participant User
    participant CLI
    participant BrowserAgent
    participant Claude
    participant Browser

    User->>CLI: tot-agent seed --tests 5
    CLI->>BrowserAgent: run(goal_string)
    loop Agent loop
        BrowserAgent->>Claude: messages.create(goal + history + tools)
        Claude-->>BrowserAgent: tool_use blocks
        BrowserAgent->>Browser: execute tool (navigate / click / screenshot…)
        Browser-->>BrowserAgent: result
        BrowserAgent->>Claude: tool_result blocks
        Claude-->>BrowserAgent: end_turn + summary text
    end
    BrowserAgent-->>CLI: summary string
    CLI-->>User: display result

Goal — you provide a plain-English objective.
Plan — Claude decides which tools to call (navigate, click, fill, screenshot, …).
See — after each action, a screenshot is taken; Claude looks at it to decide what to do next.
Act — tools execute in a real Playwright browser.
Report — the agent summarises what it accomplished.

Because the agent uses screenshots + vision rather than hardcoded selectors, it adapts to your actual UI structure automatically.

Quick start¶

# 1. Clone and set up
git clone https://github.com/mattbriggs/this-or-that-agent
cd this-or-that-agent
python -m venv .venv && source .venv/bin/activate
pip install -e ".[dev]"

# 2. Install Playwright browsers (one-time)
playwright install chromium

# 3. Configure
cp .env.example .env
# Edit .env — add your ANTHROPIC_API_KEY

# 4. Run!
tot-agent seed --tests 3 --headless

Key features¶

Feature	Description
Vision-first navigation	Adapts to any UI using screenshots, not fragile selectors
Multi-user contexts	Simulates multiple logged-in users with isolated browser sessions
Cover fetching	Pulls real book covers from Open Library / Google Books
Full-featured CLI	`create`, `vote`, `simulate`, `seed`, `goal`, `users`, `info`, `covers`
Strategy pattern	Swap or extend cover sources without changing orchestration code
Observer pattern	Attach loggers, monitors, or custom reporters to the agent loop
pytest suite	Unit + integration tests with coverage reports

Installation — prerequisites and setup
Usage Guide — all CLI commands with examples
API Reference — auto-generated module docs
Software Design — architecture, patterns, diagrams
Requirements (SRS) — IEEE 830 specification
Project Roadmap — planned features and milestones

tot-agent¶

How it works¶

Quick start¶

Key features¶

Navigation¶