Chrome Relay docs

Your agent drives the Chrome you're signed into. It reads pages, clicks buttons, and fills forms. Nothing leaves the machine.

Chrome Relay lets a terminal agent operate your real Chrome.

Claude Code, Codex, anything that runs shell commands.

Not a headless automation browser with an empty profile. The actual browser you're signed into, with your cookies, SSO sessions, extensions, and localhost tabs.

It works on background tabs without stealing focus, so you keep browsing while the agent works.

chrome-relay tabs                             # find or open a tab
chrome-relay navigate "https://kushalsm.com" --new   # opens in the background
chrome-relay snapshot --tab 1234 -i           # see the page: actionable elements get @refs
chrome-relay click @e12                       # act on a ref, no selectors, no --tab
chrome-relay fill @e14 "hello"
chrome-relay snapshot --tab 1234 -i           # look again after the page changes

That's the whole loop. A snapshot of the Hacker News front page is ~14 KB of plain text. A click round-trip is ~150 ms.

What it is, in one diagram

your agent (terminal)
  -> chrome-relay CLI
  -> localhost HTTP (127.0.0.1:12122)
  -> native messaging host
  -> Chrome extension (service worker)
  -> CDP (chrome.debugger)
  -> your real tabs

Everything is local. There is no cloud relay, no account, no telemetry.

The extension talks to one whitelisted local process through Chrome's own native-messaging permission system. The same Chrome you'd have to trust anyway.

Where to go

You want to	Read
Install it	Installation
Drive a page in 2 minutes	Quickstart
Understand why it uses your Chrome	Why your real Chrome
Know how the pieces connect	Architecture
Read pages cheaply	Snapshots
Click things reliably	Refs · Clicking strategies
Look up a command	Command reference
Teach your agent	The agent skill
Compare with agent-browser	vs agent-browser

Design positions, briefly

CLI, not MCP. Tool schemas cost thousands of context tokens on every agent turn; a CLI costs nothing until invoked, and every agent already knows how to run shell commands.
Verbs, not magic. There's no "smart click" with hidden fallbacks. Each addressing mode (@ref, CSS selector, coordinates) is explicit, and each failure is a structured error code the agent can branch on.
Strict by default. No silent partial success. If a click would land on an overlay instead of the target, you get click_intercepted naming the overlay. Not a success report and a broken page.
Output is the product. Agents pay for every byte on stdout. The snapshot format exists to be small.

Chrome Relay docs

What it is, in one diagram#

Where to go#

Design positions, briefly#

What it is, in one diagram

Where to go

Design positions, briefly