The Browser API
Built for AI Agents

Control real browsers through a simple REST API. Get structured page data, stable element refs, and change diffs instead of raw HTML.

Stability detection built in
Fraction of the payload size
Diffs after every action
No credit card required. 5,000 free credits included.

See the API in Action

Six steps showing the full lifecycle: create a session, observe the page, fill a search form, extract results with AI selectors, scroll for more, and screenshot.

POST /v1/sessions
{
  "url": "https://browserbeam.com/blog/",
  "viewport": {
    "width": 1280,
    "height": 720
  },
  "auto_dismiss_blockers": true
}
201 Created
{
  "session_id": "ses_abc123def456",
  "expires_at": "2026-04-14T14:05:00.000Z",
  "request_id": "req_8f3a2bc1d4e5",
  "completed": 0,
  "page": {
    "url": "https://browserbeam.com/blog/",
    "title": "Browserbeam Blog",
    "stable": true,
    "markdown": {
      "content": "### Puppeteer vs Playwright vs Browserbeam...\n\n### Build a Competitive Intelligence Agent..."
    },
    "interactive_elements": [
      { "ref": "e1", "tag": "input",
        "label": "Search articles...", "in": "form", "form": "f1" }
    ],
    "forms": [
      { "ref": "f1", "action": "/blog/", "method": "GET", "fields": ["e1"] }
    ]
  },
  "blockers_dismissed": ["cookie_consent"]
}
Without Browserbeam

Launch Puppeteer, set viewport, navigate, wait for networkidle, detect and dismiss cookie banner (2-3 extra actions), call page.content(), parse 15,000+ character HTML with cheerio, manually extract form fields.

~25 lines of code. A wall of raw HTML for your LLM to parse.

With Browserbeam

One POST request. Navigate, auto-dismiss the cookie banner, return markdown content, element refs, and form structures. The page is ready for your agent to read and act on.

1 API call. Markdown + refs + forms. Compact and LLM-ready.

An Intelligence Layer for the Browser

Nine capabilities that sit between your agent and the page, so the LLM spends tokens on the task, not on browser overhead.

Stability Detection

Every response includes a stability signal that tells your agent when the page is fully loaded and ready. No more guessing wait times or burning tokens on premature reads.

Element Registry

Interactive elements get short, stable refs like e1, e2, e3. Your agent clicks by ref instead of constructing fragile CSS selectors.

Diff Tracking

After each action, the API returns only what changed: elements added, removed, or modified. Your agent reads a 30-token diff instead of re-parsing the entire page.

Blocker Dismissal

Cookie banners, newsletter popups, and chat widgets are detected and dismissed automatically. Your agent never wastes actions on interruptions irrelevant to the task.

Semantic Extraction

Pages are compressed into a structured, token-efficient representation: interactive elements, headings, and visible text. Thousands of DOM nodes become a compact JSON object.

Error Enrichment

When an action fails, you get context, not just "element not found." The API tells you if an overlay is blocking the target, if a CAPTCHA appeared, and what to do next.

JavaScript Execution

Run custom JavaScript on any page when built-in steps aren't enough. Your agent writes a JS snippet and the API executes it in the browser context, returning the result as structured data.

Cookie Injection

Inject cookies at session creation to skip login flows entirely. Your agent authenticates once, saves the cookies, and resumes authenticated sessions instantly.

Smart Wait Conditions

Wait for JavaScript expressions to become truthy, not just DOM selectors. Your agent handles complex SPAs where visibility depends on framework state, not raw DOM presence.

What Will You Build?

One API, many possibilities. From autonomous agents to data pipelines, Browserbeam gives your code a browser it can see through.

Give your AI agent a real browser it can see and control. Every response includes markdown, stable element refs, and optional context (landmark, nearby heading, and parent form), plus forms grouped with their field refs.

Request
POST /v1/sessions

{
  "url": "https://en.wikipedia.org/wiki/Web_scraping",
  "auto_dismiss_blockers": true
}
Response
{
  "session_id": "ses_abc123def456",
  "request_id": "req_8f3a2bc1d4e5",
  "page": {
    "url": "https://en.wikipedia.org/wiki/Web_scraping",
    "title": "Web scraping - Wikipedia",
    "stable": true,
    "markdown": {
      "content": "**Web scraping**, **web harvesting**, or **web data extraction** is data scraping used for extracting data from websites..."
    },
    "interactive_elements": [
      { "ref": "e3", "tag": "input",
        "label": "Search Wikipedia", "in": "form", "form": "f1" },
      { "ref": "e4", "tag": "button",
        "label": "Search", "in": "form", "form": "f1" }
    ]
  }
}

Drop Into Your Stack in Minutes

Official SDKs for the languages you already use, plus an MCP server that turns Browserbeam into tools your AI coding assistant can call.

Browserbeam is a REST API. Any language that can make HTTP requests can use it.

Browse All Integrations

Simple, intuitive pricing.

One monthly credit pool covers runtime, proxies, AI selectors, and CAPTCHA solving. Start with 5,000 free credits, no card required.

Starter

For individuals and side projects

$29 / month
  • 500,000 credits / month
  • 5 concurrent sessions
  • 15 min per session
  • Datacenter & residential proxies
  • AI selectors & semantic extraction
  • Auto CAPTCHA solving
Get Started
Best Value

Pro

For teams and production use

$99 / month
  • 2,000,000 credits / month
  • 50 concurrent sessions
  • 30 min per session
  • Datacenter & residential proxies
  • AI selectors & semantic extraction
  • Auto CAPTCHA solving
Get Started

Scale

For agencies and high-volume use

$299 / month
  • 10,000,000 credits / month
  • 100 concurrent sessions
  • 1 hour per session
  • Datacenter & residential proxies
  • AI selectors & semantic extraction
  • Auto CAPTCHA solving
Get Started
Billed monthly. Unused credits expire at period end. Cancel anytime.
Prices exclude VAT where applicable.

Estimate your monthly credits

Drag the sliders to match your workload. We'll show you which plan fits and how many credits you'll burn each month.

10,000
10s
1.0 MB
0

Each resolution averages ~600 AI tokens (15 credits / 1K tokens).

0

Auto-solved CAPTCHAs — DataDome, Cloudflare, reCAPTCHA, etc. (75 credits / solve).

Estimated monthly spend
30,000
credits
Recommended plan
Starter $29 / month
Comfortably fits your workload with headroom for spikes.
Breakdown
  • Runtime: 0 credits
  • Proxy bandwidth: 0 credits
  • AI selectors: 0 credits
  • CAPTCHA solves: 0 credits

Frequently Asked Questions

How does the browser API work?

Browserbeam provides a REST API that gives your LLM or automation code control over a real browser. Create a session, navigate to any URL, observe the page as structured JSON, interact with elements using short refs, and get change diffs after every action. You send HTTP requests. We handle the browser infrastructure.

How is this different from Browserbase, Browserless, Steel, or Firecrawl?

Several competitors now return markdown or structured output, but Browserbeam goes deeper. Element refs let your agent click a specific button by ID instead of guessing from raw HTML. DOM diff tracking returns only what changed after each action, cutting token usage by 60-80%. Stability detection tells your agent when the page is ready without arbitrary sleeps. Declarative CSS extraction returns typed JSON without any LLM cost. Browserbase is a full agent platform (browsers, model gateway, serverless functions). Browserless and Steel give you hosted Playwright with their own extraction endpoints. Firecrawl turns pages into LLM-ready markdown. Browserbeam gives your agent the structured, interactive page state that none of them provide.

What can I build with Browserbeam?

LLM-powered web agents, automated form filling, web scraping, QA testing, screenshot and PDF generation, and any workflow where AI needs to see and interact with web pages. Browserbeam handles the browser so your code stays simple.

How does pricing work?

Every plan ships with a monthly credit pool. Runtime burns 1 credit per second, datacenter proxy bandwidth burns 35 credits per MB, residential proxy bandwidth burns 350 credits per MB, AI selectors burn 15 credits per 1,000 AI tokens, and auto CAPTCHA solving burns 75 credits per solve. The free trial includes 5,000 credits, enough to fully evaluate the API on a real workload. See the billing reference for the full rate sheet.

Is my data private and secure?

Every session runs in an isolated browser context with separate cookies, storage, and cache. No CDP port is exposed. Sessions are destroyed when closed and we never store page content beyond the session lifetime. API keys are hashed. Your usage data is accessible only to you.

Do I need to manage Playwright or Puppeteer?

No. Browserbeam is a REST API. You send JSON, you get JSON back. No browser libraries to install, no Chrome binaries to maintain, no WebSocket connections to manage. If your code can make HTTP requests, it can use Browserbeam.

Give your AI agent a faster, leaner browser

Structured page data instead of raw HTML. Your agent processes less, decides faster, and costs less to run.

Stability detection built in
Fraction of the payload size
Diffs after every action
No credit card required. 5,000 free credits included.