agent-browser adds Chat mode to power conversational browser agents

agent-browser has added a new Chat mode that turns CLI commands into conversations, with one-shot prompts and an interactive loop. The dashboard now includes built-in AI chat that can run agent-browser commands directly, plus options for profiles, sessions, and auth.

tool cover

TL;DR

  • New chat mode: agent-browser chat reframes agent-browser as a browser agent with conversational control
  • Two usage patterns: One-shot CLI prompts or an interactive loop started via agent-browser chat
  • Dashboard update: Built-in AI chat can execute agent-browser commands directly
  • Workflow integration: Chat can run any command; supports use as a sub-agent in broader agent setups
  • Install/upgrade: npm i -g agent-browser; agent-browser upgrade
  • Auth and session tooling: --profile Default, --session-name, auth save/login, state save/load, set headers/credentials

agent-browser has added a new “chat” mode that reframes the tool less as a scriptable browser controller and more as a browser agent—with both one-shot CLI usage and an interactive loop. The update centers on a single entry point, agent-browser chat, plus a built-in AI chat surface in the dashboard that can run agent-browser commands directly.

From commands to conversations (without losing the CLI)

The headline change is the new chat interface, which supports two patterns:

  • One-shot runs, such as: agent-browser chat "open google, search for dogs"
  • Interactive sessions that start with: agent-browser chat

In addition, the dashboard now includes built-in AI chat, positioned as a more conversational way to drive the same underlying command set. Chris Tate also notes that chat can execute any agent-browser command, and that the tool can be used as a sub-agent—a clue that this is meant to plug into broader agentic workflows rather than remain a standalone automation utility.

Install and upgrade

The install path is intentionally simple:

  • First-time install: npm i -g agent-browser
  • Upgrade: agent-browser upgrade

Auth, profiles, and keeping sessions around

A common sticking point for browser agents is authentication. In replies, Tate outlined multiple options for handling credentials and state in agent-browser:

Reusing Chrome and persisting sessions

  • Reuse existing Chrome login sessions via --profile Default
  • Auto-save/restore cookies + storage across runs using --session-name github

Storing credentials and moving state between runs

  • An encrypted credential vault with commands like auth save mysite and auth login mysite
  • Portable auth snapshots using state save and state load

Working with headers and basic tokens

  • Tokens and basic auth via set headers and set credentials

Taken together, these cover the practical cases teams hit quickly: staying logged in, not redoing MFA flows every run, and carrying state between environments.

Developer questions are already shifting to the next layer

The thread quickly moved past “can it click things” into more operational concerns: token usage efficiency, exposing the dashboard over the network (0.0.0.0), whether it can access console/network tabs, and how it stacks up against Chrome’s MCP in terms of flexibility and control. There were also early asks about E2E testing workflows (Playwright-style) and even “templates support” for repeatable higher-level tasks.

For now, the concrete shipped change is clear: agent-browser chat turns the CLI into a conversational interface, while keeping direct command execution and session tooling close at hand.

Source: https://x.com/i/status/2041176865092563122

Continue the conversation on Slack

Did this article spark your interest? Join our community of experts and enthusiasts to dive deeper, ask questions, and share your ideas.

Join our community