Browser Automation
Hermes Agent includes a full browser automation toolset with multiple backend options. In all modes, the agent can navigate websites, interact with page elements, fill forms, and extract information. Pages are represented as accessibility trees (text-based snapshots), a format well suited to LLM agents. …
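To make the idea of a text-based accessibility snapshot concrete, here is a minimal Python sketch. It is not Hermes Agent's actual snapshot format: the `Node` class, the set of interactive roles, and the `[ref=eN]` labels are all assumptions chosen for illustration. It only shows how a tree of roles and names can be flattened into indented text that a language model can read and refer back to.

```python
from dataclasses import dataclass, field

# Hypothetical node type; real accessibility trees carry many more properties.
@dataclass
class Node:
    role: str
    name: str = ""
    children: list["Node"] = field(default_factory=list)

# Assumed set of roles that get a reference label for later interaction.
INTERACTIVE = {"link", "button", "textbox", "checkbox", "combobox"}

def render_snapshot(root: Node) -> str:
    """Flatten a tree into indented text, numbering interactive elements."""
    lines: list[str] = []
    counter = 0

    def visit(node: Node, depth: int) -> None:
        nonlocal counter
        label = f'{node.role} "{node.name}"' if node.name else node.role
        if node.role in INTERACTIVE:
            counter += 1
            label += f" [ref=e{counter}]"  # illustrative ref syntax
        lines.append("  " * depth + "- " + label)
        for child in node.children:
            visit(child, depth + 1)

    visit(root, 0)
    return "\n".join(lines)

page = Node("document", "Sign in", [
    Node("heading", "Welcome back"),
    Node("textbox", "Email"),
    Node("button", "Sign in"),
])
print(render_snapshot(page))
```

In this sketch, only interactive elements receive a reference, so a follow-up action can name a target such as `e2` without relying on pixel coordinates or CSS selectors.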
What this page covers
- Overview
- Setup
- Browserbase cloud mode
- Browser Use cloud mode
- Firecrawl cloud mode
- Hybrid routing: cloud for public URLs, local for LAN/localhost
- Camofox local mode
- Local Chromium-family browser via CDP (/browser connect)
- WSL2 + Windows Chrome: prefer MCP over /browser connect
- Local browser mode
- Optional Environment Variables
- Install agent-browser CLI
- Available Tools
- browser_navigate
- browser_snapshot
- browser_click
- browser_type
- browser_scroll
- browser_press
- browser_back
- browser_get_images
- browser_vision
- browser_console
- browser_cdp
- browser_dialog
- Practical Examples
- Filling Out a Web Form
- Researching Dynamic Content
- Session Recording
- Stealth Features
- Session Management
- Limitations
Section outline mirrored from the official Hermes Agent documentation. Follow any heading to read the complete text on the source site.
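The outline entry on hybrid routing (cloud for public URLs, local for LAN/localhost) describes a decision that can be sketched in a few lines of Python. This is an illustrative guess at the kind of check involved, not Hermes Agent's implementation: the function name `choose_backend`, the `"local"`/`"cloud"` return values, and the handling of `.local` hostnames are assumptions. The sketch performs no DNS lookup and treats any non-IP hostname other than localhost-style names as public.

```python
import ipaddress
from urllib.parse import urlsplit

def choose_backend(url: str) -> str:
    """Return "local" for loopback/LAN targets, "cloud" otherwise (hypothetical)."""
    host = urlsplit(url).hostname or ""

    # Localhost-style names never leave the machine or the local network.
    if host == "localhost" or host.endswith((".localhost", ".local")):
        return "local"

    try:
        addr = ipaddress.ip_address(host)
    except ValueError:
        # Not an IP literal: assume a public hostname (no DNS resolution here).
        return "cloud"

    if addr.is_loopback or addr.is_private or addr.is_link_local:
        return "local"
    return "cloud"

for target in ("http://localhost:3000/", "http://192.168.1.20:8080/",
               "http://[::1]:5173/", "https://example.com/"):
    print(target, "->", choose_backend(target))
```

A production router would likely also resolve hostnames before classifying them, since a public-looking name can point at a private address.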