[RFC] Making browser agents reliable

Hey 👋

One-shot browser agents succeed maybe 30-60% of the time. You ask one to do something, it works once, then fails the next time. Not great when you actually need to get stuff done.

We're trying to fix this with **Workflows**—you chat with the agent to build a step-by-step graph instead of just giving it a goal and hoping.

<img width="840" height="705" alt="Image" src="https://github.com/user-attachments/assets/53e385d3-b852-4848-99c5-98a27945adb8" />

Say you want to unsubscribe from Gmail marketing emails. Instead of "unsubscribe from all my emails" and crossing your fingers, you end up with a graph: Navigate to Gmail → Click More menu → Manage Subscriptions → Extract subscriptions → Loop through each → Unsubscribe → Confirm if needed.

You can test it, tweak it, run it again later. It follows the same steps every time.
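
To make the idea concrete, here is a rough sketch of how the Gmail example could be written down as a step graph. This is only an illustration: the `Step` class, its fields (`action`, `body`, `optional`), and the `flatten` helper are all made up for this post and are not the actual Workflows schema or API.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Step:
    """One node in a workflow graph (hypothetical shape, for illustration only)."""
    name: str
    action: str  # assumed action kinds: "navigate", "click", "extract", "loop"
    body: List["Step"] = field(default_factory=list)  # child steps run inside a "loop"
    optional: bool = False  # True for steps like "Confirm if needed"


def flatten(steps: List[Step], depth: int = 0) -> List[Tuple[int, str]]:
    """Return (depth, name) pairs in the fixed order the steps are listed."""
    out = []
    for step in steps:
        out.append((depth, step.name))
        out.extend(flatten(step.body, depth + 1))
    return out


# The Gmail unsubscribe example from above, written as explicit steps.
gmail_unsubscribe = [
    Step("Navigate to Gmail", "navigate"),
    Step("Click More menu", "click"),
    Step("Manage Subscriptions", "click"),
    Step("Extract subscriptions", "extract"),
    Step("Loop through each", "loop", body=[
        Step("Unsubscribe", "click"),
        Step("Confirm if needed", "click", optional=True),
    ]),
]

# Print the plan as an indented outline; loop bodies are indented one level.
for depth, name in flatten(gmail_unsubscribe):
    print("  " * depth + name)
```

Because the steps are plain data rather than a free-form goal, the same list can be inspected, edited, and replayed, which is the property we're after.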

**Examples:**
- Fill out a form for each row in a spreadsheet
- Monitor prices across a few sites
- Accept LinkedIn requests matching certain criteria
- Move data from one app to another

---

**What we want to know:**

1. What browser task would you automate if it actually worked?
2. What would make it useful? (Scheduling? Notifications? Something else?)
3. How often do you do this task?

Reply below—trying to figure out what to build next.



[RFC] Making browser agents reliable #329
