Browser Based Agents
Hi, I’m new to this community and exploring browser-based agents. Most web automation tools I’ve tried feel unreliable for real-world workflows. Selenium/Playwright scripts are more deterministic, but my use case still needs bits of intelligence for changing pages, edge cases, and decision-making.
I’m especially curious about using Knowledge Graphs(like how coding agents have knowledge graphs for codebases) for this. Could I store page structure, selectors, actions, form fields, workflow steps, API endpoints, and past successful paths as a graph, so future agents can navigate with less reasoning and fewer tokens?
Basically: deterministic automation first, LLM only when uncertainty appears, and a KG as memory between runs.
Has anyone built something like this at scale? What architecture would you recommend?