I built a desktop automation CLI for AI agents.
Hey r/coolgithubprojects
I was using agent-browser to power my agentic workflow, and it worked great. When I wanted to expand computer-use to the OS itself, I couldn't find a good enough tool that was open-source, so I decided to build it myself.
What is agent-ctrl?
agent-ctrl is an OS automation CLI for AI agents written in Rust for speed.
How does it work?
agent-ctrl turns native app UIs into agent-readable format, then letting you or your agent act upon UIs.
It flattens and parses accessibility trees from any OS into one schema, which allows for cross-OS agents.
For now it supports Windows & MacOS, I'm working on Linux right now.
Looking for people open to contribute for Linux, since I do not run it myself.