u/Amazing-Wind2305

▲ 2 r/OpenSourceeAI+2 crossposts

I built a desktop automation CLI for AI agents.

Hey r/coolgithubprojects

I was using agent-browser to power my agentic workflow, and it worked great. When I wanted to expand computer-use to the OS itself, I couldn't find a good enough tool that was open-source, so I decided to build it myself.

What is agent-ctrl?

agent-ctrl is an OS automation CLI for AI agents written in Rust for speed.

How does it work?

agent-ctrl turns native app UIs into agent-readable format, then letting you or your agent act upon UIs.

It flattens and parses accessibility trees from any OS into one schema, which allows for cross-OS agents.

For now it supports Windows & MacOS, I'm working on Linux right now.

Looking for people open to contribute for Linux, since I do not run it myself.

github.com
u/Amazing-Wind2305 — 11 days ago

I built a desktop automation CLI for AI agents.

Hey r/OpenSourceeAI

I was using agent-browser to power my agentic workflow, and it worked great. When I wanted to expand computer-use to the OS itself, I couldn't find a good enough tool that was open-source, so I decided to build it myself.

What is agent-ctrl?

agent-ctrl is an OS automation CLI for AI agents written in Rust for speed.

How does it work?

agent-ctrl turns native app UIs into agent-readable format, then letting you or your agent act upon UIs.

It flattens and parses accessibility trees from any OS into one schema, which allows for cross-OS agents.

For now it supports Windows & MacOS, I'm working on Linux right now.

Looking for people open to contribute for Linux, since I do not run it myself.

reddit.com
u/Amazing-Wind2305 — 12 days ago

I built agent-browser but for OS automation.

Hey r/AI_Agents !

I was using agent-browser to power my agentic workflow, and it worked great. When I wanted to expand computer-use to the OS itself, I couldn't find a good enough tool that was open-source, so I decided to build it myself.

What is agent-ctrl?

agent-ctrl is an OS automation CLI for AI agents written in Rust for speed.

How does it work?

agent-ctrl turns native app UIs into agent-readable format, then letting you or your agent act upon UIs.

It flattens and parses accessibility trees from any OS into one schema, which allows for cross-OS agents.

For now it supports Windows, I'm working on MacOS and Linux right now.

Looking for people open to contribute for Linux, since I do not run it myself.

reddit.com
u/Amazing-Wind2305 — 13 days ago