got my first "rm -rf /" today
Agent decided to test if harmful command block worked by issuing a rm -rf /
Thankfully it worked so only damage was a mild heart attack.
I implemented a sandbox immediately afterwards.
EDIT: for those wondering, I was implementing a bash command whitelist and also bubblewrap for isolation. I did the whitelist implementation first and that was the command the agent chose to test it 😂 bwrap got done quickly afterwards!