Has anyone actually shrunk EBS safely in production?
Spent the last couple days going down a rabbit hole of old Reddit threads, AWS re:Post discussions, and random blog posts from 2019, all trying to figure out if reducing EBS volume sizes is actually viable.
Almost every answer eventually lands on the same thing: just leave it alone.
Which honestly surprised me more than I expected. We've gotten pretty good at right-sizing almost everything else in AWS. Reserved instances, auto-scaling, S3 lifecycle policies, there's a whole culture around not paying for idle capacity. But storage still feels weirdly exempt from that conversation. Volumes just... grow forever, and apparently that's fine.
I get why teams don't touch it. The risk/reward math is brutal. Nobody wants a 3am incident because someone tried to reclaim 200GB on a production database volume. The downside is catastrophic and the upside is a smaller AWS bill. Easy call.
But I keep wondering if the tooling and processes have quietly gotten better and I'm just not hearing about it because the people who succeeded aren't posting "I shrunk my EBS volume and nothing caught fire" to Reddit.
Has anyone actually done this cleanly on live workloads recently? Curious whether the standard approach is still snapshot then new volume then migrate, or if there's something less painful now.