u/Clark12-002

▲ 20 r/Cisco

monitoring tool maintanence is starting to consume more time than the actual infrastructure

my monitoring environments has gradually become its own engineering project. every new device onboarding requires manual tweaks, custom thresholds, dependency adjustments and alert cleanup. we reached a point where only one or two people fully understand how everything is weird together which makes troubleshooting stressful whenever they are unavaible. i still want detailed visibility and reliable alerting but maintaining the monitoring stack itself shouldnt feel like a second full time job. want to know how other teams reduced operational overhead without sacrificing monitoring quality.

reddit.com
u/Clark12-002 — 4 days ago