Sapan Doshi on 19 Aug 2020 15:29:45
Currently, we have a reactive approach to alerting on Premium capacity outages. We get an alert only when capacity usage reaches its maximum, and by that time the entire enterprise has started experiencing an outage and users are unable to access reports.
It would be great if we could have a proactive alerting and monitoring solution. For example, an admin or a group of people should be notified as soon as usage crosses 90% of capacity, so the admin group can start looking into the issue and we can avoid an enterprise-level outage.
OR
If we could have real-time monitoring on Premium capacity.
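To make the request concrete, here is a minimal sketch of the kind of threshold check being asked for. It is only an illustration: `check_capacity`, `Alert`, and `WARN_THRESHOLD` are hypothetical names, and the utilization value is assumed to come from whatever monitoring source is available. No Power BI API is implied.

```python
from dataclasses import dataclass
from typing import Optional

# Assumption: the 90% figure from the post, expressed as a fraction.
WARN_THRESHOLD = 0.90


@dataclass
class Alert:
    capacity: str
    utilization: float
    message: str


def check_capacity(
    capacity: str,
    utilization: float,
    threshold: float = WARN_THRESHOLD,
) -> Optional[Alert]:
    """Return an Alert once utilization reaches the threshold, else None.

    `utilization` is a fraction of total capacity (0.92 means 92%).
    How that number is obtained is outside the scope of this sketch.
    """
    if utilization >= threshold:
        message = (
            f"{capacity} is at {utilization:.0%} of capacity "
            f"(threshold {threshold:.0%})"
        )
        return Alert(capacity, utilization, message)
    return None
```

With a check like this running against frequent utilization samples, the admin group would hear about a capacity at 92% before it hits its maximum, instead of after users lose access to reports.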
- Comments (1)
RE: Proactive Monitoring / Alerting on Premium Capacity
Power BI says that one of our Premium capacities is overallocated, but the background CUs shown in the monitoring report do not reflect that. It was found that the overload was caused by a dataflow that took a lot of CPU resources to refresh. This dataflow did not appear in the Capacity metrics app until the day after the refresh completed. This caused the capacity to go from nominal (60%) to blocking queries and refreshes (150%) in a matter of seconds.
The issue can ONLY be mitigated (not solved) by immediately moving everything off that capacity for the next 24 hours. That of course assumes you have another capacity with enough space to accommodate the workspaces. Without proactive capacity monitoring we are flying blind, always on the precipice of a major meltdown. Not a way to run your business.