You only find out about problems when someone complains ("Why is the office cold?"). By then, the system has been malfunctioning for hours or days, and the issue is hard to trace.
Don't rely on manual oversight—automate alerts:
Not all faults are equally urgent: