Status Summaries - Group Summary Report


The group summary report provides an "at a glance" indication of system health across all the systems that are members of one or more groups. Essentially, it is a composite key indicators report that displays status information for alerts, processor load, memory usage, available disk space, failed services and selected performance items in an easy-to-review grid. Each metric within the report is colour coded to indicate the severity of the problem, if any.

 Error or failure. The information could not be obtained.
 Critical. A serious problem that should be resolved immediately.
 Warning. A problem that warrants attention.
 Normal. Value is within normal limits.
 Not available. The information is not currently available.

The metrics for each group are generated by taking the individual metrics for each system in the group, and displaying the "worst" metric from those systems. Furthermore, the individual disk metrics are combined into one overall disk metric for the group.
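The "worst metric" rule described above can be sketched in a few lines of Python. This is only an illustration, not the product's implementation: the Severity ranking (error above critical, not available at the bottom) and the names group_status and group_row are assumptions made for the example.

```python
from enum import IntEnum


class Severity(IntEnum):
    # Assumed ranking from least to most severe; the report's actual
    # ordering of these states is not stated in this document.
    NOT_AVAILABLE = 0
    NORMAL = 1
    WARNING = 2
    CRITICAL = 3
    ERROR = 4


def group_status(system_statuses):
    """Return the worst (highest ranked) severity among a group's systems."""
    return max(system_statuses, default=Severity.NOT_AVAILABLE)


def group_row(systems):
    """Build one group row from a mapping of system -> {metric: Severity}."""
    metric_names = {name for per_system in systems.values() for name in per_system}
    return {
        name: group_status(s[name] for s in systems.values() if name in s)
        for name in sorted(metric_names)
    }


# Hypothetical group with two systems.
systems = {
    "web01": {"cpu": Severity.NORMAL, "memory": Severity.WARNING},
    "db01": {"cpu": Severity.CRITICAL, "memory": Severity.NORMAL},
}
row = group_row(systems)
```

Here the group's CPU indicator would show critical because db01 is critical, even though web01 is normal.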

For each group, colour-coded severity indicators are always displayed. Clicking the » next to the group will display a second row of information that gives, where applicable, the underlying numeric or textual data for each status indicator. Clicking on the description of the group drills down to a computer summary report of the computers that make up the group.

Alerts

This metric displays the number of new and acknowledged alerts, totalled across the systems in each group. If any new alerts are present, the status is deemed critical. If there are acknowledged alerts but no new alerts, the status is a warning.
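As a rough sketch of the alert rule, the following Python totals the alert counts across a group and derives the status. The function names and the (new, acknowledged) tuple layout are hypothetical, and treating a group with no alerts at all as "normal" is an assumption, since that case is not described here.

```python
def alert_status(new_alerts, acknowledged_alerts):
    """Map alert counts to a status using the rule described above."""
    if new_alerts > 0:
        return "critical"
    if acknowledged_alerts > 0:
        return "warning"
    # Assumption: no alerts of either kind is shown as normal.
    return "normal"


def group_alert_status(per_system_counts):
    """Total (new, acknowledged) counts across systems, then apply the rule."""
    total_new = sum(new for new, _ in per_system_counts)
    total_acknowledged = sum(ack for _, ack in per_system_counts)
    return total_new, total_acknowledged, alert_status(total_new, total_acknowledged)


# Hypothetical group: one system has two acknowledged alerts, none are new.
result = group_alert_status([(0, 2), (0, 0)])
```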

CPU Load

The CPU load metric displays an indication of average CPU utilisation over a period of time, rather than an instantaneous value. A high load is displayed as a warning state, and a very high load as a critical state. The thresholds used to determine the state for individual systems are set using the thresholds & filters configuration.

This metric will not be available if insufficient data has been collected (for example shortly after system start-up) for an accurate average to be calculated.
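The behaviour described for CPU load might look something like the sketch below. The threshold values (80 and 95 percent), the minimum sample count and the function name are all assumptions chosen for illustration; the real values come from the thresholds & filters configuration, and how the product averages its samples is not documented here.

```python
def cpu_load_status(samples, warning_at=80.0, critical_at=95.0, min_samples=5):
    """Classify averaged CPU utilisation (percent) against two thresholds.

    warning_at, critical_at and min_samples are hypothetical defaults.
    """
    if len(samples) < min_samples:
        # Too little data for a meaningful average, e.g. just after start-up.
        return "not available"
    average = sum(samples) / len(samples)
    if average >= critical_at:
        return "critical"
    if average >= warning_at:
        return "warning"
    return "normal"


# A single spike does not change the state, because the average is used.
spiky = cpu_load_status([10, 12, 100, 11, 9])
```

Using an average means that a brief spike to 100 percent still reports as normal when the surrounding samples are low.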

Memory Load

The memory load metric displays an indication of how the memory in the system is being used. A high load is displayed as a warning state, and a very high load as a critical state. The thresholds used to determine the state for individual systems are set using the thresholds & filters configuration.

Disk Space

The free space on all fixed, non-removable disks is displayed, with one combined metric across all disks. A warning state indicates that free disk space is running low, and a critical state indicates a nearly full disk. The thresholds used to determine the state for individual systems are set using the thresholds & filters configuration.

Failed Services

A failed service is any service which is set to automatic start-up but which is not currently running. Note that this may include services not monitored by the ServerAssist service monitoring facility. If there are any failed services, this metric is displayed as a critical state.
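The definition of a failed service can be expressed as a simple filter. The sketch below is illustrative only: the Service record, its field names and the start mode strings are hypothetical, and showing "normal" when nothing has failed is an assumption.

```python
from dataclasses import dataclass


@dataclass
class Service:
    # Hypothetical record; field names are chosen for this example only.
    name: str
    start_mode: str  # e.g. "automatic", "manual" or "disabled"
    running: bool


def failed_services(services):
    """Return names of services set to automatic start-up that are not running."""
    return [s.name for s in services if s.start_mode == "automatic" and not s.running]


def failed_services_status(services):
    """Any failed service makes the metric critical."""
    # Assumption: with no failed services the metric is shown as normal.
    return "critical" if failed_services(services) else "normal"


services = [
    Service("Spooler", "automatic", True),
    Service("BackupAgent", "automatic", False),
    Service("Telnet", "manual", False),
]
failed = failed_services(services)
```

A stopped service with manual start-up, such as the hypothetical Telnet entry, is not counted as failed.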

Performance Items

The performance items metric includes any collected performance item for which two thresholds have been set, and which is configured in system monitoring to be monitored via the key indicators.