Ganglia Software Description
Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and Grids. It monitors CPU, disk, file system, tape, load average, Infiniband, topology, etc. It presents data via Web interfaces and XML/RPC-based Ganglia APIs to monitor the status of thousands of heterogeneous systems. As an open source solution, it provides a flexible system for monitoring different environments, from small clusters to large Grids with around 100,000 nodes.
Ganglia is targeted at infrastructure monitoring on high-performance computing systems such as clusters and Grids. It provides a scalable mechanism for storing statistics such as CPU Usage, Memory Usage, Load Averages, Disk Sizes, and File System Space for thousands of hosts. It supports dynamic discovery and automatic configuration of monitored hosts, flexible querying for performance data, and a wide variety of output formats for easy monitoring of local resources, as well as grids and clusters of hosts distributed across the internet.