Set up Ganglia for EC2 with Hadoop and HBase
Ganglia is a very nice tool for monitoring statistics across a cluster. For example, you can track CPU time for every node in the cluster from a single end point.
Setting up Ganglia is fairly simple. You just have to start two daemon processes, gmond (Ganglia monitor) and gmetad (Ganglia meta), and copy the web scripts (the PHP ones) into the proper Apache directory (assuming your web server is Apache).
gmond - This has to run on every node in the cluster.
gmetad - This has to run on the node you choose as the end point.
gmond collects statistics on its node and periodically sends them out. By default it uses multicast, so every gmond in the cluster hears the others. It can instead be set to unicast to a single collecting host, from which gmetad gets the data. Once this setup is done, you should be able to see the web page showing graphs.
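To make the unicast option concrete, here is a minimal sketch in Python that prints the pieces of configuration involved. The helper names, the address 10.0.0.10 and the cluster name "hbase-cluster" are made up for illustration. Port 8649 is the usual Ganglia port, but check it against your own gmond.conf.

```python
def render_unicast_channels(collector_host: str, port: int = 8649) -> str:
    """Return gmond.conf channel sections that send metrics by unicast.

    collector_host is the node whose gmond receives everyone's metrics
    (a hypothetical address in the example below).
    """
    return (
        # Send metrics straight to the collector instead of a multicast group.
        "udp_send_channel {\n"
        f"  host = {collector_host}\n"
        f"  port = {port}\n"
        "}\n"
        # Only the collecting node actually needs to listen for UDP metrics.
        "udp_recv_channel {\n"
        f"  port = {port}\n"
        "}\n"
        # gmetad reads the collected data over TCP on this port.
        "tcp_accept_channel {\n"
        f"  port = {port}\n"
        "}\n"
    )


def render_data_source(cluster_name: str, collector_host: str, port: int = 8649) -> str:
    """Return the gmetad.conf line that tells gmetad where to read the data."""
    return f'data_source "{cluster_name}" {collector_host}:{port}\n'


if __name__ == "__main__":
    print(render_unicast_channels("10.0.0.10"))
    print(render_data_source("hbase-cluster", "10.0.0.10"))
```

The point of the sketch is only that the multicast setting (mcast_join) disappears from udp_send_channel and a plain host takes its place, while gmetad.conf points at that same host.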
For detailed setup follow this link.
As you follow the above link, note that EC2 supports only unicast. Because of this, you need to set "send_metadata_interval" to a value greater than 0 in gmond.conf, so that gmond re-sends its metric metadata periodically. (I lost about 2 hours figuring this out.)
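Since this one setting cost me two hours, here is a small sanity check, again only a sketch in Python. The function names are my own, and treating a missing setting as 0 is an assumption (I believe that matches the stock default, but verify it for your version).

```python
import re


def metadata_interval(conf_text: str) -> int:
    """Return send_metadata_interval from gmond.conf text, or 0 if it is not set."""
    # Anchor at the start of a line so commented-out settings are ignored.
    match = re.search(
        r"^\s*send_metadata_interval\s*=\s*(\d+)", conf_text, re.MULTILINE
    )
    return int(match.group(1)) if match else 0


def unicast_ready(conf_text: str) -> bool:
    """True when the interval is greater than 0, as unicast on EC2 requires."""
    return metadata_interval(conf_text) > 0


# Hypothetical fragment of a gmond.conf, for illustration only.
SAMPLE = """
globals {
  daemonize = yes
  send_metadata_interval = 30
}
"""

if __name__ == "__main__":
    print("unicast ready:", unicast_ready(SAMPLE))
```

To check a real node, read its gmond.conf into a string and pass it to unicast_ready.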