
Apache Ambari - Monitoring



Ambari’s monitoring tools build on two open source projects: Ganglia and Nagios. Both come preconfigured with Ambari out of the box once a cluster has been provisioned.

Ambari can monitor the health and status of a Hadoop cluster at a granular level and display the metrics on its web UI. The Ganglia integration provides not only a wide range of Hadoop-centric metrics (e.g. HDFS disk usage, HDFS links, DataNode status) but also histograms that help reveal usage patterns from a threshold and trend perspective.

The following are some of the widgets Ambari provides for monitoring the cluster:
  • HDFS Disk Usage
  • DataNodes Live
  • HDFS Links
  • Memory Usage
  • Network Usage
  • CPU Usage
  • Cluster Load
  • NameNode Heap
  • NameNode RPC
  • NameNode CPU WIO
  • NameNode Uptime
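The data behind widgets like these can also be fetched programmatically through Ambari’s REST API, which is served on port 8080 by default and uses HTTP basic auth. A minimal sketch follows; the host name `ambari.example.com`, cluster name `mycluster`, and the chosen fields are illustrative assumptions, not values from this post:

```python
import base64
import urllib.request


def build_ambari_url(host, cluster, fields, port=8080):
    """Build an Ambari REST API URL requesting specific fields of a cluster."""
    return (f"http://{host}:{port}/api/v1/clusters/{cluster}"
            f"?fields={','.join(fields)}")


def basic_auth_header(user, password):
    """HTTP basic-auth header for the Ambari API."""
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    return {"Authorization": f"Basic {token}"}


# Hypothetical host/cluster names; adjust for your environment.
url = build_ambari_url("ambari.example.com", "mycluster",
                       ["Clusters/health_report", "Clusters/total_hosts"])

# To actually issue the request (requires a live Ambari server):
# req = urllib.request.Request(url, headers=basic_auth_header("admin", "admin"))
# print(urllib.request.urlopen(req).read())
print(url)
```

The same pattern works for per-service metrics by extending the path (e.g. appending `/services/HDFS`), which is how the UI widgets above source their numbers.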

Ambari Metrics System (AMS) is a system for collecting, aggregating and serving Hadoop and system metrics in Ambari-managed clusters.
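AMS serves the collected metrics through the Metrics Collector’s timeline endpoint (port 6188 by default). The sketch below only constructs the query URL; the collector host, metric names, and target hostname are hypothetical examples:

```python
from urllib.parse import urlencode


def build_ams_query(collector_host, metric_names, hostname,
                    app_id="HOST", port=6188):
    """Build a query URL for the AMS Metrics Collector timeline API."""
    params = urlencode({
        "metricNames": ",".join(metric_names),
        "hostname": hostname,
        "appId": app_id,  # HOST for system metrics; otherwise a service name
    })
    return f"http://{collector_host}:{port}/ws/v1/timeline/metrics?{params}"


# Hypothetical hosts; adjust for your environment.
url = build_ams_query("metrics-collector.example.com",
                      ["cpu_user", "mem_free"],
                      "datanode1.example.com")
print(url)
```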

Ambari also utilizes Ganglia for detailed heatmaps, which are a great way to quickly spot hosts that are consuming too many resources or falling outside an acceptable threshold for a monitored value.

The following are a few of the heatmaps provided by Ambari:
  • Host Disk Space Used
  • Host Memory Used
  • Host CPU WIO
  • HDFS Bytes Read & Written
  • Garbage Collection Time
  • JVM Heap Memory Used

Example views from the Ambari UI (screenshots omitted):
  • Host Disk Space Used %
  • YARN: Total Allocatable RAM Utilized per NodeManager
  • Host Metrics
Nagios is primarily used for health checking and alerting. During the Ambari wizard installation, the installer must provide a support email address for Nagios alerts. These alert emails include the notification type, service, host, host address, state, date, and any other additional info.
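The fields carried by those alert emails can be modeled as simple records. The sketch below filters a batch of sample alerts down to the ones needing attention; the field names and sample data are illustrative, not Nagios’s exact template variables:

```python
# Each dict mirrors the email fields listed above: notification type,
# service, host, host address, state, and date. Sample data is hypothetical.
alerts = [
    {"type": "PROBLEM", "service": "NAMENODE", "host": "nn1",
     "address": "10.0.0.1", "state": "CRITICAL", "date": "2016-12-30 10:15"},
    {"type": "RECOVERY", "service": "DATANODE", "host": "dn3",
     "address": "10.0.0.13", "state": "OK", "date": "2016-12-30 10:20"},
]


def needs_attention(alert):
    """An alert warrants action if it reports a non-OK state."""
    return alert["state"] in ("WARNING", "CRITICAL", "UNKNOWN")


bad_hosts = [a["host"] for a in alerts if needs_attention(a)]
print(bad_hosts)  # hosts currently in a bad state
```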
