This is the fourth public release of Chukwa, a log analysis framework built on top of Hadoop and HBase. Chukwa has been tested at scale and used in some production settings, and is reasonably robust and well-behaved. For instructions on setting up Chukwa, see the administration guide and the rest of the Chukwa documentation.
- Chukwa can store data on HBase for improved low-latency read performance.
- The Chukwa collector has been deprecated. HBase and HDFS can provide better reliability for data ingestion, so the collector's original purpose is superseded by both services.
- There have been a number of bug fixes and code cleanups since the last release; check the changelog and JIRA for details.
Chukwa relies on Java 1.6 and requires Maven 3.0.3 to build. The back-end processing requires Hadoop 1.2.1+, HBase 0.96+, and Pig 0.9.1+.
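As a rough illustration of the back-end minimums listed above, the following Python sketch compares a dotted version string against those minimums. The names `MINIMUMS`, `parse_version`, and `meets_minimum` are hypothetical helpers written for this example and are not part of Chukwa; the installed version strings would have to be obtained separately, for instance from each tool's own version command.

```python
import re

# Minimum back-end versions, copied from the requirements above.
MINIMUMS = {
    "hadoop": "1.2.1",
    "hbase": "0.96",
    "pig": "0.9.1",
}


def parse_version(text):
    """Turn the leading dotted number of a version string into a tuple of ints.

    For example, '0.96.1-hadoop1' becomes (0, 96, 1).
    """
    match = re.match(r"\d+(?:\.\d+)*", text.strip())
    if match is None:
        raise ValueError("no numeric version found in %r" % text)
    return tuple(int(part) for part in match.group(0).split("."))


def meets_minimum(component, installed):
    """Return True if the installed version is at least the listed minimum."""
    required = parse_version(MINIMUMS[component])
    found = parse_version(installed)
    # Pad the shorter tuple with zeros so '0.96' compares equal to '0.96.0'.
    width = max(len(required), len(found))
    required += (0,) * (width - len(required))
    found += (0,) * (width - len(found))
    return found >= required
```

Comparing integer tuples rather than raw strings avoids the common mistake of treating Pig 0.10 as older than Pig 0.9.1.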
- HICC defaults to assuming data is UTC; if your machines run on local time, HICC graphs will not display properly until you change the HICC timezone. You can do this by clicking the small "gear" icon on the time selection tool.
- As mentioned in the administration guide, the Pig aggregation script must be scheduled externally, for example with a cron job or Jenkins.
- The Salsa Finite State Machine has not been ported to store data on HBase.
- There is currently no downsampling script for data stored on HBase. Long-term trend visualization might not work for large-scale data.
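
To illustrate what a downsampling step involves, here is a minimal Python sketch that averages `(timestamp, value)` samples into fixed-width time buckets. It is not a Chukwa script and does not read from or write to HBase; the function name `downsample`, the in-memory input format, and the choice of averaging as the aggregate are all assumptions made for illustration.

```python
from collections import defaultdict


def downsample(samples, bucket_seconds):
    """Average (epoch_seconds, value) samples into fixed-width buckets.

    Returns a list of (bucket_start, mean_value) pairs sorted by time.
    """
    if bucket_seconds <= 0:
        raise ValueError("bucket_seconds must be positive")
    buckets = defaultdict(list)
    for timestamp, value in samples:
        # Align each timestamp to the start of the bucket that contains it.
        bucket_start = timestamp - (timestamp % bucket_seconds)
        buckets[bucket_start].append(value)
    return [
        (start, sum(values) / len(values))
        for start, values in sorted(buckets.items())
    ]


# Four 30-second samples reduced to two 60-second points.
points = downsample([(0, 1.0), (30, 3.0), (60, 5.0), (90, 7.0)], 60)
print(points)  # [(0, 2.0), (60, 6.0)]
```

Reducing the number of points in this way is what keeps long time ranges cheap to plot; without such a step, a trend view has to scan every raw sample.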