Wednesday, February 25, 2009

HIVE

Hive is an data warehousing system developed by Facebook.
On top of Hadoop, Hive enables developers (or analysist) to write quick query to examine large dataset.
Even though the features of Hive is relatively simple compared with heavier language (e.g. Pig-Latin, DryadLINQ, etc...), it seems working in many cases. If a user need more sophisticated analysis, he/she could write his/her own MapReduce code. If it is a simple task, it is better to use Hive to run that query - Hive is good fit to this scenario. This is a 'good enough' system for many cases, and a stepping stone to more high-profile languages, which would take time to mature.

No comments:

Post a Comment