Monday, March 2, 2009

Pig Latin

Pig Latin is a query language that runs on top of Hadoop.
Although Hadoop, or a MapReduce-style programming framework in general, is good for writing programs that analyze large datasets, it is rigid and low-level. Pig Latin aims to provide a means of writing ad-hoc analysis queries. This should be quite influential: the easier it is to use, the more users will use it. Actually, there are a few different approaches that aim at the same end: Sawzall, DryadLINQ, etc. Each approach has its own characteristics and pros and cons. It will be interesting to see which one wins in this area. To me, Pig Latin seems the closest to SQL, which is a widely adopted language. If the performance of Pig (the execution stack for Pig Latin) is good enough and Pig Latin can provide a flexible and easy-to-use grammar, then, given that it is an open-source project to which many prospective users have access, Pig would win a significant share of this space.
