A development platform for Apache Hadoop MapReduce programs. Originally developed by Yahoo Research in the mid-2000s, Apache Pig uses the Pig Latin language, which can be extended with functions in ...
Will Apache Parquet be the "next big thing" in the Big Data/Hadoop ecosystem? It was only 14 months ago that the Apache Spark project was graduated to a top-level project by Hadoop steward Apache ...
The Apache Software Foundation's promotion of Tez to a top-level project not only endorses the technology but also the strength of the community behind it, according to Hortonworks, the Hadoop ...
In this RCE Podcast, Marcel Kornacker from Cloudera describes the Impala project. Impala brings scalable parallel database technology to Hadoop, enabling users to issue low-latency SQL queries to data ...