Correctness of Hive on MR3, Presto, and Impala
· 7 min read
Introduction
Do you trust Hive? Do you trust Presto? Do you trust Impala? Do you trust your SQL system?
Do you trust Hive? Do you trust Presto? Do you trust Impala? Do you trust your SQL system?
In our previous article, we use the TPC-DS benchmark to compare the performance of five SQL-on-Hadoop systems: Hive-LLAP, Presto, SparkSQL, Hive on Tez, and Hive on MR3. As it uses both sequential tests and concurrency tests across three separate clusters, we believe that the performance evaluation is thorough and comprehensive enough to closely reflect the current state in the SQL-on-Hadoop landscape.