Abstract

Currently companies in the world have focused on the Big Data business which has become an invaluable tool in assisting business processes and data analysis. SQL-on-Hadoop is a small part of the Big Data Platform that has been developed to date. Our research implements Big Data Platform on Cloudera and Hortonworks using TPC-H Benchmarks on SQL-on-Hadoop systems and evaluates the characteristics and performance of query processing machines in each scenario applied to each platform. We focuses on evaluating the two Big Data Platforms, Cloudera and Hortonworks, to determine the advantages and disadvantages of each platform based on the TPC-H Benchmark that has been recognized as a Decision Support System to compare the two with four different scenario that run on the same configurations. The results obtained are Cloudera with Impala can process queries with a ratio of up to 41x faster than LLAP-Tez and 200x faster than Hive-Spark.

Details

Title
Performance evaluation sql-on-hadoop: a case study of Hortonworks and Cloudera
Author
Ronianto, M Faridh 1 ; Asror, Ibnu 1 ; Sidik Prabowo 1 ; Fajar Arief Nugraha 1 

 Telkom University, Bandung 
Publication year
2019
Publication date
Mar 2019
Publisher
IOP Publishing
ISSN
17426588
e-ISSN
17426596
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2566071207
Copyright
© 2019. This work is published under http://creativecommons.org/licenses/by/3.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.