Improving Hadoop Hive query response times through efficient virtual resource allocation

Tansel Dokeroglu, Muhammet Serkan Cınar, Seyyit Alper Sert, Ahmet Cosar, Adnan Yazıcı

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Citation (Scopus)

Abstract

The performance of MapReduce-based Cloud data warehouses depends mainly on the virtual hardware resources allocated. Most of the time, the resource values are selected or assigned by the Cloud service providers. However, setting the right virtual resources, such as the number of CPUs, the size of RAM, and the network bandwidth, in accordance with the workload demands of a query will improve the response time when querying large data on an optimized system. In this study, we carried out a set of experiments with a well-known MapReduce SQL translator, Hadoop Hive, on the TPC-H decision support benchmark database in order to analyze the performance sensitivity of the queries under different virtual resource settings. Our results provide valuable hints for decision makers who design efficient MapReduce-based data warehouses on the Cloud.

Original language: English
Title of host publication: Flexible Query Answering Systems 2015 - Proceedings of the 11th International Conference, FQAS 2015
Editors: Olivier Pivert, Adnan Yazici, Henrik Larsen, Maria Amparo Vila, Troels Andreasen, Henning Christiansen, Janusz Kacprzyk, Slawomir Zadrozny, Guy De Tre, Gabriella Pasi
Publisher: Springer Verlag
Pages: 215-225
Number of pages: 11
ISBN (Print): 9783319261539
Publication status: Published - 2016
Externally published: Yes
Event: 11th International Conference on Flexible Query Answering Systems, FQAS 2015 - Cracow, Poland
Duration: Oct 26 2015 → Oct 28 2015

Publication series

Name: Advances in Intelligent Systems and Computing
Volume: 400
ISSN (Print): 2194-5357

Conference

Conference: 11th International Conference on Flexible Query Answering Systems, FQAS 2015
Country/Territory: Poland
City: Cracow
Period: 10/26/15 → 10/28/15

Keywords

  • Hadoop
  • Hive
  • Multi-objective query optimization
  • Virtual resource allocation

ASJC Scopus subject areas

  • Control and Systems Engineering
  • General Computer Science
