A Modified Key Partitioning for BigData Using MapReduce in Hadoop

Gothai Ekambaram; Balasubramanie Palanisamy

doi:10.3844/jcssp.2015.490.497

Research Article Open Access

A Modified Key Partitioning for BigData Using MapReduce in Hadoop

Gothai Ekambaram¹ and Balasubramanie Palanisamy¹

¹ Kongu Engineering College, India

Abstract

In the period of BigData, massive amounts of structured and unstructured data are being created every day by a multitude of ever-present sources. BigData is complicated to work with and needs extremely parallel software executing on a huge number of computers. MapReduce is a current programming model that makes simpler writing distributed applications which manipulate BigData. In order to make MapReduce to work, it has to divide the workload between the computers in the network. As a result, the performance of MapReduce vigorously depends on how consistently it distributes this study load. This can be a challenge, particularly in the arrival of data skew. In MapReduce, workload allocation depends on the algorithm that partitions the data. How consistently the partitioner distributes the data depends on how huge and delegate the sample is and on how healthy the samples are examined by the partitioning method. This study recommends an enhanced partitioning algorithm using modified key partitioning that advances load balancing and memory utilization. This is completed via an enhanced sampling algorithm and partitioner. To estimate the proposed algorithm, its performance was compared against a high-tech partitioning mechanism employed by TeraSort. Experimentations demonstrate that the proposed algorithm is quicker, more memory efficient and more accurate than the existing implementation.

Journal of Computer Science

Volume 11 No. 3, 2015, 490-497

DOI: https://doi.org/10.3844/jcssp.2015.490.497

Submitted On: 7 March 2014 Published On: 15 April 2015

How to Cite: Ekambaram, G. & Palanisamy, B. (2015). A Modified Key Partitioning for BigData Using MapReduce in Hadoop. Journal of Computer Science, 11(3), 490-497. https://doi.org/10.3844/jcssp.2015.490.497

Copyright: © 2015 Gothai Ekambaram and Balasubramanie Palanisamy. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

5,446 Views
3,917 Downloads
0 Citations

Download

Keywords

Hadoop
Hash Code
Partitioning
MapReduce