Vol.126, No.2, 2021, pp.771-790, doi:10.32604/cmes.2021.013244
OPEN ACCESS
ARTICLE
Run-Time Dynamic Resource Adjustment for Mitigating Skew in MapReduce
Zhihong Liu1, Shuo Zhang2,*, Yaping Liu2, Xiangke Wang1, Dong Yin1
1 College of Intelligence and Technology, National University of Defense Technology, Changsha, 410073, China
2 Cyberspace Institute of Advanced Technology, Guangzhou University, Guangzhou, 510006, China
* Corresponding Author: Shuo Zhang. Email:
(This article belongs to the Special Issue: Intelligent Models for Security and Resilience in Cyber Physical Systems)
Received 30 July 2020; Accepted 15 September 2020; Issue published 21 January 2021
Abstract
MapReduce is a widely used programming model for large-scale data processing. However, it still suffers from the skew problem, in which load is imbalanced among tasks. Skew can cause a small number of tasks to take much longer than the others, thereby prolonging the total job completion time. Existing solutions commonly predict the loads of tasks and then rebalance the load among them, but this prediction and rebalancing often incur high performance overhead. Moreover, existing solutions target the partitioning skew of reduce tasks and cannot mitigate the computational skew of map tasks. Accordingly, in this paper, we present DynamicAdjust, a run-time dynamic resource adjustment technique for mitigating skew. Rather than rebalancing the load among tasks, DynamicAdjust monitors the run-time execution of tasks and dynamically increases the resources of those tasks that require more computation. In so doing, DynamicAdjust not only eliminates the overhead incurred by load prediction and rebalancing, but also mitigates both partitioning skew and computational skew. Experiments on a real 21-node cluster using real-world datasets show that DynamicAdjust can mitigate the negative impact of skew and shorten the job completion time by up to 40.85%.
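The general idea described in the abstract, watching tasks at run time and giving extra resources to the ones that lag instead of moving their load, can be illustrated with a toy sketch. This is not the authors' DynamicAdjust algorithm, whose details are not given here. The `Task` class, the `resource_units` field, the median-based `skew_factor` threshold, and the assumption that work divides evenly across resource units are all hypothetical simplifications chosen for illustration.

```python
from dataclasses import dataclass
from statistics import median


@dataclass
class Task:
    task_id: str
    remaining_work: float    # hypothetical estimate of work left, in arbitrary units
    resource_units: int = 1  # hypothetical count of resource units currently allocated


def estimated_time_left(task: Task) -> float:
    # Toy assumption: remaining work divides evenly across resource units.
    return task.remaining_work / task.resource_units


def adjust_resources(tasks, spare_units, skew_factor=1.5):
    """Give spare resource units, one at a time, to the slowest task until no
    task's estimated time exceeds skew_factor times the initial median estimate.
    Returns the number of units handed out."""
    if not tasks:
        return 0
    # The threshold is fixed from the estimates observed before any adjustment.
    threshold = skew_factor * median(estimated_time_left(t) for t in tasks)
    granted = 0
    while granted < spare_units:
        slowest = max(tasks, key=estimated_time_left)
        if estimated_time_left(slowest) <= threshold:
            break  # no remaining task counts as skewed
        slowest.resource_units += 1
        granted += 1
    return granted


if __name__ == "__main__":
    tasks = [Task("m1", 10.0), Task("m2", 12.0), Task("m3", 11.0), Task("m4", 40.0)]
    granted = adjust_resources(tasks, spare_units=4)
    for t in tasks:
        print(t.task_id, t.resource_units, round(estimated_time_left(t), 2))
    print("units granted:", granted)
```

In this made-up example, task `m4` carries roughly four times the work of the others. The sketch grants it two of the four spare units, which brings its estimated remaining time from 40 down to about 13.3, and leaves the other tasks untouched. No load is predicted in advance or moved between tasks, which is the contrast with rebalancing approaches that the abstract draws.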
Keywords
MapReduce; task scheduling; resource allocation; data skew; big data
Cite This Article
Liu, Z., Zhang, S., Liu, Y., Wang, X., Yin, D. (2021). Run-Time Dynamic Resource Adjustment for Mitigating Skew in MapReduce. CMES-Computer Modeling in Engineering & Sciences, 126(2), 771–790.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.