TY - EJOU AU - Liu, Zhihong AU - Zhang, Shuo AU - Liu, Yaping AU - Wang, Xiangke AU - Yin, Dong TI - Run-Time Dynamic Resource Adjustment for Mitigating Skew in MapReduce T2 - Computer Modeling in Engineering \& Sciences PY - 2021 VL - 126 IS - 2 SN - 1526-1506 AB - MapReduce is a widely used programming model for large-scale data processing. However, it still suffers from the skew problem, which refers to the case in which load is imbalanced among tasks. This problem can cause a small number of tasks to consume much more time than other tasks, thereby prolonging the total job completion time. Existing solutions to this problem commonly predict the loads of tasks and then rebalance the load among them. However, solutions of this kind often incur high performance overhead due to the load prediction and rebalancing. Moreover, existing solutions target the partitioning skew for reduce tasks, but cannot mitigate the computational skew for map tasks. Accordingly, in this paper, we present DynamicAdjust, a run-time dynamic resource adjustment technique for mitigating skew. Rather than rebalancing the load among tasks, DynamicAdjust monitors the run-time execution of tasks and dynamically increases resources for those tasks that require more computation. In so doing, DynamicAdjust can not only eliminate the overhead incurred by load prediction and rebalancing, but also culls both the partitioning skew and the computational skew. Experiments are conducted based on a 21-node real cluster using real-world datasets. The results show that DynamicAdjust can mitigate the negative impact of the skew and shorten the job completion time by up to 40.85%. KW - MapReduce; task scheduling; resource allocation; data skew; big data DO - 10.32604/cmes.2021.013244