Meeting deadlines for approximation processing in MapReduce environments

Ming-hao HU, Chang-jian WANG, Yu-xing PENG

Front. Inform. Technol. Electron. Eng ›› 2017, Vol. 18 ›› Issue (11) : 1754-1772. DOI: 10.1631/FITEE.1601056
Article

Abstract

To provide timely results for big data analytics, it is crucial to satisfy deadline requirements for MapReduce jobs in today’s production environments. Much effort has been devoted to the problem of meeting deadlines, and existing solutions typically fall into two kinds. The first allocates appropriate resources to complete the entire job before the specified time limit, but deadlines are still missed when the deadline constraints are tight or resources are lacking. The second runs a pre-constructed sample chosen according to the deadline constraints, which can satisfy the time requirement but fails to maximize the volume of processed data. In this paper, we propose a deadline-oriented task scheduling approach, named ‘Dart’, to address these problems. Given a specified deadline and restricted resources, Dart uses an iterative estimation method, based on both historical data and job running status, to precisely estimate the real-time job completion time. Based on the estimated time, Dart uses an approach–revise algorithm to make dynamic scheduling decisions that meet deadlines while maximizing the amount of processed data and mitigating stragglers. Dart also handles task failures and data skew efficiently, so that they do not degrade its performance. We have validated our approach using workloads from OpenCloud and Facebook on a cluster of 64 virtual machines. The results show that Dart not only meets the deadline effectively but also processes near-maximum volumes of data, even with tight deadlines and limited resources.
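The general idea of combining historical data with running status to decide how much data fits before a deadline can be illustrated with a minimal sketch. This is not Dart's actual estimation method or its approach–revise algorithm, which the abstract does not specify. The function names, the fixed blending weight, and the wave-based capacity model are all assumptions made for illustration.

```python
import math


def estimate_task_time(historical_mean, observed, weight=0.5):
    """Blend a historical mean task duration with durations observed so far.

    Hypothetical helper: the fixed blending weight is an assumption,
    not something taken from the paper.
    """
    if not observed:
        # No running status yet, so fall back to history alone.
        return historical_mean
    observed_mean = sum(observed) / len(observed)
    return weight * historical_mean + (1 - weight) * observed_mean


def tasks_within_deadline(time_left, slots, task_time, pending):
    """Count pending tasks that fit before the deadline.

    Assumes tasks run in full waves across `slots` parallel slots and
    that every task takes `task_time` seconds.
    """
    if time_left <= 0 or task_time <= 0:
        return 0
    waves = math.floor(time_left / task_time)
    return min(pending, waves * slots)


# Re-estimate after two tasks have finished, then size the remaining work.
task_time = estimate_task_time(historical_mean=10.0, observed=[12.0, 14.0])
runnable = tasks_within_deadline(
    time_left=60.0, slots=4, task_time=task_time, pending=100
)
```

Calling both functions again whenever a task completes gives a simple iterative loop: the estimate tracks the observed running status, and the number of tasks scheduled shrinks or grows so that the job still finishes by the deadline.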

Keywords

MapReduce / Approximation jobs / Deadline / Task scheduling / Straggler mitigation

Cite this article

Ming-hao HU, Chang-jian WANG, Yu-xing PENG. Meeting deadlines for approximation processing in MapReduce environments. Front. Inform. Technol. Electron. Eng, 2017, 18(11): 1754‒1772 https://doi.org/10.1631/FITEE.1601056

RIGHTS & PERMISSIONS

2017 Zhejiang University and Springer-Verlag GmbH Germany