![](/develop/static/imgs/pdf.png)
Areliable power management scheme for consistent hashing based distributed key value storage systems
Nan-nan ZHAO, Ji-guang WAN, Jun WANG, Chang-sheng XIE
Areliable power management scheme for consistent hashing based distributed key value storage systems
Distributed key value storage systems are among the most important types of distributed storage systems currently deployed in data centers. Nowadays, enterprise data centers are facing growing pressure in reducing their power consumption. In this paper, we propose GreenCHT, a reliable power management scheme for consistent hashing based distributed key value storage systems. It consists of a multi-tier replication scheme, a reliable distributed log store, and a predictive power mode scheduler (PMS). Instead of randomly placing replicas of each object on a number of nodes in the consistent hash ring, we arrange the replicas of objects on nonoverlapping tiers of nodes in the ring. This allows the system to fall in various power modes by powering down subsets of servers while not violating data availability. The predictive PMS predicts workloads and adapts to load fluctuation. It cooperates with the multi-tier replication strategy to provide power proportionality for the system. To ensure that the reliability of the system is maintained when replicas are powered down, we distribute the writes to standby replicas to active servers, which ensures failure tolerance of the system. GreenCHT is implemented based on Sheepdog, a distributed key value storage system that uses consistent hashing as an underlying distributed hash table. By replaying 12 typical real workload traces collected from Microsoft, the evaluation results show that GreenCHT can provide significant power savings while maintaining a desired performance. We observe that GreenCHT can reduce power consumption by up to 35%–61%.
Consistent hash table (CHT) / Replication / Power management / Key value storage system / Reliability
[1] |
Amur, H., Cipar, J., Gupta, V.,
|
[2] |
Bhagwan, R., Savage, S., Voelker, G.M., 2003. Replication strategies for highly available peer-to-peer storage. In: Future Directions in Distributed Computing. Springer-Verlag, p.153–158.
|
[3] |
Box, G.E.P., Jenkins, G., 1990. Time series analysis forecast ing and control. In: Wiley Series in Probability and Statistics. Holden-Day, Inc.
|
[4] |
Brockwell, P.J., Davis, R.A., 1991. Time series: theory and methods. In: Springer Series in Statistics. Springer-Verlag, New York, NY, USA.
|
[5] |
Cisco Systems, 2012. FNV-1. Available from http://www.isthe.com/chongo/tech/comp/fnv/index.html.
|
[6] |
Colarelli, D., Grunwald, D., 2002. Massive arrays of idle disks for storage archives. Proc. ACM/IEEE Conf. on Supercomputing, p.1–11.
|
[7] |
DeCandia, G., Hastorun, D., Jampani, M.,
|
[8] |
Goiri, I., Le, K., Haque, M.E.,
|
[9] |
Goiri, I., Le, K., Nguyen, T.D.,
|
[10] |
Gorini, S., Quirini, M., Menciassi, A.,
|
[11] |
Harnik, D., Naor, D., Segall, I., 2009. Low power mode in cloud storage systems. Proc. Int. Symp. on Parallel and Distributed Processing Systems, p.1–8.
|
[12] |
Karger, D., Lehman, E., Leighton, T.,
|
[13] |
Kaushik, R.T., Bhandarkar, M., 2010. GreenHDFS: towards an energy-conserving, storage-efficient, hybrid Hadoop compute cluster. Proc. Int. Conf. on Power Aware Computing and Systems, p.1–9.
|
[14] |
Kaushik, R., Cherkasova, L., Campbell, R.,
|
[15] |
Lakshman, A., Malik, P., 2010. Cassandra—a decentralized structured storage system. ACM SIGOPS Oper. Syst. Rev., 44(2):35–40.
|
[16] |
Li, C., Qouneh, A., Li, T., 2012. iSwitch: coordinating and optimizing renewable energy powered node clusters. Proc. 39th Annual Int. Symp. on Computer Architecture, p.512–523.
|
[17] |
LinkedIn, 2009. Voldemort Project. Available from http://www.project-voldemort.com/voldemort/.
|
[18] |
Microsoft Research Ltd., 2014. MRS Cambridge Traces..
|
[19] |
MySQL, 2004. SysBench. Available from http://sysbench. sourceforge.net/.
|
[20] |
Narayanan, D., Donnelly, A., Rowstron, A., 2008. Write offloading: practical power management for enterprise storage. ACM Trans. Stor., 4(3):1–10.
|
[21] |
NTT Group, 2011. Sheepdog. Available from https:// github.com/sheepdog/sheepdog/wiki.
|
[22] |
Open Source and Linux Organization, 2007. Blktrace User Guide. Hewlett-Packard Company.
|
[23] |
Park, H., Park, K., 2001. Parallel algorithms for red-black trees. Theor. Comput. Sci., 262(1-2):415–435.
|
[24] |
Pinheiro, E., Bianchini, R., 2004. Energy conservation techniques for disk array-based servers. Proc. 18th Annual Int. Conf. on Supercomputing, p.68–78.
|
[25] |
Pinheiro, E., Bianchini, R., Dubnicki, C., 2006. Exploiting redundancy to conserve energy in storage systems. Proc. Joint Int. Conf. on Measurement and Modeling of Computer Systems, p.15–26.
|
[26] |
Stoica, I., Morris, R., Karger, D.,
|
[27] |
Thereska, E., Donnelly, A., Narayanan, D., 2011. Sierra: practical power-proportionality for data center storage. Proc. 6th Conf. on Computer Systems, p.169–182.
|
[28] |
Zhu, Q., Chen, Z., Tan, L.,
|
/
〈 |
|
〉 |