Heuristic file sorted assignment algorithm of parallel I/O on cluster computing system

Zhi-gang Chen , Bi-qing Zeng , Ce Xiong , Xiao-heng Deng , Zhi-wen Zeng , An-feng Liu

Journal of Central South University ›› 2005, Vol. 12 ›› Issue (5) : 572 -577.

PDF
Journal of Central South University ›› 2005, Vol. 12 ›› Issue (5) : 572 -577. DOI: 10.1007/s11771-005-0125-7
Article

Heuristic file sorted assignment algorithm of parallel I/O on cluster computing system

Author information +
History +
PDF

Abstract

A new file assignment strategy of parallel I/O, which is named heuristic file sorted assignment algorithm was proposed on cluster computing system. Based on the load balancing, it assigns the files to the same disk according to the similar service time. Firstly, the files were sorted and stored at the set I in descending order in terms of their service time, then one disk of cluster node was selected randomly when the files were to be assigned, and at last the continuous files were taken orderly from the set I to the disk until the disk reached its load maximum. The experimental results show that the new strategy improves the performance by 20.2% when the load of the system is light and by 31.6% when the load is heavy. And the higher the data access rate, the more evident the improvement of the performance obtained by the heuristic file sorted assignment algorithm.

Keywords

cluster computing / parallel I/O / file sorted assignment / variance of service time

Cite this article

Download citation ▾
Zhi-gang Chen, Bi-qing Zeng, Ce Xiong, Xiao-heng Deng, Zhi-wen Zeng, An-feng Liu. Heuristic file sorted assignment algorithm of parallel I/O on cluster computing system. Journal of Central South University, 2005, 12(5): 572-577 DOI:10.1007/s11771-005-0125-7

登录浏览全文

4963

注册一个新账户 忘记密码

References

[1]

BajajR, AgrawalD P. Improving scheduling of tasks in a heterogeneous environment [J]. IEEE Transactions on Parallel and Distributed Systems, 2004, 15(2): 107-118

[2]

LongXiang, LiZhong-ze, GaoXian-peng, et al.. A new method to improve the I/O efficiency on network of workstations[J]. Journal of Computer Research and Development, 2000, 37(6): 650-656(in Chinese)

[3]

RajkumarBHigh Performance Cluster Computing: Architectures and Systems (Vol. 1)[M], 1999, New Jersey, Prentice Hall PTR Inc

[4]

BellK, ChienA, LauriaM. A high-performance cluster storage server[A]. Proceeding of the 11th IEEE International Symposium on High Performance Distributed Computing[C], 2002, Scotland, Edinburgh

[5]

Ma X S, Jiao X M, Campbell M, et al. Flexible and efficient parallel I/O for large-scale multi-component simulations[A]. Proceedings of the International Parallel and Distributed Processing Symposium[C]. New Mexico, 2003.

[6]

NancyT, DanielA, Reed. Automatic ARIMA time seies modeling for adaptive I/O prefetching[J]. IEEE Transactions on Parallel and Distributed Systems, 2004, 15(4): 362-377

[7]

KerenA, BarakA. Opportunity cost algorithms for reduction of I/O and interprocess communication overhead in a computing cluster[J]. IEEE Transactions on Parallel and Distributed Systems, 2003, 14(1): 39-50

[8]

Venugopal C R, Rao S S S P. Impact of delays in parallel I/O system: an empirical study[A]. Proceedings of the High Performance Distributed Computing [C]. New York, 1996.

[9]

ShenX H, LiaoW K, ChoudharyA, et al.. A high-performance application data environment for largescale scientific computations[J]. IEEE Transactions on Parallel and Distributed Systems, 2003, 14(12): 1262-1274

[10]

AguilarJ. A graph theoretical model for scheduling simultaneous I/O operations on parallel and distributed environments [J]. Parallel Processing Lettes, 2002, 12(1): 113-115

[11]

ScheuermannP, WeikumG, ZabbackP. Data partitioning and load balancing in parallel disk systems [J]. The International Journal on Very Large Data Bases, 1998, 7(1): 48-66

[12]

Copeland G, Alexander W, Bougher E, et al. Data placement in bubba[A]. Proceeding ACM SIGMOD Int’l Conf Management of Data[C]. Los Angeles, 1988.

[13]

Apon A W, Wolinski P D, Amerson G M. Sensitivity of cluster file system access to I/O server selection [A]. Proceeding of the 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid[C]. Berlin, 2002.

[14]

CarballeiraF G, CarreteroJ, CalderonA, et al.. An adaptive cache coherence protocol specification for parallel input/output systems[J]. IEEE Transactions on Parallel and Distributed Systems, 2004, 15(6): 533-545

[15]

SUN Jian-hua, JIN Hai, CHEN Hao, et al. Server scheduling scheme for asynchronous cluster video server [A]. Proceedings of the 17th International Conference on Advanced Information Networking and Applications[C]. Xi’an, 2003.

[16]

Ching A, Choudhary A, Coloma K, et al. Noncontinuous I/O accesses through MPI-IO[A]. Proceedings of the 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid [C]. Tokyo, 2003.

[17]

ZHOU Xin-rong, WEI Tong. A greedy I/O scheduling method in the storage system of clusters[A]. Proceedings of the 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid[C]. Tokyo, 2003.

[18]

KwanT, McgrathR, ReedD. Ncsas world wide web server design and performance[J]. Computer, 1995, 28(11): 67-74

[19]

AliS, MaciejewskiA A, SiegelH J, et al.. Measuring the robustness of a resource allocation[J]. IEEE Transactions on Parallel and Distributed Systems, 2004, 15(7): 630-641

[20]

Perez J M, Garcia F, Carretero J, et al. Data allocation and load balancing for heterogeneous cluster storage systems[A]. Proceedings of the 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid[C]. Tokyo, 2003.

AI Summary AI Mindmap
PDF

123

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/