Schedule refinement for homogeneousmulti-core processors in the presence of manufacturing-caused heterogeneity
Zhi-xiang CHEN, Zhao-lin LI, Shan CAO, Fang WANG, Jie ZHOU
Schedule refinement for homogeneousmulti-core processors in the presence of manufacturing-caused heterogeneity
Multi-core homogeneous processors have been widely used to deal with computation-intensive embedded applications. However, with the continuous down scaling of CMOS technology, within-die variations in the manufacturing process lead to a significant spread in the operating speeds of cores within homogeneous multi-core processors. Task scheduling approaches, which do not consider such heterogeneity caused by within-die variations, can lead to an overly pessimistic result in terms of performance. To realize an optimal performance according to the actual maximum clock frequencies at which cores can run, we present a heterogeneity-aware schedule refining (HASR) scheme by fully exploiting the heterogeneities of homogeneous multi-core processors in embedded domains. We analyze and show how the actual maximum frequencies of cores are used to guide the scheduling. In the scheme, representative chip operating points are selected and the corresponding optimal schedules are generated as candidate schedules. During the booting of each chip, according to the actual maximum clock frequencies of cores, one of the candidate schedules is bound to the chip to maximize the performance. A set of applications are designed to evaluate the proposed scheme. Experimental results show that the proposed scheme can improve the performance by an average value of 22.2%, compared with the baseline schedule based on the worst case timing analysis. Compared with the conventional task scheduling approach based on the actual maximum clock frequencies, the proposed scheme also improves the performance by up to 12%.
Schedule refining / Multi-core processor / Heterogeneity / Representative chip operating point
[1] |
Aguilera, P., Lee, J., Farmahini-Farahani, A.,
CrossRef
Google scholar
|
[2] |
Bell, S., Edwards, B., Amann, J.,
CrossRef
Google scholar
|
[3] |
Bowman, K.A., Duvall, S.G., Meindl, J.D., 2002. Impact of die-to-die and within-die parameter fluctuations on the maximum clock frequency distribution for gigascale integration. IEEE J. Solid-State Circ., 37(2):183–190.
CrossRef
Google scholar
|
[4] |
Bowman, K.A., Alameldeen, A.R., Srinivasan, S.T.,
CrossRef
Google scholar
|
[5] |
Chon, H., Kim, T., 2009. Timing variation-aware task scheduling and binding for MPSoC. Proc. Asia and South Pacific Design Automation Conf., p.137–142.
CrossRef
Google scholar
|
[6] |
Dick, R.P., Rhodes, D.L., Wolf, W., 1998. TGFF: task graphs for free. Proc. 6th Int. Workshop on Hardware/Software Codesign, p.97–101.
CrossRef
Google scholar
|
[7] |
Dietrich, M., Haase, J., 2012. Process Variations and Probabilistic Integrated Circuit Design. Springer, New York, p.69–89.
CrossRef
Google scholar
|
[8] |
Ferrandi, F., Lanzi, P.L., Pilato, C.,
CrossRef
Google scholar
|
[9] |
Huang, L., Xu, Q., 2010. Performance yield-driven task allocation and scheduling for MPSoCs under process variation. Proc. 47th Design Automation Conf., p.326–331.
CrossRef
Google scholar
|
[10] |
Huang, W., Rajamani, K., Stan, M.R.,
CrossRef
Google scholar
|
[11] |
ITRS, 2013. International Technology Roadmap for Semiconductors.
|
[12] |
Khailany, B., Dally, W.J., Kapasi, U.J.,
CrossRef
Google scholar
|
[13] |
Khodabandeloo, B., Khonsari, A., Gholamian, F.,
CrossRef
Google scholar
|
[14] |
Lin, Y.C., Lu, F., Cheng, K.T., 2005. Pseudo-functional scan-based BIST for delay fault. Proc. 23rd IEEE VLSI Test Symp., p.229–234.
CrossRef
Google scholar
|
[15] |
Mirzoyan, D., Akesson, B., Goossens, K., 2012. Processvariation aware mapping of real-time streaming applications to MPSoCs for improved yield. Proc. 13th Int. Symp. on Quality Electronic Design, p.41–48.
CrossRef
Google scholar
|
[16] |
Mirzoyan, D., Akesson, B., Goossens, K., 2014. Processvariation-aware mapping of best-effort and real-time streaming applications to MPSoCs. ACM Trans. Embed. Comput. Syst., 13(2s):61.1–61.24.
CrossRef
Google scholar
|
[17] |
Momtazpour, M., Goudarzi, M., Sanaei, E., 2010a. Variation-aware task and communication scheduling in MPSoCs for power-yield maximization. IEICE Trans. Fundament. Electron. Commun. Comput. Sci., 93(12):2542–2550.
CrossRef
Google scholar
|
[18] |
Momtazpour, M., Sanaei, E., Goudarzi, M., 2010b. Poweryield optimization in MPSoC task scheduling under process variation. Proc. 11th Int. Symp. on Quality Electronic Design, p.747–754.
CrossRef
Google scholar
|
[19] |
Momtazpour, M., Ghorbani, M., Goudarzi, M.,
CrossRef
Google scholar
|
[20] |
Momtazpour, M., Goudarzi, M., Sanaei, E., 2013. Static statistical MPSoC power optimization by variation-aware task and communication scheduling. Microprocess. Microsyst., 37(8B):953–963.
CrossRef
Google scholar
|
[21] |
Omara, F.A., Arafa, M.M., 2010. Genetic algorithms for task scheduling problem. J. Parall. Distrib. Comput., 70(1):13–22.
CrossRef
Google scholar
|
[22] |
Ramamritham, K., 1995. Allocation and scheduling of precedence-related periodic tasks. IEEE Trans. Parall. Distrib. Syst., 6(4):412–420.
CrossRef
Google scholar
|
[23] |
Raychowdhury, A., Ghosh, S., Roy, K., 2005. A novel on-chip delay measurement hardware for efficient speed-binning. Proc. 11th IEEE Int. On-Line Testing Symp., p.287–292.
CrossRef
Google scholar
|
[24] |
Sarangi, S.R., Greskamp, B., Teodorescu, R.,
CrossRef
Google scholar
|
[25] |
Singhal, L., Bozorgzadeh, E., 2008. Process variation aware system-level task allocation using stochastic ordering of delay distributions. Proc. IEEE/ACM Int. Conf. on Computer-Aided Design, p.570–574.
CrossRef
Google scholar
|
[26] |
Stuijk, S., Geilen, M., Basten, T., 2006. SDF3: SDF for free. Proc. 6th Int. Conf. on Application of Concurrency to System Design, p.276–278.
CrossRef
Google scholar
|
[27] |
Taylor, M.B., Kim, J., Miller, J.,
CrossRef
Google scholar
|
[28] |
Topcuoglu, H., Hariri, S., Wu, M.Y., 2002. Performanceeffective and low-complexity task scheduling for heterogeneous computing. IEEE Trans. Parall. Distrib. Syst., 13(3):260–274.
CrossRef
Google scholar
|
[29] |
Von Mises, R., 1964. Mathematical Theory of Probability and Statistics. Academic Press, New York, p.329–367.
CrossRef
Google scholar
|
[30] |
Wang, F., Chen, Y., Nicopoulos, C.,
CrossRef
Google scholar
|
[31] |
Yi, Y., Han, W., Zhao, X.,
CrossRef
Google scholar
|
[32] |
Yu, Z., Baas, B.M., 2009. High performance, energy efficiency, and scalability with GALS chip multiprocessors. IEEE Trans. VLSI Syst., 17(1):66–79.
CrossRef
Google scholar
|
[33] |
Zhao, W., Liu, F., Agarwal, K.,
CrossRef
Google scholar
|
/
〈 | 〉 |