Schedule refinement for homogeneousmulti-core processors in the presence of manufacturing-caused heterogeneity

Zhi-xiang CHEN, Zhao-lin LI, Shan CAO, Fang WANG, Jie ZHOU

PDF(903 KB)
PDF(903 KB)
Front. Inform. Technol. Electron. Eng ›› 2015, Vol. 16 ›› Issue (12) : 1018-1033. DOI: 10.1631/FITEE.1500035
Orginal Article
Orginal Article

Schedule refinement for homogeneousmulti-core processors in the presence of manufacturing-caused heterogeneity

Author information +
History +

Abstract

Multi-core homogeneous processors have been widely used to deal with computation-intensive embedded applications. However, with the continuous down scaling of CMOS technology, within-die variations in the manufacturing process lead to a significant spread in the operating speeds of cores within homogeneous multi-core processors. Task scheduling approaches, which do not consider such heterogeneity caused by within-die variations, can lead to an overly pessimistic result in terms of performance. To realize an optimal performance according to the actual maximum clock frequencies at which cores can run, we present a heterogeneity-aware schedule refining (HASR) scheme by fully exploiting the heterogeneities of homogeneous multi-core processors in embedded domains. We analyze and show how the actual maximum frequencies of cores are used to guide the scheduling. In the scheme, representative chip operating points are selected and the corresponding optimal schedules are generated as candidate schedules. During the booting of each chip, according to the actual maximum clock frequencies of cores, one of the candidate schedules is bound to the chip to maximize the performance. A set of applications are designed to evaluate the proposed scheme. Experimental results show that the proposed scheme can improve the performance by an average value of 22.2%, compared with the baseline schedule based on the worst case timing analysis. Compared with the conventional task scheduling approach based on the actual maximum clock frequencies, the proposed scheme also improves the performance by up to 12%.

Keywords

Schedule refining / Multi-core processor / Heterogeneity / Representative chip operating point

Cite this article

Download citation ▾
Zhi-xiang CHEN, Zhao-lin LI, Shan CAO, Fang WANG, Jie ZHOU. Schedule refinement for homogeneousmulti-core processors in the presence of manufacturing-caused heterogeneity. Front. Inform. Technol. Electron. Eng, 2015, 16(12): 1018‒1033 https://doi.org/10.1631/FITEE.1500035

References

[1]
Aguilera, P., Lee, J., Farmahini-Farahani, A., , 2014. Process variation-aware workload partitioning algorithms for GPUs supporting spatial-multitasking. Design, Automation and Test in Europe Conf. and Exhibition, p.176.1–176.6.
CrossRef Google scholar
[2]
Bell, S., Edwards, B., Amann, J., , 2008. TILE64 processor: a 64-core SoC with mesh interconnect. IEEE Int. Solid-State Circuits Conf., p.588–598.
CrossRef Google scholar
[3]
Bowman, K.A., Duvall, S.G., Meindl, J.D., 2002. Impact of die-to-die and within-die parameter fluctuations on the maximum clock frequency distribution for gigascale integration. IEEE J. Solid-State Circ., 37(2):183–190.
CrossRef Google scholar
[4]
Bowman, K.A., Alameldeen, A.R., Srinivasan, S.T., , 2009. Impact of die-to-die and within-die parameter variations on the clock frequency and throughput of multi-core processors. IEEE Trans. VLSI Syst., 17(12):1679–1690.
CrossRef Google scholar
[5]
Chon, H., Kim, T., 2009. Timing variation-aware task scheduling and binding for MPSoC. Proc. Asia and South Pacific Design Automation Conf., p.137–142.
CrossRef Google scholar
[6]
Dick, R.P., Rhodes, D.L., Wolf, W., 1998. TGFF: task graphs for free. Proc. 6th Int. Workshop on Hardware/Software Codesign, p.97–101.
CrossRef Google scholar
[7]
Dietrich, M., Haase, J., 2012. Process Variations and Probabilistic Integrated Circuit Design. Springer, New York, p.69–89.
CrossRef Google scholar
[8]
Ferrandi, F., Lanzi, P.L., Pilato, C., , 2010. Ant colony heuristic for mapping and scheduling tasks and communications on heterogeneous embedded systems. IEEE Trans. Comput.-Aid. Des. Integr. Circ. Syst., 29(6):911–924.
CrossRef Google scholar
[9]
Huang, L., Xu, Q., 2010. Performance yield-driven task allocation and scheduling for MPSoCs under process variation. Proc. 47th Design Automation Conf., p.326–331.
CrossRef Google scholar
[10]
Huang, W., Rajamani, K., Stan, M.R., , 2011. Scaling with design constraints: predicting the future of big chips. IEEE Micro, 31(4):16–29.
CrossRef Google scholar
[11]
ITRS, 2013. International Technology Roadmap for Semiconductors.
[12]
Khailany, B., Dally, W.J., Kapasi, U.J., , 2001. Imagine: media processing with streams. IEEE Micro, 21(2):35–46.
CrossRef Google scholar
[13]
Khodabandeloo, B., Khonsari, A., Gholamian, F., , 2014. Scenario-based quasi-static task mapping and scheduling for temperature-efficient MPSoC design under process variation. Microprocess. Microsyst., 38(5):399–414.
CrossRef Google scholar
[14]
Lin, Y.C., Lu, F., Cheng, K.T., 2005. Pseudo-functional scan-based BIST for delay fault. Proc. 23rd IEEE VLSI Test Symp., p.229–234.
CrossRef Google scholar
[15]
Mirzoyan, D., Akesson, B., Goossens, K., 2012. Processvariation aware mapping of real-time streaming applications to MPSoCs for improved yield. Proc. 13th Int. Symp. on Quality Electronic Design, p.41–48.
CrossRef Google scholar
[16]
Mirzoyan, D., Akesson, B., Goossens, K., 2014. Processvariation-aware mapping of best-effort and real-time streaming applications to MPSoCs. ACM Trans. Embed. Comput. Syst., 13(2s):61.1–61.24.
CrossRef Google scholar
[17]
Momtazpour, M., Goudarzi, M., Sanaei, E., 2010a. Variation-aware task and communication scheduling in MPSoCs for power-yield maximization. IEICE Trans. Fundament. Electron. Commun. Comput. Sci., 93(12):2542–2550.
CrossRef Google scholar
[18]
Momtazpour, M., Sanaei, E., Goudarzi, M., 2010b. Poweryield optimization in MPSoC task scheduling under process variation. Proc. 11th Int. Symp. on Quality Electronic Design, p.747–754.
CrossRef Google scholar
[19]
Momtazpour, M., Ghorbani, M., Goudarzi, M., , 2011. Simultaneous variation-aware architecture exploration and task scheduling for MPSoC energy minimization. Proc. 21st Symp. on GLSVLSI, p.271–276.
CrossRef Google scholar
[20]
Momtazpour, M., Goudarzi, M., Sanaei, E., 2013. Static statistical MPSoC power optimization by variation-aware task and communication scheduling. Microprocess. Microsyst., 37(8B):953–963.
CrossRef Google scholar
[21]
Omara, F.A., Arafa, M.M., 2010. Genetic algorithms for task scheduling problem. J. Parall. Distrib. Comput., 70(1):13–22.
CrossRef Google scholar
[22]
Ramamritham, K., 1995. Allocation and scheduling of precedence-related periodic tasks. IEEE Trans. Parall. Distrib. Syst., 6(4):412–420.
CrossRef Google scholar
[23]
Raychowdhury, A., Ghosh, S., Roy, K., 2005. A novel on-chip delay measurement hardware for efficient speed-binning. Proc. 11th IEEE Int. On-Line Testing Symp., p.287–292.
CrossRef Google scholar
[24]
Sarangi, S.R., Greskamp, B., Teodorescu, R., , 2008. VARIUS: a model of process variation and resulting timing errors for microarchitects. IEEE Trans. Semicond. Manufact., 21(1):3–13.
CrossRef Google scholar
[25]
Singhal, L., Bozorgzadeh, E., 2008. Process variation aware system-level task allocation using stochastic ordering of delay distributions. Proc. IEEE/ACM Int. Conf. on Computer-Aided Design, p.570–574.
CrossRef Google scholar
[26]
Stuijk, S., Geilen, M., Basten, T., 2006. SDF3: SDF for free. Proc. 6th Int. Conf. on Application of Concurrency to System Design, p.276–278.
CrossRef Google scholar
[27]
Taylor, M.B., Kim, J., Miller, J., , 2002. The raw microprocessor: a computational fabric for software circuits and general-purpose programs. IEEE Micro, 22(2):25–35.
CrossRef Google scholar
[28]
Topcuoglu, H., Hariri, S., Wu, M.Y., 2002. Performanceeffective and low-complexity task scheduling for heterogeneous computing. IEEE Trans. Parall. Distrib. Syst., 13(3):260–274.
CrossRef Google scholar
[29]
Von Mises, R., 1964. Mathematical Theory of Probability and Statistics. Academic Press, New York, p.329–367.
CrossRef Google scholar
[30]
Wang, F., Chen, Y., Nicopoulos, C., , 2011. Variationaware task and communication mapping for MPSoC architecture. IEEE Trans. Comput.-Aid. Des. Integr. Circ. Syst., 30(2):295–307.
CrossRef Google scholar
[31]
Yi, Y., Han, W., Zhao, X., , 2009. An ILP formulation for task mapping and scheduling on multi-core architectures. Design, Automation and Test in Europe Conf. and Exhibition, p.33–38.
CrossRef Google scholar
[32]
Yu, Z., Baas, B.M., 2009. High performance, energy efficiency, and scalability with GALS chip multiprocessors. IEEE Trans. VLSI Syst., 17(1):66–79.
CrossRef Google scholar
[33]
Zhao, W., Liu, F., Agarwal, K., , 2009. Rigorous extraction of process variations for 65-nm CMOS design. IEEE Trans. Semicond. Manufact., 22(1):196–203.
CrossRef Google scholar

RIGHTS & PERMISSIONS

2015 Zhejiang University and Springer-Verlag Berlin Heidelberg
PDF(903 KB)

Accesses

Citations

Detail

Sections
Recommended

/