A survey of testing automated driving system
Zheng LI , Wentai ZHU , Haohui HUANG , Yu WANG , Linzhang WANG
Front. Comput. Sci. ›› 2026, Vol. 20 ›› Issue (12) : 2012205
A survey of testing automated driving system
Automated driving systems (ADSs) have made significant strides in recent years through the combined efforts of academia and industry. A typical ADS is composed of various complex modules, including perception, planning, and control. As emerging and complex computer programs, ADSs inevitably contain flaws, making it crucial to ensure their safety since any unsafe behavior can result in catastrophic outcomes. Testing is widely recognized as a key approach to ensuring ADS safety by uncovering unsafe behaviors. However, designing effective testing techniques for ADSs is exceptionally challenging due to the high complexity and multidisciplinary nature of these systems. Although an extensive body of literature focuses on ADS testing and several surveys summarizing technical advancements have been published, most concentrate on system-level testing performed within software simulators. Consequently, they often overlook the distinct characteristics, testing requirements, and datasets associated with various ADS modules. In this paper, we present a comprehensive survey of existing ADS testing literature. We begin by investigating the testing infrastructure for ADSs, including available datasets and tools, detailing their capabilities and characteristics. We then survey testing techniques for individual ADS modules (e.g., AI-based modules and firmware) and the integrated system, highlighting technical differences between validation layers. Finally, based on our findings, we identify key challenges and outline potential research opportunities in the field.
ADS testing / module-level testing / system-level testing / firmware testing / dataset
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
Zhao X, Robu V, Flynn D, Salako K, Strigini L. Assessing the safety and reliability of autonomous vehicles from road testing. In: Proceedings of the 30th IEEE International Symposium on Software Reliability Engineering. 2019, 13–23 |
| [8] |
Rong G, Shin B H, Tabatabaee H, Lu Q, Lemke S, Možeiko M, Boise E, Uhm G, Gerow M, Mehta S, Agafonov E, Kim T H, Sterner E, Ushiroda K, Reyes M, Zelenkovsky D, Kim S. LGSVL simulator: a high fidelity simulator for autonomous driving. In: Proceedings of the 23rd IEEE International Conference on Intelligent Transportation Systems. 2020, 1–6 |
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
Fowler D S, Bryans J, Cheah M, Wooderson P, Shaikh S A. A method for constructing automotive cybersecurity tests, a CAN fuzz testing example. In: Proceedings of the 19th IEEE International Conference on Software Quality, Reliability and Security Companion. 2019, 1–8 |
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
|
| [24] |
|
| [25] |
|
| [26] |
Dai J, Gao B, Luo M, Huang Z, Li Z, Zhang Y, Yang M. SCTRANS: constructing a large public scenario dataset for simulation testing of autonomous driving systems. In: Proceedings of the 46th IEEE/ACM International Conference on Software Engineering. 2024, 591–603 |
| [27] |
|
| [28] |
|
| [29] |
|
| [30] |
|
| [31] |
|
| [32] |
|
| [33] |
|
| [34] |
|
| [35] |
|
| [36] |
|
| [37] |
|
| [38] |
|
| [39] |
|
| [40] |
|
| [41] |
|
| [42] |
|
| [43] |
|
| [44] |
|
| [45] |
|
| [46] |
|
| [47] |
|
| [48] |
|
| [49] |
|
| [50] |
|
| [51] |
|
| [52] |
|
| [53] |
|
| [54] |
|
| [55] |
|
| [56] |
|
| [57] |
|
| [58] |
|
| [59] |
|
| [60] |
|
| [61] |
|
| [62] |
|
| [63] |
|
| [64] |
|
| [65] |
|
| [66] |
|
| [67] |
|
| [68] |
|
| [69] |
|
| [70] |
|
| [71] |
|
| [72] |
|
| [73] |
|
| [74] |
|
| [75] |
|
| [76] |
|
| [77] |
Gao X, Wang Z, Feng Y, Ma L, Chen Z, Xu B. MultiTest: physical-aware object insertion for testing multi-sensor fusion perception systems. In: Proceedings of the 46th IEEE/ACM International Conference on Software Engineering. 2024, 139 |
| [78] |
|
| [79] |
Zhou H, Li W, Kong Z, Guo J, Zhang Y, Yu B, Zhang L, Liu C. DeepBillboard: systematic physical-world testing of autonomous driving systems. In: Proceedings of the 42nd IEEE/ACM International Conference on Software Engineering. 2020, 347–358 |
| [80] |
|
| [81] |
|
| [82] |
|
| [83] |
|
| [84] |
|
| [85] |
|
| [86] |
|
| [87] |
|
| [88] |
|
| [89] |
|
| [90] |
|
| [91] |
|
| [92] |
|
| [93] |
|
| [94] |
|
| [95] |
|
| [96] |
|
| [97] |
|
| [98] |
|
| [99] |
|
| [100] |
|
| [101] |
|
| [102] |
|
| [103] |
|
| [104] |
|
| [105] |
|
| [106] |
Lu C, Yue T, Ali S. DeepScenario: an open driving scenario dataset for autonomous driving system testing. In: Proceedings of the 20th IEEE/ACM International Conference on Mining Software Repositories. 2023, 52–56 |
| [107] |
|
| [108] |
|
| [109] |
|
| [110] |
|
| [111] |
|
| [112] |
Tang S, Zhang Z, Zhou J, Zhou Y, Li Y F, Xue Y. EvoScenario: integrating road structures into critical scenario generation for autonomous driving system testing. In: Proceedings of the 34th IEEE International Symposium on Software Reliability Engineering. 2023, 309–320 |
| [113] |
|
| [114] |
Haq F U, Shin D, Briand L. Efficient online testing for DNN-enabled systems using surrogate-assisted and many-objective optimization. In: Proceedings of the 44th IEEE/ACM International Conference on Software Engineering. 2022, 811–822 |
| [115] |
|
| [116] |
|
| [117] |
|
| [118] |
|
| [119] |
|
| [120] |
Yang Y, Kujanpää K, Babadi I A, Pajarinen J, Ilin A. Suicidal pedestrian: generation of safety-critical scenarios for autonomous vehicles. In: Proceedings of the 26th IEEE International Conference on Intelligent Transportation Systems. 2023, 1983–1988 |
| [121] |
Underwood R, Luu Q H, Liu H. A metamorphic testing framework and toolkit for modular automated driving systems. In: Proceedings of the 8th IEEE/ACM International Workshop on Metamorphic Testing. 2023, 17–24 |
| [122] |
|
| [123] |
|
| [124] |
|
| [125] |
Li G, Li Y, Jha S, Tsai T, Sullivan M, Hari S K S, Kalbarczyk Z, Iyer R. AV-FUZZER: finding safety violations in autonomous driving systems. In: Proceedings of the 31st IEEE International Symposium on Software Reliability Engineering. 2020, 25–36 |
| [126] |
|
| [127] |
|
| [128] |
|
| [129] |
Zhou J, Tang S, Guo Y, Li Y F, Xue Y. From collision to verdict: responsibility attribution for autonomous driving systems testing. In: Proceedings of the 34th IEEE International Symposium on Software Reliability Engineering. 2023, 321–332 |
| [130] |
|
| [131] |
|
| [132] |
|
| [133] |
|
| [134] |
|
| [135] |
|
| [136] |
|
| [137] |
|
| [138] |
|
| [139] |
|
| [140] |
|
| [141] |
|
| [142] |
|
| [143] |
|
| [144] |
|
| [145] |
|
| [146] |
|
| [147] |
|
| [148] |
|
| [149] |
|
| [150] |
|
| [151] |
|
| [152] |
|
| [153] |
|
| [154] |
|
| [155] |
|
| [156] |
|
| [157] |
|
| [158] |
|
| [159] |
|
| [160] |
|
| [161] |
|
| [162] |
|
| [163] |
|
| [164] |
|
| [165] |
|
| [166] |
|
| [167] |
|
| [168] |
|
| [169] |
|
| [170] |
|
| [171] |
|
| [172] |
|
| [173] |
|
| [174] |
|
| [175] |
|
| [176] |
|
| [177] |
|
| [178] |
|
| [179] |
|
| [180] |
|
| [181] |
|
| [182] |
|
| [183] |
|
| [184] |
|
| [185] |
|
| [186] |
|
| [187] |
|
| [188] |
|
| [189] |
|
| [190] |
|
| [191] |
|
| [192] |
|
| [193] |
|
| [194] |
|
| [195] |
|
| [196] |
|
| [197] |
|
| [198] |
|
| [199] |
|
| [200] |
|
| [201] |
|
| [202] |
|
| [203] |
|
| [204] |
|
Higher Education Press
/
| 〈 |
|
〉 |