Trustworthy evaluation of large language models
Xin-Yi ZHANG , Han-Jia YE , De-Chuan ZHAN
Front. Comput. Sci. ›› 2026, Vol. 20 ›› Issue (2) : 2002324
Trustworthy evaluation of large language models
| [1] |
|
| [2] |
Huang Y, Sun L, Wang H, Wu S, Zhang Q, Li Y, Gao C, Huang Y, Lyu W, Zhang Y, Li X, Sun H, Liu Z, Liu Y, Wang Y, Zhang Z, Vidgen B, Kailkhura B, Xiong C, Xiao C, Li C, Xing E P, Huang F, Liu H, Ji H, Wang H, Zhang H, Yao H, Kellis M, Zitnik M, Jiang M, Bansal M, Zou J, Pei J, Liu J, Gao J, Han J, Zhao J, Tang J, Wang J, Vanschoren J, Mitchell J, Shu K, Xu K, Chang K W, He L, Huang L, Backes M, Gong N Z, Yu P S, Chen P Y, Gu Q, Xu R, Ying R, Ji S, Jana S, Chen T, Liu T, Zhou T, Wang W Y, Li X, Zhang X, Wang X, Xie X, Chen X, Wang X, Liu Y, Ye Y, Cao Y, Chen Y, Zhao Y. TrustLLM: trustworthiness in large language models. In: Proceedings of the 41st International Conference on Machine Learning. 2024, 20166–20270 |
| [3] |
|
| [4] |
Toreini E, Aitken M, Coopamootoo K, Elliott K, Zelaya C G, van Moorsel A. The relationship between trust in AI and trustworthy machine learning technologies. In: Proceedings of 2020 Conference on Fairness, Accountability, and Transparency. 2020, 272–283 |
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
|
Higher Education Press
/
| 〈 |
|
〉 |