Breaking student-concept sparsity barrier for cognitive diagnosis

Pengyang SHAO, Kun ZHANG, Chen GAO, Lei CHEN, Miaomiao CAI, Le WU, Yong LI, Meng WANG

Front. Comput. Sci. ›› 2025, Vol. 19 ›› Issue (11) : 1911363.

PDF(2451 KB)
PDF(2451 KB)
Front. Comput. Sci. ›› 2025, Vol. 19 ›› Issue (11) : 1911363. DOI: 10.1007/s11704-025-40591-2
Artificial Intelligence
RESEARCH ARTICLE

Breaking student-concept sparsity barrier for cognitive diagnosis

Author information +
History +

Abstract

Educational Cognitive Diagnosis (CD) aims to provide students’ mastery levels on different concepts. One common observation is that students often conduct many exercises but engage with a small subset of concepts, leading to a sparsity barrier. Current CD models mostly adopt mastery levels on all concepts as student modeling, overlooking the sparsity barrier. If a student does not interact with all concepts, we can not ensure that each dimension of mastery levels on concepts can be well-trained. In this paper, we propose a novel Enhancing Student Representations in Cognitive Diagnosis (ESR-CD), which combines application abilities and comprehension degrees for mastery levels on concepts. To model application ability, we propose a sparsity-based mask module that solely depends on the dense student-concept entries. Simultaneously, to further enhance comprehension degrees, we propose two layers: a matrix factorization layer and a relation refinement layer. Extensive experiments on two real-world datasets demonstrate the effectiveness of ESR-CD.

Graphical abstract

Keywords

cognitive diagnosis / student modeling / educational data mining

Cite this article

Download citation ▾
Pengyang SHAO, Kun ZHANG, Chen GAO, Lei CHEN, Miaomiao CAI, Le WU, Yong LI, Meng WANG. Breaking student-concept sparsity barrier for cognitive diagnosis. Front. Comput. Sci., 2025, 19(11): 1911363 https://doi.org/10.1007/s11704-025-40591-2

Pengyang SHAO is currently pursuing a PhD degree at Hefei University of Technology (HFUT), China. He received his Bachelor’s degree in 2019 from the same university. His research interest lies on data mining, and large language models. He has published several papers in leading conferences and journals, including KDD, WWW, ACM TOIS and SCIS

Kun ZHANG received a PhD degree in computer science and technology from the University of Science and Technology of China, China in 2019. He is currently a faculty member at the Hefei University of Technology (HFUT), China. His research interests include Natural Language Understanding and Recommender Systems. He has published several papers in referred journals and conferences, such as IEEE TSMC:S, IEEE TKDE, ACM TKDD, AAAI, KDD, ACL, and ICDM. He received the KDD 2018 Best Student Paper Award

Chen GAO is now a Faculty Member (Research-track AP) of BNRist, Tsinghua University, China. He obtained his PhD degree (advised by Prof. Yong Li and Prof. Depeng Jin) and bachelor’s degree from the Department of Electronic Engineering, Tsinghua University in 2021 and 2016, respectively. His research primarily focuses on data mining (recommender system and spatio-temporal data mining), large language model, embodied agent, etc., with over 60 papers in top-tier venues (50+ CCF-A), attracting over 4,000 citations

Lei CHEN is currently a postdoctoral researcher at Tsinghua University, China. He received his PhD from Hefei University of Technology, China in 2022. His research primarily focuses on fairness-aware recommender systems and large language model applications. He has published several papers in leading conferences and journals, including WWW, SIGIR, IEEE TKDE, and ACM TOIS

Miaomiao CAI is a PhD candidate at Hefei University of Technology, China, where she also earned her Bachelor’s degree in Engineering in 2020. Her research primarily focuses on debiasing techniques for recommender systems. She has published several papers in leading conferences and journals, including ACM KDD, ACM MM, and ACM TIST

Le WU is currently a professor at Hefei University of Technology (HFUT), China. She received her PhD degree from the University of Science and Technology of China (USTC), China. Her general area of research interests are data mining, recommender systems, and responsible user modeling. She has published more than 60 papers in referred journals and conferences, such as IEEE TKDE, NIPS, SIGIR, WWW, and AAAI. Dr. Le Wu is the recipient of the Best of SDM 2015 Award, and the Distinguished Dissertation Award from the China Association for Artificial Intelligence (CAAI) 2017

Yong LI received the BS degree from Huazhong University of Science and Technology, China in 2007, and the MS and the PhD degrees in electrical engineering from Tsinghua University, China in 2009 and 2012, respectively. Currently, he is a Faculty Member of the Department of Electronic Engineering, Tsinghua University, China. His research interests are in the areas of wireless networking, mobile computing and urban computing. Dr. Li has served as General Chair, TPC Chair, TPC Member for several international workshops and conferences, and he is on the editorial board of four international journals

Meng WANG is a professor at Hefei University of Technology, China. He received his BE degree and PhD degrees in the Special Class for the Gifted Young and the Department of Electronic Engineering and Information Science from the University of Science and Technology of China (USTC), China in 2003 and 2008, respectively. His current research interests include multimedia content analysis, computer vision, and pattern recognition. He is an associate editor of IEEE Transactions on Knowledge and Data Engineering (IEEE TKDE), IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT), IEEE Transactions on Multimedia (IEEE TMM), and IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS)

References

[1]
Chen X, Wu L, Liu F, Chen L, Zhang K, Hong R, Wang M. Disentangling cognitive diagnosis with limited exercise labels. In: Proceedings of the 37th International Conference on Neural Information Processing Systems. 2023, 792
[2]
Liu Y, Zhang T, Wang X, Yu G, Li T . New development of cognitive diagnosis models. Frontiers of Computer Science, 2023, 17( 1): 171604
[3]
Shen J, Qian H, Zhang W, Zhou A. Symbolic cognitive diagnosis via hybrid optimization for intelligent education systems. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence. 2024, 14928−14936
[4]
Gao W, Wang H, Liu Q, Wang F, Lin X, Yue L, Zhang Z, Lv R, Wang S. Leveraging transferable knowledge concept graph embedding for cold-start cognitive diagnosis. In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2023, 983−992
[5]
Liu H, Zhang T, Li F, Yu M, Yu G . A probabilistic generative model for tracking multi-knowledge concept mastery probability. Frontiers of Computer Science, 2024, 18( 3): 183602
[6]
Dai H, Zhang Y, Yun Y, An R, Zhang W, Shang X . Adaptive meta-knowledge dictionary learning for incremental knowledge tracing. Engineering Applications of Artificial Intelligence, 2024, 132: 107969
[7]
Wang F, Liu Q, Chen E, Huang Z, Chen Y, Yin Y, Huang Z, Wang S. Neural cognitive diagnosis for intelligent education systems. In: Proceedings of the 34th AAAI Conference on Artificial Intelligence. 2020, 6153−6161
[8]
Liu S, Qian H, Li M, Zhou A. QCCDM: a Q-augmented causal cognitive diagnosis model for student learning. In: Proceedings of the 26th European Conference on Artificial Intelligence. 2023, 1536−1543
[9]
Wang F, Gao W, Liu Q, Li J, Zhao G, Zhang Z, Huang Z, Zhu M, Wang S, Tong W, Chen E. A survey of models for cognitive diagnosis: new developments and future directions. 2024, arXiv preprint arXiv: 2407.05458
[10]
Liu Q. Towards a new generation of cognitive diagnosis. In: Proceedings of the 30th International Joint Conference on Artificial Intelligence. 2021, 4961−4964
[11]
Gao W, Liu Q, Huang Z, Yin Y, Bi H, Wang M C, Ma J, Wang S, Su Y. RCD: relation map driven cognitive diagnosis for intelligent education systems. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2021, 501−510
[12]
Li J, Wang F, Liu Q, Zhu M, Huang W, Huang Z, Chen E, Su Y, Wang S. HierCDF: a Bayesian network-based hierarchical cognitive diagnosis framework. In: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2022, 904−913
[13]
Wang F, Liu Q, Chen E, Huang Z, Yin Y, Wang S, Su Y . NeuralCD: a general framework for cognitive diagnosis. IEEE Transactions on Knowledge and Data Engineering, 2023, 35( 8): 8312–8327
[14]
Burke K. The Mindful School: How To Assess Thoughtful Outcomes: K-College. Palatine: IRI Skylight Publishing, 1993
[15]
Embretson S E, Reise S P. Item Response Theory for Psychologists. New York: Psychology Press, 2013
[16]
Ma H, Wang C, Zhu H, Yang S, Zhang X, Zhang X. Enhancing cognitive diagnosis using un-interacted exercises: a collaboration-aware mixed sampling approach. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence. 2024, 8877−8885
[17]
Gao W, Liu Q, Wang H, Yue L, Bi H, Gu Y, Yao F, Zhang Z, Li X, He Y. Zero-1-to-3: domain-level zero-shot cognitive diagnosis via one batch of early-bird students towards three diagnostic objectives. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence. 2024, 8417−8426
[18]
Li M, Qian H, Lv J, He M, Zhang W, Zhou A . Foundation model enhanced derivative-free cognitive diagnosis. Frontiers of Computer Science, 2025, 19( 1): 191318
[19]
Zhang Z, Liu Q, Jiang H, Wang F, Zhuang Y, Wu L, Gao W, Chen E. FairLISA: fair user modeling with limited sensitive attributes information. In: Proceedings of the 37th Conference on Neural Information Processing Systems. 2023, 1796
[20]
Zhang Y, Dai H, Yun Y, Liu S, Lan A, Shang X . Meta-knowledge dictionary learning on 1-bit response data for student knowledge diagnosis. Knowledge-Based Systems, 2020, 205: 106290
[21]
De La Torre J . Dina model and parameter estimation: a didactic. Journal of educational and behavioral statistics, 2009, 34( 1): 115–130
[22]
Wang S, Zeng Z, Yang X, Zhang X. Self-supervised graph learning for long-tailed cognitive diagnosis. In: Proceedings of the 37th AAAI Conference on Artificial Intelligence. 2023, 110−118
[23]
Qian H, Liu S, Li M, Li B, Liu Z, Zhou A. ORCDF: an oversmoothing-resistant cognitive diagnosis framework for student learning in online education systems. In: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2024, 2455−2466
[24]
Wu L, Chen X, Liu F, Xie J, Xia C, Tan Z, Tian M, Li J, Zhang K, Lian D, Hong R, Wang M . EduStudio: towards a unified library for student cognitive modeling. Frontiers of Computer Science, 2025, 19( 8): 198342
[25]
Liu S, Shen J, Qian H, Zhou A. Inductive cognitive diagnosis for fast student learning in web-based intelligent education systems. In: Proceedings of the ACM Web Conference 2024. 2024, 4260−4271
[26]
Zhang Y, Qin C, Shen D, Ma H, Zhang L, Zhang X, Zhu H. ReliCD: a reliable cognitive diagnosis framework with confidence awareness. In: Proceedings of 2023 IEEE International Conference on Data Mining (ICDM). 2023, 858−867
[27]
Li J, Liu Q, Wang F, Liu J, Huang Z, Yao F, Zhu L, Su Y. Towards the identifiability and explainability for personalized learner modeling: an inductive paradigm. In: Proceedings of the ACM Web Conference 2024. 2024, 3420−3431
[28]
Zhang Z, Wu L, Liu Q, Liu J, Huang Z, Yin Y, Zhuang Y, Gao W, Chen E . Understanding and improving fairness in cognitive diagnosis. Science China Information Sciences, 2024, 67( 5): 152106
[29]
Zhang D, Zhang K, Wu L, Tian M, Hong R, Wang M. Path-Specific causal reasoning for fairness-aware cognitive diagnosis. In: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2024, 4143−4154
[30]
Zhang Z, Liu Q, Hu Z, Zhan Y, Huang Z, Gao W, Mao Q. Enhancing fairness in meta-learned user modeling via adaptive sampling. In: Proceedings of the ACM Web Conference 2024. 2024, 3241−3252
[31]
Chen L, Wu L, Zhang K, Hong R, Lian D, Zhang Z, Zhou J, Wang M. Improving recommendation fairness via data augmentation. In: Proceedings of the ACM Web Conference 2023. 2023, 1012−1020
[32]
Jiang Y, Ma H, Zhang X, Li Z, Chang L . Incorporating metapath interaction on heterogeneous information network for social recommendation. Frontiers of Computer Science, 2024, 18( 1): 181302
[33]
Gao C, Wang S, Li S, Chen J, He X, Lei W, Li B, Zhang Y, Jiang P . CIRS: bursting filter bubbles by counterfactual interactive recommender system. ACM Transactions on Information Systems, 2023, 42( 1): 14
[34]
Cai M, Hou M, Chen L, Wu L, Bai H, Li Y, Wang M . Mitigating recommendation biases via group-alignment and global-uniformity in representation learning. ACM Transactions on Intelligent Systems and Technology, 2024, 15( 5): 101
[35]
Cai M, Chen L, Wang Y, Bai H, Sun P, Wu L, Zhang M, Wang M. Popularity-aware alignment and contrast for mitigating popularity bias. In: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2024, 187−198
[36]
Huang L, Ma H, He X, Chang L . Multi-affect(ed): improving recommendation with similarity-enhanced user reliability and influence propagation. Frontiers of Computer Science, 2021, 15( 5): 155331
[37]
Shuai J, Zhang K, Wu L, Sun P, Hong R, Wang M, Li Y. A review-aware graph contrastive learning framework for recommendation. In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2022, 1283−1293
[38]
Zhang J, Zhu Y, Liu Q, Wu S, Wang S, Wang L. Mining latent structures for multimedia recommendation. In: Proceedings of the 29th ACM International Conference on Multimedia. 2021, 3872−3880
[39]
He X, Deng K, Wang X, Li Y, Zhang Y, Wang M. LightGCN: simplifying and powering graph convolution network for recommendation. In: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 2020, 639−648
[40]
Zhuang Y, Liu Q, Zhao G, Huang Z, Huang W, Pardos Z A, Chen E, Wu J, Li X. A bounded ability estimation for computerized adaptive testing. In: Proceedings of the 37th International Conference on Neural Information Processing Systems. 2023, 111
[41]
Hu Y, Koren Y, Volinsky C. Collaborative filtering for implicit feedback datasets. In: Proceedings of the 8th IEEE International Conference on Data Mining. 2008, 263−272
[42]
He X, Liao L, Zhang H, Nie L, Hu X, Chua T S. Neural collaborative filtering. In: Proceedings of the 26th International Conference on World Wide Web. 2017, 173−182
[43]
Salakhutdinov R, Mnih A. Probabilistic matrix factorization. In: Proceedings of the 21st International Conference on Neural Information Processing Systems. 2007, 1257−1264
[44]
Wu C, Wang X, Lian D, Xie X, Chen E. A causality inspired framework for model interpretation. In: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2023, 2731−2741
[45]
Yu J, Lu M, Zhong Q, Yao Z, Tu S, Liao Z, Li X, Li M, Hou L, Zheng H T, Li J, Tang J. MoocRadar: a fine-grained and multi-aspect knowledge repository for improving cognitive student modeling in MOOCs. In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2023, 2924−2934
[46]
Reckase M D . The past and future of multidimensional item response theory. Applied Psychological Measurement, 1997, 21( 1): 25–36
[47]
Li S, Guan Q, Fang L, Xiao F, He Z, He Y, Luo W. Cognitive diagnosis focusing on knowledge concepts. In: Proceedings of the 31st ACM International Conference on Information & Knowledge Management. 2022, 3272−3281
[48]
Ma H, Li M, Wu L, Zhang H, Cao Y, Zhang X, Zhao X. Knowledge-sensed cognitive diagnosis for intelligent education platforms. In: Proceedings of the 31st ACM International Conference on Information & Knowledge Management. 2022, 1451−1460

Acknowledgements

This work has been supported in part by grants from the National Science and Technology Major Project (2021ZD0111802), the New Cornerstone Science Foundation through the XPLORER PRIZE, the National Natural Science Foundation of China (Grant Nos. 72188101, 62376086), and the Joint Funds of the National Natural Science Foundation of China (U22A2094).

Competing interests

The authors declare that they have no competing interests or financial conflicts to disclose.

RIGHTS & PERMISSIONS

2025 Higher Education Press
AI Summary AI Mindmap
PDF(2451 KB)

Accesses

Citations

Detail

Sections
Recommended

/