A comprehensive exploration of personalized learning in smart education: from student modeling to personalized recommendations

Siyu WU; Yang CAO; Runze LI; Jiajun CUI; Hong QIAN; Bo JIANG; Wei ZHANG

doi:10.1007/s11704-026-50579-1

Front. Comput. Sci. ›› 2027, Vol. 21 ›› Issue (2) :2102334 DOI: 10.1007/s11704-026-50579-1

Artificial Intelligence

REVIEW ARTICLE

A comprehensive exploration of personalized learning in smart education: from student modeling to personalized recommendations

Author information +

History +

PDF (3427KB)

Abstract

With the development of artificial intelligence, personalized learning has attracted much attention as an integral part of intelligent education. In recent years, countries and regions such as China, the United States, and the European Union have increasingly recognized the importance of personalized learning, emphasizing its potential to integrate large-scale education with individualized instruction effectively. This survey provides a comprehensive analysis of personalized learning by reviewing relevant studies published in major conferences and journals between January 2017 and April 2025. We examine its definition, objectives, and underlying educational theories, highlighting its pedagogical significance. Furthermore, we explore personalized learning from two key dimensions: student modeling and personalized recommendations. Student modeling is analyzed from both cognitive and non-cognitive perspectives, while recommendation approaches are categorized based on their specific objectives. Additionally, we investigate the interplay between these components and their role in enhancing personalized learning. Beyond theoretical and algorithmic insights, this survey reviews real-world applications, demonstrating personalized learning’s effectiveness in educational practice. Finally, we discuss key challenges and future directions, offering a multidimensional perspective that bridges theory and practice.

Graphical abstract

Keywords

personalized learning / student modeling / personalized recommendation / cognitive diagnosis / educational theory

Cite this article

Download citation ▾

Siyu WU, Yang CAO, Runze LI, Jiajun CUI, Hong QIAN, Bo JIANG, Wei ZHANG. A comprehensive exploration of personalized learning in smart education: from student modeling to personalized recommendations. Front. Comput. Sci., 2027, 21 (2) : 2102334 DOI:10.1007/s11704-026-50579-1

登录浏览全文

4963

注册一个新账户忘记密码

1 Introduction

Throughout history, the landscape of education has continually evolved, undergoing significant and transformative changes. Personalized learning, positioned as a pivotal focus on education, has emerged as a crucial strategy to address the distinct needs and objectives of individual learners.

Personalized learning has evolved from early educational research on individual differences [1] to computer-assisted instruction [2], eventually becoming an integral part of intelligent educational systems through recommender systems and student modeling [3,4]. With the advent of big data technologies, personalized learning has entered a data-driven era, enabling more precise adaptation to individual learner needs [5].

This evolution has led to personalized adaptive learning, which dynamically adjusts instructional content and strategies based on students’ abilities and progress. Figure 1 outlines this developmental trajectory, illustrating key milestones from the early 1800s to the present. The trajectory of personalized learning has traversed an evolutionary continuum, progressing from simplicity to complexity, from mechanization to intelligence, and from theoretical conceptualization to practical implementation. By tailoring content and learning paths to individual characteristics, personalized learning enhances engagement, improves learning outcomes, and accommodates diverse educational needs [6–8]. It has demonstrated effectiveness across traditional classrooms, online education [9], workforce training [10], and career development [11].

As education increasingly shifts towards intelligent and adaptive models, research in personalized learning has become essential for addressing student diversity, improving motivation, and reducing dropout rates [12,13]. The growing prominence of personalized learning in academia, policy discussions, and media further highlights its significance [14].

Although personalized learning has made significant progress, existing review studies often focus on specific aspects and lack a comprehensive and systematic overview of the field. To fill this gap, this study systematically analyzes personalized learning, exploring its definitions, goals, educational theories, and key technologies. Special emphasis is placed on student modeling and personalized recommendations, as these components form the foundation of personalized learning implementation. By synthesizing recent advances and identifying future research directions, this review aims to provide valuable insights for researchers and practitioners in the field.

1.1 Comparison to relevant surveys

Recent studies have explored personalized learning from various perspectives, yielding valuable insights. Chen et al. [15] examined the impact of individual differences, including learning styles and abilities, on personalized learning. Shemshack et al. [16] analyzed personalized learning terminology, complementing our definition in Section 2. Bernacki et al. [17] reviewed empirical research using Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines, focusing on specific contexts and learner characteristics. While sharing a focus on educational theories, their study does not delve as deeply into the intricate relationship between educational theories and personalized learning. Essa et al. [18] extensively studied learning styles, primarily emphasizing machine learning methods for learner classification. While our review includes learning style identification, we adopt a broader scope, incorporating sentiment and behavioral analysis within learning analytics. Many reviews focus on personalized learning recommendations, emphasizing algorithms such as deep learning, machine learning, and meta-learning [19–21]. Some studies specialize in resource or learning path recommendations [22], often limiting their scope to specific techniques. In contrast, our review provides a more comprehensive analysis covering not only personalized recommendations but also other important aspects of personalized learning, such as cognitive diagnosis and learning styles.

1.2 Contributions

This paper presents a systematic and comprehensive review of personalized learning, encompassing its definitions, goals, methodologies, educational theories, practical applications, and the interplay between student modeling and personalized recommendations. Through detailed classification and in-depth analysis, it provides readers with a holistic understanding of the field. The main contributions of this paper are as follows:

● New taxonomy. Although comprehensive personalized learning systems typically consist of four key components: student modeling, content modeling, personalized recommendations, and instructional interventions [23], this paper focuses on the two core components, student modeling and personalized recommendations, as they are sufficient to capture the complete process of personalized learning implementation. To refine the classification framework, student modeling is divided into cognitive and non-cognitive categories. Cognitive modeling encompasses cognitive diagnosis and student performance prediction (cognitive), whereas non-cognitive modeling includes learning style analysis, sentiment analysis, behavior analysis, and student performance prediction (non-cognitive). Similarly, personalized recommendations are categorized into path recommendations, course recommendations, and exercise recommendations. Each classification is further delineated based on technical methodologies, research focus, and application domains. For example, in student behavior analysis, existing studies are categorized according to technical approaches such as clustering algorithms, association rules, machine learning, and hybrid methods, offering a systematic organization of prior research.

● A comprehensive and systematic review. This paper offers a comprehensive, multidimensional review of personalized learning, covering its definition, theoretical foundations, technical pipeline from student modeling to personalized recommendations, and real-world applications. Beyond technical methodologies, we examine its scientific basis and practical implications from an educational theory perspective.

● Latest information. This paper systematically collects and analyzes the latest research findings from January 2017 to January 2025, providing a comprehensive reflection on the latest advances and trends in the field of personalized learning.

● Application notes. This paper will systematically present real-world use cases of personalized learning that demonstrate the effects of personalized learning in real educational settings, providing a link between theory and practice.

● Future directions. We critically analyze the limitations of current state-of-the-art approaches and identify key challenges in the field across four critical areas: data quality, assessment systems, technological limitations, and ethical concerns. Based on these insights, we propose promising future research directions to address these challenges and advance the field of personalized learning.

1.3 Survey methodology

We systematically select articles from the field of personalized learning through a four-step procedure: retrieval, initial screening, fine-grained selection, and supplementation. The detailed workflow and the number of the corresponding articles are illustrated in Fig. 2(a). In the first step of retrieval, we conduct keyword-based queries on major academic search engines, using terms such as (“[intitle: personalized AND intitle: learning]” and (“[intitle: adaptive AND intitle: learning]”). This process yields a comprehensive initial set of publications. During the initial screening, we remove duplicates and retain articles published between 2017 and 2025. To ensure both comprehensiveness and academic rigor, we prioritize publications from high-impact journals and premier conferences (e.g., EDM, Computers & Education, IEEE TLT, AAAI, KDD, SIGIR), while also incorporating other influential or contextually significant works. The selection process is guided by multiple criteria, including research novelty, analytical rigor, and overall scholarly impact.

The subsequent fine-grained selection involves a thorough examination of the remaining corpus to identify works whose primary contribution lies in student modeling or personalized recommendation, thereby ensuring the thematic coherence of this survey. Given that student modeling encompasses both cognitive and non-cognitive aspects, we employ refined keywords for filtering, such as: (“[intitle: student* AND (intitle: modeling OR intitle: diagnosis OR intitle: analytics)]”) OR (“[intitle: learner* AND (intitle: modeling OR intitle: analysis)]”) OR (“[intitle: sentiment AND intitle: analysis]”). Similarly, for personalized recommendation, we used: (“[intitle: personalized AND (intitle: recommendation OR intitle: education)]” OR “[intitle: recommendation AND (intitle: exercise OR intitle: learning)]”).

Finally, to enhance the comprehensiveness of our survey, we employ a backward snowballing strategy. Starting from the articles selected in the third step, we trace back to the earlier influential articles they cited, ultimately forming the literature dataset for our research. The domain-wise distribution of the final article set is depicted in Fig. 2(b).

1.4 Article organization

As illustrated in Fig. 3, this survey is structured as follows. Section 2 provides a detailed discussion on the definition and objectives of personalized learning, establishing a foundational framework for the review and linking it to the educational theories discussed in Section 3. Section 3 explores the theoretical foundations of personalized learning, highlighting key educational theories that underpin its development and offering a comprehensive perspective on education to support subsequent discussions. Section 4 examines student modeling, covering cognitive (Section 4.1) and non-cognitive modeling (Subsection 4.2), and categorizing studies by research focus and technical methodology. This provides a comprehensive learner profile, forming the basis for personalized recommendations in Section 5. Building upon this foundation, Section 5 explores the second core aspect of personalized learning: personalized recommendations. This chapter categorizes personalized recommendations into three aspects: learning path recommendations, exercise recommendations, and course recommendations. Section 6 presents real-world applications that integrate the previously discussed theories and methodologies, showcasing actual tools and platforms implemented in practice. Section 7 outlines key challenges and future research directions, offering insights into unresolved issues and potential advancements in the field. Finally, Section 8 provides a summary of the survey. This structured approach thoroughly examines personalized learning across educational theory, technology, and practice, aiming to equip academics and practitioners with a well-rounded perspective.

2 Definition and goals

2.1 Definitions of personalized learning

Definitions of personalized learning from different organizations demonstrate the diversity and complexity of the concept, with different emphases, scopes, and implementation strategies. Globally, there is no standardized definition [24], yet a comparative analysis of multiple organizations’ definitions provides a clear picture of the evolution of the concept and its specific application in different educational contexts.

The definition of personalized learning was first put forward by the Organization for Economic Co-operation and Development (OECD, 2006) reflects its systemic and holistic nature, arguing that personalized learning is not just about individual adjustments to teaching methods but rather a redesign of education through five dimensions: learning assessments, teaching strategies, curriculum choices, school organization, and partnerships. This definition emphasizes a holistic reform of the education system, focusing on how to meet individual student needs and promote the optimization of the education process through the integration of feedback mechanisms and personalized teaching strategies.

In contrast, the International Society for Technology in Education (ISTE, 2014) puts the spotlight on the role of technology, arguing that the core of personalized learning lies in the customization and optimization of each student’s learning experience through the use of technological tools. ISTE’s definition highlights how technology can be a bridge to support personalized teaching and learning, especially in the age of information technology, which has become an important means of promoting personalized learning. In this perspective, personalized learning is more often reflected in the digital customization of educational resources and learning paths, with an emphasis on the adaptability and accessibility of tools. Meanwhile, the International Association for K-12 Online Learning (INACOL, 2016) tends to focus more on the students themselves, proposing a student-centered definition of personalized learning that emphasizes the need for education to be adapted to the interests, strengths, and needs of students. This perspective dovetails with the U.S. Department of Education’s (2017) definition in its National Education Technology Plan [25], which proposes to personalize each student’s pace and instructional approach, emphasizing the individualization and flexibility of education. Both INACOL and the U.S. Department of Education’s definitions exemplify the application of personalized learning to actual teaching and learning, especially at the K-12 education level, with a focus on tailoring instruction to students’ individual differences to customize instructional content and pacing. In contrast, the Stanford Research Institute (SRI, 2018) [26] provides a more in-depth definition, emphasizing that personalized learning is not limited to adapting instructional strategies but should also be comprehensively adapted to the needs of the student through customized assessments, curricula, and instructional methods. This definition reflects a deeper understanding of individual differences and promotes the incorporation of multiple instructional elements into the educational process to achieve more comprehensive personalized learning.

Taken together, these definitions suggest that personalized learning is centered on making adjustments to students’ individual needs and learning progress, but the specific ways in which this is practiced vary widely. From OECD’s emphasis on system-wide reforms to ISTE’s prominence of the role of technology support to INACOL’s and the U.S. Department of Education’s emphasis on student-centered strategies, each definition reflects the understanding and application of personalized learning in different educational contexts. Meanwhile, the Stanford Research Institute’s definition further deepens the concept by advocating for a more integrated and deeper model of personalized education. These different perspectives and foci reveal the diversity of personalized learning in theory and practice, and reflect the continuing exploration and development of this innovative concept in the field of education.

In summary, although the definitions of personalized learning vary across organizations and contexts, they all point to one core goal: to meet the unique abilities, knowledge levels, and learning needs of each student through customized teaching strategies and approaches. As the definitions continue to evolve, personalized learning focuses more on active student engagement and the effective use of technology, reflecting a gradual shift in the educational model away from a single instructional approach to one that is diverse and flexible.

2.2 Goals of personalized learning

The goals of personalized learning span various dimensions, including motivation, competence, and achievement, with variations in ultimate objectives across different definitions and methods [17]. Table 1 outlines seven potential goals associated with personalized learning. These goals are based on established educational theories and frameworks that provide the theoretical basis for their formulation and justification.

We can see that the goal of personalized learning is not just a simple adjustment to the individual learning needs of students but is also based on existing theoretical foundations and practical experience, aiming to improve learning outcomes, cultivate intrinsic interest in learning, develop self-directed learning skills and creative thinking, and provide effective empowerment strategies in a variety of areas. The achievement of these goals will help drive the development of personalized learning, providing students with a more personalized and efficient learning experience while improving educational outcomes and laying a theoretical foundation for the full implementation of personalized education.

3 Foundations of educational theory

3.1 Educational theory

Before entering the adaptive personalization stage, numerous educators have proposed many educational theories aimed at shaping and advancing personalized learning. Each theory offers unique insights into the design and implementation of personalized learning and provides important theoretical support for its development. Next, we present an overview of educational theories closely associated with personalized learning, highlighting their contributions to personalized learning and exploring practical applications in educational settings. We organize these theories into three categories: educational philosophy, learning psychology, and sociocultural perspectives.

1) Educational philosophy

● Progressivism [27–29]

– Introduction: Progressivism, a pivotal paradigm in 20th-century Western education, challenged conventional norms. It focused on problem-centered learning to cultivate independent learning, critical thinking, and heightened engagement. Progressivism emphasizes acknowledging individual differences and championing student-centered teaching approaches.

– Contribution: Provides the foundational goal of personalized learning, advocating for education tailored to individual needs.

– Application: Implemented through project-based learning, where students choose projects based on their interests.

2) Learning psychology

● Behaviorist theory [30,31]

– Introduction: Behaviorism, pioneered by psychologist Watson, sees behavior as observable responses to external stimuli. Modified by neo-behaviorists like Tolman, it introduces intervening variables between stimuli and responses. Skinner and followers emphasize reinforcement as the key mechanism, shaping behavior through environmental influences. In essence, behaviorism underscores the importance of observing, strategizing, and controlling learner behavior, as well as shaping behavior through environmental influences.

– Contribution: Informs the design of feedback systems in personalized learning, enabling immediate and tailored feedback to reinforce desired behaviors.

– Critique: Limited by its reliance on external stimuli, necessitating integration with other theories to address internal cognitive processes.

– Application: Applied in adaptive quizzing systems (e.g., Khan Academy), gamified learning platforms (e.g., Duolingo), and Intelligent Tutoring Systems (ITS) that provide personalized feedback based on student responses.

● Constructivism [32–34]

– Introduction: Constructivism, a cognitive psychology branch, emerged from studying children’s cognitive development. It explores principles governing learning processes, emphasizing active learner interaction with the environment. Learning involves participation, critical thinking, hands-on experiences, and the construction of knowledge. The theory recognizes diverse learner backgrounds and asserts that knowledge construction depends on individual differences.

– Contribution: Supports the design of collaborative and exploratory learning environments in personalized learning.

– Application: Implemented through online discussion forums, collaborative projects, and problem-based learning activities.

● Cognitive psychology [32,35,36]

– Introduction: Cognitive theory, rooted in Gestalt psychology, focuses on information processing and mental processes. Like constructivism, it sees learning as an active process involving interactions with the environment and cognitive functions such as perception and memory. Emphasizing problem-solving, it highlights using cognitive structures like schemas for adequate comprehension. The theory acknowledges diverse cognitive styles, influencing learning styles, subject interests, and mastery abilities.

– Contribution: Guides the design of personalized learning paths and strategies, addressing individual cognitive needs.

– Application: Used in adaptive learning systems that adjust content based on learners’ cognitive profiles and performance.

3) Sociocultural perspectives

● Sociocultural theory [33,37]

– Introduction: The theory, proposed by Soviet psychologist Lev Vygotsky, presents a theoretical framework that underscores the pivotal influence of sociocultural factors on human cognitive development. It posits that mental functioning is shaped through cultural mediation, primarily involving language. The theory introduces the Zone of Proximal Development (ZPD), stressing the importance of collaborative learning. Social interactions play a crucial role in individual cognition and learning.

– Contribution: Supports the implementation of collaborative learning activities in personalized learning.

– Application: Implemented through social learning platforms that facilitate peer interaction.

In conclusion, progressivism advocates for fostering autonomous learning, behaviorist theory offers valuable insights into designing effective feedback mechanisms, constructivism underscores the importance of active knowledge construction, cognitive theory focuses on addressing learners’ cognitive needs, and sociocultural theory emphasizes the critical role of collaboration and social context in student development. Despite the inherent limitations of each theoretical framework, personalized learning adopts an integrative approach that synthesizes these perspectives to create a holistic educational paradigm. By leveraging the foundational insights provided by these educational theories, personalized learning enables the design of flexible learning environments tailored to individual differences. The integration of these theoretical perspectives not only deepens the understanding of students’ diverse needs but also drives the development and continuous innovation of personalized learning within the dynamic landscape of modern education.

3.2 Educational theories for technological modeling

Educational theories not only establish the fundamental principles of personalized learning but also provide direct design foundations and interpretive frameworks for subsequent student modeling and recommendation strategies. Specifically, these theories guide the selection of core modeling variables (e.g., prior knowledge, emotional states), inform the underlying design logic of computational frameworks, and ensure the pedagogical validity of recommendation strategies. Table 2 clearly illustrates the supporting relationships between key educational theories and corresponding technical approaches. Overall, these theories function as a critical bridge connecting pedagogical philosophy with technological realization, thereby establishing a robust theoretical foundation for the technical explorations presented in the subsequent sections.

4 Student modeling

Effective personalized learning relies on two fundamental components: student modeling and personalized recommendations. Student modeling aims to construct a comprehensive representation of the learner by capturing both cognitive and non-cognitive factors, providing a foundation for adaptive learning. Cognitive modeling focuses on inferring students’ knowledge states through techniques like cognitive diagnosis, enabling adaptive instructional content for deeper comprehension. In contrast, non-cognitive modeling considers learning styles, behaviors, and emotional states to provide a broader perspective on individual learning needs. By integrating these dimensions, student modeling facilitates a nuanced learner profile that informs personalized recommendations. Based on student modeling, personalized recommendations improve learning outcomes by selecting appropriate resources, exercises, and feedback strategies based on inferred learner characteristics. This synergy between modeling and recommendation enhances learning outcomes by ensuring that instructional content is tailored to student’s unique skills and challenges.

The following sections will provide an in-depth analysis and summary of the relevant literature on cognitive modeling, non-cognitive modeling, and personalized recommendations, exploring the advantages, challenges, and interactions among these methods, as well as their synergistic effects.

4.1 Cognitive modeling

Cognitive modeling refers to the use of mathematical, computational, or cognitive theory methods to simulate an individual’s cognitive processes in order to understand and predict his or her learning, reasoning, and other behaviors [38,39]. In the field of personalized learning, cognitive modeling is mainly used to portray students’ knowledge states, learning behaviors, and ability levels to support personalized learning recommendations and instructional interventions. Specifically, cognitive modeling focuses on key factors such as students’ knowledge acquisition, academic performance, and cognitive abilities, which are essential for accurately assessing students’ learning progress and optimizing instructional strategies. Therefore, this chapter will systematically explore the research progress of cognitive modeling around two core aspects: cognitive diagnosis and student performance prediction.

4.1.1 Cognitive diagnosis

Rooted in constructivism and cognitive psychology, cognitive diagnosis conceptualizes learning as an active process in which individuals construct and refine internal knowledge representations. These theories posit that knowledge mastery can be decomposed into latent cognitive attributes, with observable responses serving as indirect evidence of their mastery levels. Building on this theoretical foundation, cognitive diagnosis models (e.g., Item Response Theory (IRT), Deterministic Inputs, Noisy “And” gate model (DINA)) operationalize the estimation of these latent abilities by linking response patterns to underlying cognitive skills.

As a core component of personalized learning, cognitive diagnosis provides fine-grained assessments of students’ knowledge states. The diagnostic results provide educators with valuable insights to customize downstream personalized learning materials and assignments, including course or question recommendations. In this section, we will delve into cognitive diagnosis for personalized learning, covering its task formulation, background, and a literature review of recent valuable works through a fine-grained taxonomy.

Task formulation. Advancements in information technology provide educators with abundant student learning data for cognitive diagnosis, aiming to estimate proficiency in specific knowledge concepts. As shown in Fig. 4, this task aims to estimate students’ proficiency in a specific knowledge concept by giving them a series of test questions with their responses. Consider a student set

U

and a test question set

P

. Let

r i u = (q i u, a i u)

represent the

i

-th response of student

u ∈ U

. This response comprises the answered question

q i u

and binary correctness indicator

a i u ∈ {0, 1}

, where

a i u = 1

denotes a correct response. Given the historical responses of a student as a set

H u = {r 1 u, r 2 u, …, r | H u | u}

, the Cognitive Diagnosis Model (CDM) captures their latent proficiency

θ u

regarding the target concept. CDMs are designed to estimate the probability of correct responses (

p (a i u = 1 | θ u, q i u)

) without explicit proficiency annotations, learned by maximizing the likelihood of observed responses:

(1)

max ∏ u ∈ U ∏ r i u ∈ H u p (a i u = 1 | θ u, q i u) .

With the proliferation of personalized learning data, the Cognitive Diagnosis (CD) task has evolved to assess proficiency across multiple knowledge concepts. This multi-dimensional scenario introduces a core challenge: how to formally represent the relationship between test questions and the multiple concepts they assess. This challenge is addressed by a fundamental construct in CDMs known as the Q-matrix. The Q-matrix, denoted as

Q ∈ R | P | × | C |

where

C

is the set of knowledge concepts, is a binary matrix that encodes the designer’s hypothesis about which concepts are required to solve each question. Its element

Q p c = 1

if question

p

requires concept

c

, and

0

otherwise. The critical assumption here is that the Q-matrix accurately reflects the cognitive attributes necessary for solving each item. Given the Q-matrix, a student’s proficiency is represented as a multi-dimensional vector

θ u ∈ R | C |

. The CDM’s objective is then to estimate

θ u

by modeling the probability of a correct response conditioned not only on the student’s proficiency and the question but also on the question’s required concepts specified by

Q

. Thus, we optimize the following revised objective:

(2)

max ∏ u ∈ U ∏ r i u ∈ H u p (a i u = 1 | θ u, q i u, Q) .

Background. The development of the cognitive diagnostic model is shown in Fig. 5. As a means of assessing psychological attributes, researchers historically employed psychometric methods for the CD task. One prevalent method, Classical Test Theory (CTT) [40], posits that an observed score

X

comprises a true score

T

(representing knowledge mastery, i.e.,

θ

) and random error

E

(3)

X = T + E .

Despite its simplicity, CTT has limitations; for instance, it does not explicitly model key question attributes like difficulty, nor does it naturally handle multiple proficiency dimensions. Item Response Theory (IRT) [41] provides a more powerful framework by directly modeling the functional relationship between a student’s single latent proficiency

θ u

and their probability of correctly answering a question

q i u

(4)

p (a i u = 1 | θ u, q i u) = c i + 1 − c i 1 + e − a i (θ u − b i),

where

c i

a i

, and

b i

are the item parameters representing the question’s guessability, discrimination, and difficulty, respectively. The foundational IRT model was later extended to multidimensional settings. Multidimensional Item Response Theory (MIRT) [42] models proficiency as a vector

θ u

and defines the response probability as a function of a linear combination of the proficiencies, often as

p (a i u = 1 | θ u, q i u) = f (a i, b i, θ u, w i)

, where

w i

is a vector weighting the relevance of each concept to question

i

. The Q-matrix can be viewed as a logical and binary precursor or special case of these weight vectors. Building on MIRT, more complex models, such as the Graded Response Model (GRM), were developed to handle polytomous (multi-level) responses.

Later, as neural network technology gained prominence, the effectiveness of CD tasks in predicting student outcomes achieved notable levels. Nevertheless, this progress also resulted in a reduction in interpretability [43]. Researchers are currently concentrating on the challenge of preserving high accuracy in forecasting student performance while simultaneously enhancing the interpretability of the models.

Taxonomy. In existing research, scholars have not reached a complete consensus on the classification of cognitive diagnosis models. Early reviews classified models from psychometric and cognitive-theoretical perspectives, emphasizing underlying cognitive assumptions and parameter interpretability [88]. More recently, Liu et al. [89] distinguished probabilistic cognitive diagnosis models from those based on deep learning, while Wang et al. [90] proposed a taxonomy encompassing both psychometric and machine learning approaches. This evolution from traditional methods to machine learning and then to deep learning highlights the technological progression of cognitive diagnosis methodologies and the field’s broader shift from theory-driven to data-driven paradigms. By drawing on the general taxonomy of educational technology research methodologies (data-driven and theory-driven approaches), a complementary perspective is proposed [91]. Representative cognitive diagnosis studies are categorized into theory-driven, data-driven, and integration methods, as illustrated in Fig. 6. Theory-driven approaches rely on cognitive psychology or educational theory to ensure the theoretical interpretability of students’ knowledge states, whereas data-driven approaches leverage large-scale analytics and machine learning to infer learning capabilities from interaction data. Integration methods integrate both perspectives, for instance, by embedding cognitive theory as structural constraints within data-driven architectures. Compared with previous classification systems, the proposed framework emphasizes the interaction between model-driven mechanisms and cognitive interpretability. It retains the theoretical coherence of traditional classifications while reflecting emerging trends that integrate deep learning with cognitive modeling, thereby providing a unified and systematic perspective on the evolution of cognitive diagnostic models. Table S1 summarizes representative studies under this framework, and the following is a detailed description of the listed research work.

1) Pedagogical theory driven methods The pedagogical theory-driven methods in cognitive diagnosis integrate educational theories and psychometric principles to model students’ cognitive states, focusing on teaching and learning processes, instructional design, and cognitive assessment. By leveraging well-established theories, this approach enhances both the accuracy and interpretability of cognitive diagnosis models. A key aspect of this paradigm is the estimation of students’ overall abilities and cognitive profiles based on their interactions with learning materials. For instance, the monotonicity assumption, a fundamental principle in instructional theory, posits that as a student’s mastery of a knowledge concept improves, their probability of correctly answering related questions should increase. This assumption underpins classic cognitive diagnosis frameworks such as the DINA [49] and IRT [41], ensuring that cognitive assessments align with established learning progression theories.

Beyond the monotonicity assumption, other pedagogical theories have been incorporated into cognitive diagnosis to refine knowledge assessment. For instance, Ma et al. [45] applied the Neutral Set (NS) theory, which comprehensively assesses students’ cognitive states by categorizing their understanding into three dimensions: understanding, misunderstanding, and uncertainty. Liu et al. [44] combined fuzzy set theory and educational assumptions to model test takers and introduced sliding and guessing factors into the mix. Further extending theory-driven approaches, Zhou et al. [46] proposed a situationally aware cognitive diagnostic framework, which integrates educational context characteristics into the diagnostic process. Recently, Wang et al. [48] focused on the underexplored issue of uncertainty in cognitive diagnosis and proposed a novel parameterization technique based on traditional psychometrics. This technique can be adapted to various cognitive diagnostic models across different domains, enhancing their robustness and applicability. Meanwhile, Guo et al. [47] also addressed uncertainty in cognitive diagnosis and further introduced an enhanced framework to tackle two major limitations: single-dimensional modeling and data sparsity. They developed a multidimensional latent cognitive diagnosis model that integrates Rough Concept Analysis (RCA) and IRT. This model effectively captures multiple cognitive dimensions simultaneously, enabling a more accurate and comprehensive assessment of students’ cognitive states.

2) Data-driven methods The rise of information technology has provided a wealth of data for cognitive diagnosis. Initiatives are examining the challenges in cognitive diagnostic scenarios related to data and proposing effective data-driven approaches. These studies, which rely on large data sets, use techniques such as machine learning and data mining to extract patterns from learner behavior, performance, and responses to facilitate cognitive diagnosis. As shown in Fig. 6, we have a rough categorization of existing work.

Deep-learning methods. Deep learning has emerged as a prominent and rapidly evolving research direction in cognitive diagnosis, offering powerful data processing capabilities and flexible network architectures. This section categorizes existing deep learning-based approaches based on model architectures and problem-solving strategies, providing a systematic review of recent advancements.

Data sparsity and the cold-start problem remain critical challenges in cognitive diagnosis. To address these issues, recent studies have explored various strategies. For instance, EIRS [58] and CMES [65] improved diagnostic accuracy by designing rational sampling strategies and incorporating un-interacted exercises as auxiliary training signals. Gao et al. [64] introduced the DZCD framework, which employs a dual regularizer to train the diagnostic model and categorizes student states into shared and specific components to enhance prior knowledge utilization. Additionally, Liu et al. [68] developed an inductive cognitive diagnostic model that significantly enhances cold-start performance by constructing a bipartite graph, modeling student proficiency using a Graph Convolutional Network (GCN), and inferring knowledge states through neighborhood information aggregation.

Beyond individual cognitive diagnosis, group-level cognitive diagnosis has gained growing research attention. RDGT [67] integrates transformer-enhanced Graph Neural Networks (GNNs) to effectively capture both inter and intra-group relationships. DGCD [69] introduces adaptive denoising and entropy-weighted balancing to refine information aggregation within group-student-exercise structures. Meanwhile, HomoGCD [61] employs a multi-granularity framework to jointly model cognitive states at both individual and group levels, enhancing diagnostic stability.

Beyond group-level modeling, substantial progress has been made in utilizing GNNs to capture complex relationships among students, exercises, and knowledge concepts, enhancing cognitive diagnosis [50]. Existing studies primarily focus on modeling these intricate relationships to gain a more comprehensive understanding of students’ cognitive states [51,52,54–57]. These approaches incorporate key factors such as knowledge structure, skill levels, and prerequisite dependencies to refine cognitive modeling. In addition to advancements in cognitive modeling, interpretability has become a crucial research focus in cognitive diagnosis. Shen et al. [62] proposed a symbolic tree-based approach that alternates between optimal symbolic representations and parameter mixing, effectively bridging the gap between discrete symbolic representations and continuous parameter optimization while improving both generalization and interpretability. Similarly, Huang et al. [63] introduced a multi-dimensional feature extraction framework that integrates a multi-channel attention mechanism to capture complex student-exercise interactions, enhancing both model expressiveness and interpretability.

Non-deep-learning methods. A variety of non-deep learning approaches have been explored in cognitive diagnosis, leveraging statistical modeling [74,75], causal inference [73,77], and optimization techniques [71] to accurately assess students’ knowledge states. These methods aim to enhance diagnostic accuracy by explicitly modeling knowledge structures, incorporating prior knowledge, and improving assessment efficiency. One major line of research focuses on causal inference to refine cognitive diagnosis. Liu et al. [73] incorporates Structural Causal Models (SCM) to capture causal relationships among students’ mastery levels, enhancing the Q-matrix with an artificial Q-matrix as a prior. This enables a comprehensive assessment of students’ abilities by inferring relationships between exercises and explicit as well as latent knowledge attributes. Causal reasoning was further extended through the proposal of a path-specific causal inference framework for fairness-aware cognitive diagnosis [77].

Another critical research direction involves Bayesian hierarchical modeling. Building on the Bayesian Network, prior knowledge and uncertainty are incorporated into Bayesian hierarchical modeling [74]. Similarly, the Bayesian Network-based Hierarchical Cognitive Diagnostic Model (HierCDF) [75] fuses traditional models (IRT, MIRT, Matrix Factorization [MF]) to capture students’ cognitive states.

In addition to probabilistic modeling, optimization-based approaches have been explored for improving diagnostic efficiency. An adaptive ant colony optimization algorithm (ACO-TC) to construct cognitive diagnostic tests [71]. This method optimizes the selection of test items using historical pheromone-based search strategies and heuristic information derived from item discrimination indices.

3) Integration methods Despite significant progress in both theory-driven and data-driven cognitive diagnosis, balancing interpretability and predictive accuracy remains a challenge. Integrating theoretical insights with data-driven modeling offers a promising direction to enhance diagnostic effectiveness by leveraging both interpretability and flexibility. A notable early effort in hybrid modeling is Deep Learning Enhanced Item Response Theory (DIRT) [78], which extends traditional IRT by incorporating deep learning techniques. Building on this foundation, NeuralCDM [80] introduces a more generalized framework that integrates neural networks to model complex student-exercise interactions while maintaining interpretability by applying monotonicity constraints. The article was later extended by introducing variations based on content awareness and knowledge associations, focusing on the relationships between knowledge concepts [81]. Extending neural cognitive diagnosis to group-level assessments, MGCD [79] employs a multi-task learning framework to jointly model individual and group-level cognitive states. Beyond accuracy and interpretability, fairness is a critical concern in cognitive diagnosis. FairCD [83] addresses fairness issues by decomposing student abilities into bias-sensitive and fair components, using adversarial learning to mitigate the influence of sensitive attributes (e.g., socioeconomic status) on assessments. Additionally, affective factors have been integrated into cognitive diagnosis, further expanding the scope of hybrid models [84].

This chapter has provided a comprehensive overview of cognitive diagnosis. Our analysis reveals a predominant focus on data-driven methods to enhance accuracy, yet the educational implications of all approaches are significant. Pedagogically, theory-driven methods are fundamentally centered on interpretability and trustworthiness, which are crucial for informing instructional decisions. For instance, the DINA model’s explicit parameterization of guessing and slipping probabilities helps teachers determine whether errors stem from fundamental misconceptions or accidental slips, thereby enabling targeted feedback. Empirical studies demonstrate the practical impact of such approaches. Huang et al. [92] applied a DINA-based cognitive diagnosis in a high-school physics course on electromagnetic induction and designed multi-level remedial instruction based on the results. Their study confirmed significantly greater learning gains in the experimental group compared to the control. From a long-term perspective, assessments based on IRT or DINA can reveal individual knowledge structures and uncover specific conceptual gaps masked by aggregate scores, supporting differentiated instruction and mastery learning [93]. Such diagnostic insights enable educators to design targeted exercises, ensuring students master core proficiency before tackling more complex topics.

In contrast, data-driven methods derive pedagogical value from uncovering complex learning patterns in large-scale behavioral data, thus enabling fine-grained diagnosis and personalized support. For instance, GNN-based models analyze intricate relational networks among students, exercises and knowledge concepts to identify latent associations (e.g., between mastery of Concept A and common errors in Concept B) [50]. These insights lay an empirical foundation for instructional refinement. Models such as EIRS [58], which mitigate data sparsity, further boost student engagement and motivation by enabling reliable diagnosis for learners with sparse interaction records.

Integration-based approaches mark the maturation of cognitive diagnosis research by balancing predictive accuracy with interpretability and fairness, thereby enabling more trustworthy and comprehensive educational AI systems. NeuralCDM [80], for instance, leverages the expressive power of neural networks while incorporating monotonicity constraints to uphold fundamental educational principles. This enhances the trustworthiness of the method among both teachers and students. Meanwhile, FairCD [83] addresses fairness concerns to ensure that assessments are not biased by sensitive attributes, thereby fostering educational equity and enhancing learning motivation among disadvantaged student groups. Furthermore, recent models that incorporate affective factors extend beyond diagnosing what students know to understanding how they feel, paving the way for genuinely individualized and affect-sensitive learning environments [84].

In summary, although current research remains largely dominated by data-driven paradigms, practical educational applications demand equal attention to feasibility, transparency, and interpretability. Future work should focus on improving the interpretability of deep learning models, mitigating data sparsity, and ensuring the fairness and robustness of cognitive diagnoses. Furthermore, the deeper integration of domain knowledge and educational theories, the application of causal inference for informed decision-making, and the development of adaptive models capable of generalizing across diverse learning environments will be essential to advancing the field.

4.1.2 Student performance prediction (Cognitive)

Grounded in constructivism and cognitive psychology, cognitive student performance prediction models conceptualize learning as an active process of information acquisition, organization, and schema adaptation. Within this theoretical framework, student performance is not merely a behavioral outcome but a reflection of underlying cognitive processes such as attention, memory, and comprehension. Consequently, predictive models aim to infer latent cognitive states from observable learning interactions, aligning model representations with the cognitive mechanisms articulated in these theories. Student Performance Prediction (SPP) has received considerable attention due to its critical role in educational research and practice. Accurate performance prediction enhances learning outcomes by tailoring instructional strategies to individual needs, addressing weaknesses, and fostering potential abilities. Moreover, these predictions enable early identification of at-risk students, allowing timely interventions to prevent dropouts. Building upon a comprehensive literature review, existing SPP studies can be broadly categorized into two dimensions: cognitive [86,87] and non-cognitive [94–96]. This review systematically examines related studies from these two dimensions, providing a detailed analysis of representative work within each category. This section focuses on cognitive modeling approaches, which aim to capture students’ knowledge states and cognitive processes through the analysis of assessment results, exercise interaction data, and learning trajectories. Non-cognitive modeling approaches, which account for behavioral and emotional factors, are discussed in detail in Section 4.2.4.

Cognitive modeling leverages CDMs to estimate students’ mastery of various knowledge components, enabling the prediction of their future academic performance. A significant advancement in this area is the PCDF model [85], which enhances CDMs by incorporating the Cumulative Category Response Function (CCRF) to process multistage rating data, improving compatibility with binary CDMs while preserving interpretability. Building on this, GLNC [86] integrates cognitive diagnosis and knowledge tracing, adaptively combining local (current state) and global (learning trajectory) information for improved performance prediction. This fusion approach provides a new way of thinking about student performance prediction. Similarly, Ma et al. [45] introduced the innovative concept of the Neutral Set (NS), which combines cognitive diagnostics, NS similarity measures, and collaborative filtering with probability matrix decomposition. This method comprehensively assesses students’ cognitive states from three dimensions, enabling accurate prediction of future test scores in personalized e-learning environments. While these models significantly advance cognitive modeling, they primarily focus on students’ ability levels and overlook the influence of other factors. To address this limitation, the MvRCF model [87] introduces a compensatory mechanism that integrates the student’s ability profile and effort profile to further enhance the comprehensiveness of the prediction.

In summary, cognitive modeling approaches have achieved a more comprehensive understanding of student performance and have significantly advanced the field of student performance prediction. Beyond algorithmic progress, these methods hold profound educational significance. Accurate performance prediction enables the early identification of learning difficulties, facilitating timely feedback and targeted interventions that sustain engagement and confidence [97]. Adaptive models that dynamically update students’ ability profiles can further enhance motivation by offering personalized and attainable learning trajectories. Moreover, incorporating cognitive and emotional cues from multimodal data fosters more supportive and responsive learning experiences [98]. In the long term, precise and adaptive prediction establishes the foundation for continuous, individualized learning cycles within students’ ZPD. Future research should therefore not only pursue methodological innovation but also emphasize how predictive models can be better integrated into teaching practice to promote equity, sustained motivation, and long-term learning success.

4.2 Non-cognitive modeling

Non-cognitive modeling aims to predict or optimize students’ learning behaviors, academic performance, and overall learning experience by systematically collecting, analyzing, and modeling various data generated throughout the learning process. This type of modeling focuses on factors such as learning styles, emotions, motivation, and behavioral patterns, which play a crucial role in the learning process. Unlike traditional cognitive modeling, which primarily focuses on students’ knowledge mastery and cognitive abilities, non-cognitive modeling emphasizes exploring non-intellectual factors that influence the learning process. It deepens understanding through a detailed analysis of students’ learning states and quantitative analysis of the learning process. This study provides a comprehensive exploration of four main aspects of non-cognitive modeling: learning styles analysis, student sentiment analysis, student behavior analysis, and student performance prediction.

4.2.1 Learning style analysis

Learning style analysis systematically examines and categorizes learning styles to gain deeper insights into individual learner differences. A systematic analysis of learning styles enables personalized learning systems to precisely adjust teaching methods and provide customized resources, thereby enhancing knowledge mastery and learning efficiency. Building on this insight, the system can personalize teaching strategies and content based on learners’ cognitive preferences and learning tendencies, ultimately optimizing the learning experience.

Early reviews primarily focused on the diversity of learning style theories and the reliability and validity of their measurement instruments [120,121]. More recent reviews have shifted toward data-analytic perspectives, highlighting the potential of data-driven methods for learning style identification [122]. Existing studies typically analyze learning styles by mapping learners’ behaviors, preferences, or cognitive characteristics onto predefined theoretical frameworks. Accordingly, the present study adopts a classification framework grounded in learning style theories themselves, emphasizing the applicability of different theoretical models within adaptive learning contexts. This perspective places greater emphasis on the adaptability and application trends of different learning style theories in model construction and personalized design, rather than being limited to validity verification or data method analysis. As shown in Fig. 7, to systematically synthesize the literature, studies are categorized according to the learning style theories they adopt. The Felder-Silverman Learning Style Model (FSLSM) is the most widely used, followed by the VARK model [114], known for its simplicity. Although Gardner’s Multiple Intelligences (MI) Theory and Kolb’s Learning Style Model are less frequently applied, they are still explored in some studies and are therefore included in this discussion. We detail the research work based on different learning style theories in the following section, with Table S2 summarizing their core characteristics for reference.

1) FSLSM Traditional learning style assessment relied on questionnaires, but their static nature and potential errors have driven a shift toward machine learning and data mining techniques. Early studies focused on fusing clustering techniques and machine learning to infer learning styles. For instance, mining techniques paired with the K-means algorithm captured learner behaviors [103], while an enhanced K-means method, revising centroids via learning style vectors, was designed for more effective style determination [105].

Building on these foundational approaches, subsequent research has expanded the application of machine learning (ML) techniques to learning style classification. Studies such as [102,106,117] employed data mining methods in conjunction with classical classification algorithms, including Decision Tree (DT), Random Forest (RF), Naïve Bayes (NB), and Support Vector Machine (SVM), to automate the identification of individual learning styles. Deep learning (DL) methods have also been explored in this context. For instance, Anantharaman et al. [99] integrated Convolutional Neural Networks (CNNs), Random Forests, and Long Short-Term Memory (LSTM) networks for learning style recognition. Additionally, Kolekar et al. [108] used the Fuzzy Mean algorithm for FSLSM clustering and presents the Gravitational Search-Based Backpropagation Neural Network for classification.

Recent studies have enhanced FSLSM-based learning style analysis with dynamic and adaptive approaches. Muhammad et al. [111] employed a bipartite graph with LSTM autoencoders and K-means clustering to capture evolving student-resource interactions. In MOOCs, Jebbari et al. [112] leveraged machine learning models, including neural networks and RF, for large-scale learning style classification. Ayyoub et al. [113] introduced a semi-supervised self-training SVM that iteratively expands its labeled dataset, improving classification under data scarcity. Beyond mere classification, the effectiveness of FSLSM-based instructional adaptations in improving learning outcomes has been evaluated [110].

2) VARK In the literature, Kuttattu et al. [114] employed K-means, SVM, and DT methods to correlate learning styles with VARK styles. The approach utilizes SVM for individual learning style predictions and decision trees for predicting combinations of learning styles. Nguyen et al. [115] extended this work by integrating machine-learning-based VARK style recognition as an automated plugin into Moodle and generating responsive, personalized learning paths from predicted learning styles. Abomelha et al. [116] developed an adaptive e-learning system that classifies students via VARK questionnaires and uses predictive analytics to recommend personalized learning resources.

3) MI theory Research that integrates Multiple Intelligences Theory into learning style modeling remains limited. A notable study addresses this gap by proposing a neural network to identify learning styles based on both the FSLSM and MI Theory [117]. The study reveals that the accuracy of the approach varies across different dimensions of the combined theoretical framework.

4) Kolb The SRGSML [118], a two-layer ensemble model for the recognition of Kolb learning style that uses resampling and hybrid labeling (rule-based classification + K-means) to address class imbalance and subjective bias. Hidalgo et al. [119] mapped Kolb’s learning styles to software development roles, highlighting their relevance in engineering education.

Learning style analysis is a critical component of personalized learning and has received increasing attention in recent research. Machine learning and deep learning have significantly enhanced the precision and scalability of learning style identification, while dynamic and semi-supervised approaches have improved model adaptability in evolving learning contexts. Beyond these methodological advances, the educational implications are substantial. For example, adaptive models that capture evolving styles ensure the learning experience remains aligned with the learner’s developing profile, which is crucial for sustaining long-term engagement [111]. By identifying and responding to individual preferences, such as providing infographics for visual learners and designing interactive simulations for kinesthetic learners, we can create more intuitive learning environments. In a quasi-experimental study of an adaptive e-learning system, the experimental group exhibited significantly higher levels of engagement across behavioral, cognitive, and affective dimensions than the control group [123]. In the long term, effective learning style analysis is profoundly significant for cultivating self-directed learning capabilities. When learners consistently engage with materials tailored to their unique traits, they not only grasp complex concepts more readily but also develop metacognitive awareness of their own learning processes. Systematic reviews corroborate that responsiveness to learning-style differences enhances both system adaptability and the overall user experience [124]. Ultimately, aligning instructional strategies with students’ preferred learning patterns transforms the long-standing ideal of teaching in accordance with individual aptitude into evidence-based practice.

Despite these advances and the broad application prospects, certain challenges remain. Many studies assume that learning styles are static, overlooking their potential evolution over time due to contextual and experiential factors. While machine learning has enhanced classification accuracy, most studies focus on testing, comparing, or combining existing models rather than developing fundamentally novel approaches. Moreover, research often prioritizes algorithmic improvements over theoretical rigor and lacks interdisciplinary perspectives and age-diverse samples. Future research should focus on developing more comprehensive learning style prediction models by integrating contextual factors and reinforcing theoretical foundations, thereby enhancing the validity and applicability of learning style analysis in personalized learning environments.

4.2.2 Student sentiment analysis

As an essential component of non-cognitive modeling, sentiment analysis captures and interprets learners’ emotional states, providing crucial insights for precisely adjusting instructional strategies and optimizing learning experiences. Sentimental factors such as interest, motivation, and frustration significantly influence learners’ learning processes and outcomes. This section systematically reviews 27 relevant studies (as shown in Fig. 8) and introduces a dual-dimensional classification framework encompassing both technological approaches and application domains. The technical dimension builds upon prior reviews [152] by grouping studies according to their underlying computational paradigms, ranging from traditional machine learning techniques to recent deep learning–based sentiment recognition approaches, thus ensuring both technological progression and methodological continuity. The application dimension categorizes studies by their pedagogical contexts, such as classroom interaction and online learning, aligning emotional modeling techniques with their educational functions. The aim is to establish a comprehensive research framework to reveal the latest advancements and future directions in the field of sentiment analysis.

Table S3 shows a list of studies related to sentiment analysis. The following is a detailed description of the research work in this table.

1) Technical methodology We have broken down the references in this section into three main categories: traditional machine learning methods, deep learning methods, and studies combining deep learning with other techniques.

● Traditional machine learning. Traditional machine learning methods have been widely applied in sentiment analysis [125–131]. These studies employ various supervised algorithms, including SVM, NB, logistic regression, and RF [125,126], often integrating them with multifactor analytics to achieve a comprehensive understanding of student sentiment. Technical approaches have included integrating NB and SVM with Hadoop to improve processing efficiency [128], and employing particle swarm optimization for feature selection to enhance emotion recognition accuracy [129]. Furthermore, the analysis of qualitative student feedback has utilized multiple classifiers, including NB, SVM, J48 decision trees, and RF, to predict classroom labels and inform educational strategies [131]. Collectively, these studies highlight the effectiveness and versatility of traditional machine learning methods in student sentiment analysis, showcasing their potential to provide deeper insights into educational experiences.

● Deep learning. In recent years, deep learning methods have been widely used in student sentiment analysis [133–136,139] by employing CNN [132,135], LSTM and its variants [134,137,139], and Bidirectional Encoder Representation from Transformers (BERT) [133] and other models, which effectively enhance the accuracy and robustness of sentiment analysis. For example, Yu et al. [135] combined SVM with CNN to recognize the sentiment information in students’ self-evaluations, thus improving the accuracy of students’ performance prediction. The LSTM model and its variants perform well in sentiment polarity determination and deep-level feature extraction, which effectively alleviates the uncertainty problem in sentiment analysis [134,139]. In particular, Shaik et al. [137] used a Bi-directional Long Short-Term Memory (Bi-LSTM) deep learning model to construct a multi-label classification system for the higher education domain to assist in educational decision-making and course optimization.

● Hybrid and advanced techniques. Beyond traditional and deep learning methods, hybrid models and advanced techniques have been explored to improve sentiment analysis accuracy. This includes integrating deep learning with Bayesian Neural Networks [139], capsule networks, and attention mechanisms [138]. Models like BERT-LSTM-CNN leverage multiple architectures for improved performance [141]. Ashwin et al. [142] addressed bias in sentiment analysis by combining the High-Speed Sentiment Recognition Library (HSEmotion) with Multi-task Cascaded Convolutional Networks (MTCNN) to promote equitable educational practices. In addition, Shaikh et al. [143] investigated Large Language Model (LLM) applications in sentiment analysis, using ChatGPT-3.5 to directly generate category labels and comparing its performance with LSTM- and Transformer-based models. Their findings highlighted LLMs’ considerable potential to boost the accuracy and efficiency of student sentiment analysis. Other studies explored the combination of multiple techniques [139,140,144] and the use of BERT-based fusion models to capture contextual nuances and elevate prediction accuracy [133,138]. These innovations underscore the value of advanced integration strategies for optimizing sentiment analysis algorithms.

Categorizing existing studies by research methodology reveals the relative strengths of each technological approach: traditional machine learning serves as the foundational baseline, deep learning enhances the modeling of complex sentiment patterns, and the integration of advanced techniques further improves the overall performance of sentiment analysis algorithms.

2) Application domain A review of existing literature reveals a prominent focus on applying sentiment analysis in educational settings, reflecting the diversity of learning environments and instructional methods. These studies primarily explore its usage in conventional educational environments and MOOCs/Online learning.

● Traditional educational environments. Within this context, sentiment analysis is applied to various multi-channel data sources, including course information [134,135,146], tweets [148], forums, and teacher evaluation websites [145]. Sentiment analysis methods for teaching evaluation examine teaching process feedback, providing valuable guidance to optimize teaching methodologies [131,146,147]. In social media, Tubishat et al. [148] explored ChatGPT’s educational applications, using a tweet sentiment analysis model to identify dominant sentiments and opinions toward ChatGPT.

● MOOCs/Online learning. The evolving landscape of learning styles has led to an increased focus on sentiment analysis in online learning environments [126,132,133,140,150]. These studies leverage user-generated data to identify factors influencing learner satisfaction and evaluate overall perceptions of MOOCs [126,132,133,141,151]. For instance, forum activities provide insights into learners’ social, emotional, and skills (3S) dimensions, as demonstrated by the 3S learning analytics approach proposed in [149], which introduces the visualization tool LAT

∃

S for sentiment analysis through forum comments extraction. Natural Language Processing (NLP) and sentiment analysis techniques have been applied to Coursera reviews to explore key factors influencing MOOC success [150].

A comprehensive review of the existing literature on student sentiment analysis reveals a clear evolution from traditional machine learning to deep learning and hybrid approaches, greatly improving the ability to capture complex affective patterns. These technical advancements enhance not only analytical accuracy and scalability but also hold profound implications for educational practice. By automatically detecting student emotions (such as confusion, frustration, or interest) from forum posts or feedback texts, these systems facilitate a shift away from a one-size-fits-all approach toward a more responsive, student-centered paradigm. For instance, when widespread confusion is detected in MOOC discussion forums, instructors can intervene promptly with targeted clarifications [142], thereby sustaining engagement and preventing learner dropout. In the long term, integrating affective insights into personalized learning systems facilitates the early identification of and proactive response to negative states such as anxiety and burnout [142]. For example, systems can provide personalized encouragement or adapt task difficulty, thereby helping students navigate learning challenges while maintaining their self-efficacy. Notwithstanding the noteworthy advancements, several challenges persist, including the reliance on manual annotation for sentiment labels, the inadequate exploration of domain generalization, and the challenges in managing unstructured text complexities. Future research endeavors should prioritize the development of adaptive and generalized sentiment analysis models, the incorporation of unsupervised or weakly supervised learning, and the exploration of affective dynamics across diverse educational contexts.

4.2.3 Student behavior analysis

Rooted in Behaviorist Theory, student behavior analysis conceptualizes learning as a process shaped by external stimuli, responses, and reinforcement. This perspective emphasizes observable actions over internal cognition, suggesting that consistent behavioral patterns, such as participation frequency, interaction intensity, and response timing, can serve as reliable indicators of engagement and learning outcomes. In personalized learning, these principles underpin the design of feedback and reinforcement mechanisms, enabling adaptive systems to monitor, interpret, and guide student behaviors through data-driven strategies. Behavioral engagement is thus recognized as a key indicator of student participation and attention, directly influencing academic performance [168]. As a critical component of non-cognitive modeling, student behavior analysis supports performance prediction and personalized recommendations. By analyzing behavioral data (e.g., online activities, learning history, and interaction patterns), this approach provides real-time feedback, dynamically optimizes models, and enhances instructional design and resource allocation to foster personalized learning experiences. As shown in Fig. 9, we systematically categorized the literature into four categories based on the analytical methods employed: clustering algorithms, association rule mining, machine learning, and integrated studies. This taxonomy builds upon prior reviews in educational data mining, which commonly organize studies by analytical techniques [169]. This classification helps to sort out the characteristics and application scenarios of different research methods more clearly, thus providing a structured framework for subsequent literature analysis. The relevant research work is summarized in Table S4.

1) Clustering algorithm Clustering algorithms have been widely used to analyze student behavior and uncover learning patterns. Shen et al. [156] applied K-means to categorize behaviors into study, life, and Internet usage, while Li et al. [155] combined Density-Based Spatial Clustering of Applications with Noise (DBSCAN) and K-means to explore the link between behavior and Grade Point Average (GPA), improving clustering accuracy. Complementing these findings, Delgado et al. [154] adopted a Self-Organizing Map (SOM) to capture diverse behavioral patterns at the Universidad Internacional de La Rioja (UNIR). By considering SOM distances in two phases, their method offers a more granular depiction of students’ online learning activities. Recent research extends the underlying clustering algorithm by using Latent Dirichlet Allocation (LDA) to mine the learning preferences and underlying logical relationships of student groups [157]. It is also shown that the introduction of the LDA model can efficiently extract keywords and fuzzy identify student groups at different levels, providing valuable insights for personalized instructional design.

2) Association rule mining This technique is commonly used to discover correlations among various aspects of student behavior, offering insights into learning influences and outcomes. In typical work, a four-layer association architecture paired with a three-step mining process has been proposed, covering stages from data preprocessing to knowledge acquisition [158]. Similarly, the Eclat association rule algorithm integrated with a clustering algorithm has been employed to scrutinize associations among diverse student data categories sourced from information centers and grades, thereby elucidating the primary factors influencing academic performance [153].

3) Machine learning Machine learning is widely used in student behavior analysis, with classifiers detecting key patterns [156,161]. Enhancing accuracy, Chen et al. [159] integrated forest optimization with NB, DT, and RF in a blended learning model. Li et al. [160] integrated neural networks, NB, and DT algorithms using a Spark-based platform, showcasing the scalability and real-time processing capabilities of distributed computing. Although these traditional machine learning approaches effectively identify behavior patterns, they often overlook the temporal dynamics of learning processes. To address this, recent studies have explored dynamic behavioral modeling to capture temporal learning patterns. For example, Villalobos et al. [162] used sequence analysis and temporal dynamics metrics to characterize individual learning behaviors in Blended Learning (BL) environments. By applying Inverse Probability Weighting (IPW), the study also evaluates the causal impact of learning dashboards on student behavior and achievement, offering new insights into personalized interventions.

4) Integrated study Integrated studies combine multiple analytical methods to provide a more comprehensive understanding of student behavior. Early implementations in big data environments utilized frameworks like Hadoop MapReduce alongside statistical and association rule techniques for in-depth analysis [164]. More recently, the rise of Large Language Models (LLMs) has enabled new integrative frameworks. For instance, end-to-end approaches have been proposed that integrate temporal motion detection with LLMs to analyze classroom videos and automatically generate detailed behavior reports [167]. Such work exemplifies the growing influence of LLMs in augmenting AI-assisted educational assessment through the detailed capture and interpretation of behavioral sequences.

4.2.4 Student performance prediction (Non-Cognitive)

Student behavior analysis employs clustering algorithms, association rule mining, machine learning, and integrated studies to gain comprehensive insights into learning patterns. Clustering algorithms identify commonalities and disparities, while association rule mining uncovers connections between behavior and academic achievement. Machine learning methods enhance prediction accuracy and identify complex patterns. Comprehensive research combining multiple approaches and the utilization of LLMs has come to the fore in learning analytics, providing richer educational assessments. However, although current research integrates machine learning and other techniques for student behavior analysis, it predominantly emphasizes data analysis without effectively translating findings into actionable insights. Additionally, limited integration of psychological and educational theories, inadequate use of multimodal data, and shallow exploration of social factors constrain the depth of these studies. Furthermore, most experiments rely on proprietary datasets that are not publicly accessible due to privacy concerns, hindering reproducibility and slowing the field’s advancement.

The detailed methodology of cognitive modeling is presented in Subsection 4.1.2. This section shifts the focus to non-cognitive modeling in student performance prediction, which explores the influence of behavioral factors, such as internet usage and study time, on academic outcomes. These factors, closely tied to students’ learning habits, engagement, and external environments, capture non-cognitive influences beyond traditional knowledge assessments [95,175,177]. Unlike cognitive modeling, which estimates knowledge states, non-cognitive approaches primarily employ data-driven machine learning techniques to predict academic performance rather than infer knowledge mastery. By integrating these factors, non-cognitive modeling offers insights into their underlying influence on academic success [172]. Such studies provide new perspectives and a theoretical basis for the design of personalized interventions and learning support systems. In order to systematically review the existing work, we summarize the techniques, datasets, evaluation metrics, etc. of the existing work in Table S5. Building on established review frameworks, in Educational Data Mining (EDM) and Learning Analytics (LA) [91,169], this study classifies existing literature into our methodological categories, as illustrated in Fig. 10: machine learning-based models, ensemble learning-based models, deep learning-based models, and other predictive models. This framework encapsulates the methodological evolution of student performance prediction, from traditional machine learning to ensemble and deep learning approaches, consistent with the method-oriented classification schemes adopted in recent reviews, while offering a finer distinction between ensemble and deep learning paradigms.

1) Machine learning based models In student performance prediction, many studies directly apply traditional machine learning algorithms, such as DT, RF, SVM, NB, and logistic regression, to compare their predictive effectiveness [95–97,171], or integrate multiple machine learning methods [174]. These approaches primarily focus on evaluating the accuracy and efficiency of various classifiers without extensive model customization or adaptation to educational contexts. Additionally, some studies explore alternative strategies to enhance prediction reliability. A common approach involves utilizing the K-means algorithm to cluster student performance. The predictive validity of this method is often optimized by determining the k-value through objective quantitative analysis [170]. These methods provide baseline evaluations of machine learning models in educational settings but lack advanced optimization tailored to the unique characteristics of student learning data.

2) Ensemble learning-based models Literature in this category leverages ensemble learning methods to improve the accuracy of student performance predictions. For example, the Student Academic Performance Predicting (SAPP) system [177], the Categorical Boosting (CatBoost) model [94,176,179], and applications using ensemble learning techniques [175,178,180]. The SAPP system [177] integrates a 4-layer LSTM network, RF, and Gradient Boosting (GB) within an ensemble learning framework for academic performance prediction. CatBoost [176] enhances model transparency and accuracy by categorizing reasons for improved performance into low, medium, and high levels. Building on this, C-CatBoost [179] introduces a residual error model to address prediction inaccuracies by supplementing the base model’s outputs, effectively capturing complex data relationships. Similarly, the Feature Importance-Based Multi-Layer CatBoost (DCatBoostF) [94] improves accuracy by progressively introducing features ranked by importance using RF. Xu et al. [178] proposed a stepwise prediction algorithm, integrating learning techniques in a two-layer structure. In contrast, the Performance Factors Analysis (PFA) method employs RF, AdaBoost, and XGBoost to model learners in an e-learning system, effectively improving performance prediction accuracy [175].

3) Deep learning based models Deep learning techniques have been widely used in educational data analytics in recent years due to their superior ability to capture complex data patterns. This section systematically reviews deep learning models, including Deep Neural Networks (DNNs) [181], Hypergraph Neural Networks (HGNN) [183], and other variants [182], for modeling student learning behaviors and predicting academic performance. Giannakas et al. [181] introduced a binary classification deep neural network with two hidden layers, emphasizing deep learning’s crucial role in predicting team performance. FD-HGNN [183] integrates a dual hypergraph neural network with different channels for low-pass and high-pass components and incorporates an attention mechanism for multi-layer feature learning and complex relationship modeling.

4) Others models The rise of LLMs in AI has created new opportunities to enhance student performance prediction through better semantic understanding and contextualization. These approaches leverage the powerful representation capabilities of LLMs to address complex educational challenges, such as cold-start problems and multimodal data integration. For example, LmgMF [184] addresses the cold-start problem by leveraging a large language model (GPT-J and Llama 2) as a multimodal information integrator to generate semantically rich embedding vectors, which are then combined with matrix factorization for enhanced semantic embedding and prediction. Ni et al. [185] integrated LLM-generated semantic embeddings with Signed Graph Neural Networks (SGNNs) to model student response accuracy and enhance noise robustness. Combining the contextual embedding capabilities of LLMs with graph-based structural learning effectively addresses cold-start issues and improves predictive accuracy.

Non-cognitive modeling has emerged as a crucial complement to cognitive approaches in student performance prediction, providing deeper insights into the influence of behavioral, emotional, and social factors on academic outcomes. While traditional machine learning and ensemble techniques have established solid baselines, deep learning has markedly enhanced the ability to capture complex behavioral patterns, and LLMs have further expanded semantic understanding and contextual reasoning. Despite these advancements, persistent challenges include data imbalance, limited model interpretability, and the nascent integration of BERT- and LLM-based architectures. Beyond algorithmic optimization, non-cognitive modeling carries profound implications for educational practice. Predictive models can identify students at risk of academic difficulty at an early stage by analyzing behavioral data. For example, ensemble-based systems like SAPP [177] leverage multi-source data to achieve high-precision predictions, thereby enabling timely, data-driven interventions that help reduce dropout rates. Empirical research on early-warning systems confirms the feasibility of detecting engagement declines and dropout risks from digital footprints such as interaction logs and submission patterns [186]. In the long term, incorporating interpretable non-cognitive indicators into learning analytics dashboards allows educators to provide feedback on learning habits, prompting students to reflect on and refine their strategies [187]. Recent empirical analyzes suggest that predictive models yield greater educational impact when integrated with pedagogically grounded interventions and longitudinal support, rather than functioning as isolated alert systems [188]. The introduction of LLMs further enhances the semantic understanding of students’ behavioral patterns. Through these integrated approaches, students can receive personalized guidance that aligns with their non-cognitive traits, which not only enhances academic achievement but also cultivates the self-directed learning skills essential for navigating future challenges.

Future research should therefore move beyond predictive accuracy to explore how non-cognitive insights can be operationalized into practical teaching interventions. Key directions include multimodal data integration, algorithmic optimization, and improving model adaptability and generalization. Moreover, enhancing interpretability and transparency remains essential for effective deployment in real-world educational contexts. Only by integrating computational modeling with teaching practices can we fully realize the educational value of predicting noncognitive performance.

5 Personalized recommendation

Personalized learning recommendations encompass optimizing learning routes, courses, and exercises for individual students. Learning paths aim to minimize costs and achieve learning goals, personalized course recommendations provide suitable courses, and personalized exercise recommendations target knowledge gaps. Figure 11 illustrates three use cases of personalized learning recommendations for English proficiency.

Initially, to enhance the learner’s writing skills, the system provides a learning path: “Vocabulary → Grammar → Imitation,” forming a comprehensive personalized learning path recommendation. Following this sequence, the system utilizes the knowledge graph derived from the student’s learning history to pinpoint additional skills that the student may desire to cultivate, such as listening and reading. Consequently, the personalized course recommendation algorithm recommends advanced courses encompassing aspects of vocabulary, grammar, listening, and reading. Simultaneously, each course incorporates diverse exercises, and the subsequent personalized exercise recommendation is derived from the user’s history of answer sequences. It suggests exercises tailored to the current learning category, exercise difficulty, and the student’s mastery level.

In this study, personalized learning recommendation systems are categorized into a three-dimensional taxonomy comprising personalized learning path recommendation, personalized course recommendation, and personalized exercise recommendation. This classification schema is derived from the hierarchical recommendation needs across various learning stages. Path recommendation involves holistic learning planning by analyzing learning objectives, knowledge graphs, and prerequisite relationships to devise optimal learning sequences. At the course recommendation level, appropriate course content is filtered based on learners’ interest preferences, knowledge mastery levels, and course relevance to assist in resource selection. Exercise recommendation focuses on specific skill consolidation by providing targeted practice problems through real-time knowledge state diagnosis and forgetting curve analysis. This hierarchical framework not only aligns with the pedagogical logic of “goal-content-practice” but also corresponds to the technical implementation pathway of educational data mining from coarse-grained to fine-grained processing, thereby systematically encompassing the learners’ complete cognitive chain.

Existing surveys on personalized learning recommender systems have primarily organized the field along technical methodologies, input data types, and evaluation dimensions, rather than based on the specific learning tasks being recommended. For example, Urdaneta-Ponte et al. [189] classified educational recommender systems according to how recommendations are generated, presented, and evaluated, emphasizing hybrid approaches and accuracy-centered assessments. Similarly, Da Silva et al. [190] conducted a systematic review of recommendation methods, collaborative filtering, content-based, and hybrid, across educational contexts, maintaining a methodological and system-level orientation. More recent reviews focusing on personalized learning structured the field around algorithmic pipelines, including learner and resource modeling, dataset usage, and algorithmic trends, thereby reinforcing an algorithm-centered taxonomy [191,192]. Overall, these surveys mainly focus on “which techniques are used, with which inputs, and how accuracy is measured,” while paying less attention to the pedagogical function or granularity of the recommended learning entities. In contrast, this paper adopts a task-oriented taxonomy that decomposes personalized learning recommendations into three functional strata， i.e., learning path (goal sequencing and prerequisite planning), course (resource selection based on interests and mastery), and exercise (fine-grained practice aligned with learners’ knowledge states). This “goal→content→practice” hierarchy aligns with learners’ cognitive workflows and serves as a classification lens that re-indexes existing research according to the recommended object and decision process rather than algorithmic family alone. Specifically, our taxonomy emphasizes: (i) task granularity (path vs. course vs. exercise) as the primary axis of comparison; and (ii) a technique-to-task mapping that positions common methods, such as knowledge graphs for structural constraint modeling, reinforcement learning for sequencing decisions, knowledge tracing for mastery estimation, and large language models for semantic enhancement, within their corresponding decision stages.

5.1 Personalized learning path recommendation

Grounded in progressivism and constructivism, the evolution of personalized learning path recommendation systems reflects a theoretical shift toward learner-centered and knowledge-constructive paradigms. Progressivism advocates individualized and interest-driven learning trajectories, offering a pedagogical rationale for adaptive path design. Meanwhile, constructivism emphasizes that knowledge is actively constructed through interaction with learning contexts, providing a theoretical foundation for dynamic and semantically linked path generation based on knowledge graph modeling. Building on these theoretical underpinnings, the development of learning path recommendation systems has exhibited distinct phase-specific characteristics in both research paradigms and technical approaches. The initial phase relied predominantly on clustering analysis and historical data mining to establish basic recommendation frameworks by identifying behavioral patterns among similar learners. With the maturation of knowledge graphs and reinforcement learning technologies, subsequent research shifted focus to dynamic competency modeling and semantic relationship mining, marking a paradigm transition from static recommendations to adaptive learning paths. In the most recent phase, the convergence of knowledge graphs with deep reinforcement learning, real-time analytics, and large language models has propelled the development of intelligent recommendation systems featuring real-time adaptability and explainability. These three stages collectively illustrate the technological progression from basic data-driven methods to sophisticated multitechnology integration. As shown in Fig. 12, we have classified the relevant literature into three parts based on the discussion aforementioned. The technical approaches, datasets, evaluation metrics, and other elements of the related work are summarized in Table S6.

5.1.1 Classification

1) Clustering and historical data: early approaches to learning path personalization. Early research primarily focused on learning path recommendations by leveraging clustering methods to identify similar learners based on historical data. Studies such as [194–196,200] propose various algorithms, including an LSTM model [194] and personalized paths derived from individual learning histories [195,196]. Some approaches, like [196], further prioritize paths from influential learners for users with similar profiles.

However, recommendations based solely on historical data lack sufficient personalization. Addressing learners’ adaptability and ability differences is critical, as highlighted by Mansur et al. [197]. For instance, Vanitha et al. [199] proposed aligning learning content with personal characteristics through ant colony optimization and genetic algorithms to generate more tailored learning paths.

Additionally, early studies incorporated time constraints of online learning resources into path recommendations [193,198]. While these methods focused on assessing similarities between learners or resources, their reliance on historical data limited their ability to capture personalized needs, and clustering techniques often struggled to handle diverse cognitive levels effectively.

2) Knowledge graphs and reinforcement learning: dynamic modeling for adaptive paths. With the rise of deep learning, personalized path recommendation methods have increasingly leveraged more personalized learner features, as highlighted in [197,202]. These approaches integrate individual learning needs, habits, and preferences into comprehensive knowledge graphs tailored to specific problems. For example, Shi et al. [203] constructed multidimensional knowledge graphs to model diverse learning relationships for various objectives, while Liu et al. [201] addressed the challenges of low-engaged users by building interaction networks between courses and learners. Similarly, Sun et al. [204] developed personalized knowledge graphs with extensive English practice questions, aiming to create unique learning paths for individual learners.

Reinforcement learning techniques further advance these methods by integrating learner mastery levels. For instance, cognitive diagnostic models analyze user proficiency, with hierarchical reinforcement learning refining mastery-level-based recommendations [205].

These methods are designed to address the diverse and dynamic needs of learners by combining knowledge graphs and reinforcement learning. However, despite advancements, the full potential of these technologies in personalized learning path recommendation remains underexplored, as clustering-based methods continue to dominate much of the research landscape.

3) Integrated intelligent systems: multi-technology convergence for real-time learning paths. Recent research has significantly advanced personalized learning path recommendations by integrating knowledge graphs, deep reinforcement learning, and other cutting-edge technologies. Major advances include the development of multidimensional knowledge graphs with higher-order relevance modeling to capture learner preferences more accurately [209], as well as the creation of dedicated knowledge graphs to address diverse learning needs [199]. Further work has combined temporal and graph attention networks to model dynamic learner-resource interactions for reinforcement learning [206]. To address the oversimplification of learner abilities, graph-based genetic algorithms have also been proposed to optimize feature alignment and generate diverse learning paths [210]. These methods fully leverage the capabilities of knowledge graphs and deep reinforcement learning, incorporating knowledge-tracking models to account for learners’ evolving proficiency levels. For example, Frej et al. [211] enhanced explainability in MOOC recommendations by using graph reasoning to generate interpretable path-based explanations. Knowledge graphs have also been utilized to provide factual context for LLMs, reducing hallucinations and improving the precision of personalized explanations [212]. In addition, end-to-end planning frameworks have been proposed that integrate prompt engineering with models like GPT-4 to generate pedagogically grounded and adaptive learning paths, demonstrating improvements in accuracy and learner satisfaction [216]. The integration of real-time learning analytics, as demonstrated by [215], further refines these methods by dynamically adjusting learning paths based on learner performance and preferences. Meanwhile, Xiao et al. [214] highlighted the importance of extracting logically coherent learning pathways using large language models, optimizing recommendations by preserving the contextual flow of learners’ enrollment histories.

5.1.2 Practical implementation challenges

Although personalized learning path systems have evolved from clustering-based analytics to LLM-driven adaptive frameworks, their large-scale deployment still faces notable practical challenges. First, data fragmentation across learning platforms and repositories hinders the creation of unified learner profiles essential for continuous adaptation. Second, an alignment gap often exists between algorithmic recommendations and curricular objectives, raising concerns about pedagogical validity and accountability. Third, ensuring real-time adaptivity demands high computational efficiency and robust system design, which can be difficult to achieve in large or resource-limited environments. Finally, transparency, ethics, and privacy remain critical issues, particularly as LLM-based models rely on multimodal behavioral data. Addressing these challenges requires moving beyond algorithmic performance toward system-level design, emphasizing interpretable AI, data governance, and teacher-in-the-loop frameworks to ensure pedagogical coherence and sustainable integration.

5.1.3 Educational implications

Personalized learning path systems are not only technical solutions but also pedagogical tools that reshape how learning takes place. By dynamically adjusting the sequence and difficulty of learning materials to match individual progress, these systems create adaptive environments that foster autonomy and sustained engagement. For example, Ouyang [217] revealed that AI-driven adaptive features enhance educational quality through self-regulated learning and engagement, while Hariyanto et al. [218] found that multimodal and reinforcement learning methods improve personalization and educational equity. Further supporting this, Du Plooy et al. [219] confirmed in a higher education context that adaptive learning pathways correlate with higher academic performance and motivation. Students are thus encouraged to take ownership of their trajectories, linking effort with visible progress, which strengthens both motivation and self-efficacy. For teachers, this shift means moving from delivering fixed content to guiding and supporting learners through individualized pathways. From a long-term educational perspective, adaptive path systems can promote motivation, independence, and continuous engagement, making learning more personal and meaningful rather than purely algorithm-driven.

5.2 Personalized course recommendation

Progressivism and sociocultural Theory provide the pedagogical foundation for personalized course recommendation systems. Rooted in learner-centered and experiential learning principles, progressivism informs adaptive recommendation mechanisms that dynamically adjust course sequences through reinforcement learning and generative modeling. Meanwhile, sociocultural theory emphasizes socially mediated knowledge construction, supporting the use of attention-based knowledge graphs that model contextual and relational dependencies among learning resources. Technically, the development of personalized course recommendation systems has exhibited a clear phased evolution in research paradigms and methodologies. The initial phase predominantly addressed fundamental challenges such as data mining and sparsity issues, establishing basic recommendation frameworks through distributed computing and matrix factorization techniques. With the advent of knowledge graphs and attention mechanisms, the research focus shifted toward mining semantic course relationships and modeling dynamic learner competencies, marking a paradigm transition from static to context-aware recommendations. Most recently, the deep integration of knowledge graphs with reinforcement learning and engagement prediction technologies has propelled the field toward intelligent and holistic recommendation systems, capable of real-time adaptation to learners’ evolving needs. As shown in Fig. 13, we have categorized the relevant literature into four parts based on the aforementioned discussion. The technical approaches, datasets, evaluation metrics, and other elements of related works are summarized in Table S7.

5.2.1 Classification

1) Foundations of data-driven recommendations: mining large-scale patterns and addressing sparsity. With the rise of online learning platforms like MOOCs, traditional recommendation methods face challenges due to the rapid growth of learners and course information. Initially, large-scale pattern mining was tackled through distributed computational frameworks and specialized algorithms for extracting course association rules [221]. Concurrently, the challenge of user heterogeneity and sequential decision-making led to the proposal of methods like Hierarchical Bandits for recommending high-reward courses [220]. To combat sparsity, Yang et al. [223] constructed learner and course networks using the Hyperlink-Induced Topic Search algorithm to extend the user’s rating matrix. Beyond sparsity, modeling multi-faceted learner-course interactions has been a focus. This includes employing multi-dimensional matrix factorization models to capture the diverse attributes of courses [222]. Furthermore, to model the dynamic nature of learning, hierarchical reinforcement learning methods have been introduced to determine when and how a learner’s focus should shift among different interests during the learning process [224].

In the initial stages, pioneering efforts were made to extract valuable insights from extensive course data, addressing the issue of interaction sparsity to a certain extent. However, early studies didn’t consider the connections and prerequisites between courses, which are important for personalized course suggestions.

2) Semantic and dynamic modeling: knowledge graphs and attention mechanisms for context-aware recommendations. Subsequent research has increasingly focused on incorporating learner preferences and course characteristics into knowledge graphs to uncover semantic associations between courses. For instance, precedence maps have been developed by analyzing differences in learners’ knowledge backgrounds to identify significant precedence relations between concepts and courses [230]. Similarly, graph-based networks have been proposed to evaluate instructional quality by modeling multiple influences on learners, such as individual interests, peer interactions, and instructor guidance [225]. To overcome the limitations of traditional collaborative filtering in capturing semantic correlations, knowledge graph representation learning has been employed to embed course semantics into a unified vector space, enabling the calculation of semantic similarity for recommendations [227,229]. Attention mechanisms have also been widely adopted to model correlations between courses more effectively. For example, course correlations have been constructed from textual descriptions using an Attention Manhattan Siamese Long Short-Term Memory network combined with an autoencoder and self-attention mechanisms, allowing recommendations to be adaptively tailored to student preferences [226]. Furthermore, researchers have investigated the dynamic modeling of learners’ proficiency levels to enhance recommendation accuracy. Tian et al. [228] proposed a knowledge tracking model that updates learners’ abilities in real-time by incorporating multi-dimensional factors, integrating these estimates as attributes into a collaborative filtering framework for course recommendations.

Despite these advancements, challenges remain in handling the interconnected nature of courses and adapting to learners’ evolving abilities and skills in real time. Future research must focus on developing more robust methods to address these limitations and further improve the personalization of course recommendations.

3) Towards holistic intelligence: knowledge graphs, reinforcement learning, and engagement prediction. Recent research has focused on extracting personalized information from courses and learners and integrating it into knowledge graphs to model complex relationships. For instance, Jung et al. [233] created a hierarchical map of learners and courses using MOOC data and external knowledge, while Ban et al. [231] built a course sequence diagram across learners, analyzing web-learning course representations through a graph structure. To more deeply model learning processes and relationships, a Collaborative Sequence Graph constructed with Graph Convolutional Network (GCN) layers has been proposed [232]. Further, knowledge graphs have been employed to project course-learner interactions into unified embedding spaces [237]. For handling intricate semantic information, self-symmetric meta-paths and meta-graphs within course knowledge graphs have been introduced, utilizing specialized graph embedding techniques [235]. Additionally, Zhang et al. [238] proposed a heterogeneous information network for personalized course recommendations using a factorial memory network and graph neural network.

Reinforcement learning has also been widely adopted to enhance course recommendations. For sequential decision-making, methods including policy gradients [236], attention mechanisms with recursive reinforcement learning [234], and a deep reinforcement learning multi-agent framework incorporating learner sentiment and competency have been proposed [239]. Efforts have also been made to incorporate language models and deep knowledge tracking techniques. For instance, Zhang et al. [7] leveraged unstructured textual data by extracting word vectors using BERT and analyzing their semantic content. Relatedly, Ban et al. [231] addressed the forgetting problem in knowledge tracking by developing a personalized controller that simulates learner forgetting and individual differences based on cognitive psychology principles. Additionally, Gharahighehi et al. [241] integrated Survival Analysis (SA) methods with collaborative filtering to model time-to-dropout and time-to-completion, significantly improving recommendation performance by considering time-to-event information. Moreover, engagement prediction has emerged as a critical factor in reducing dropout rates. Li et al. [240] proposed a Quantified Engagement (QE) method and an Engagement Neural Network (ENN) model to predict learners’ engagement in unenrolled courses. By applying QE and ENN to personalized course recommendations, the study demonstrated a significant reduction in dropout rates, validating the effectiveness of engagement-based recommendations.

4) LLM-enhanced course recommendation: explainability, goal alignment, and adaptive personalization. Recent research increasingly leverages LLMs to enhance adaptability, interpretability, and goal alignment in course recommendation. Systems such as LearnMate [242] and Retrieval-Augmented Generation (RAG)-based course discovery models [243] demonstrate how prompt engineering and semantic retrieval enable context-aware and conversational recommendations. Benchmarking studies further show that LLMs achieve comparable accuracy with greater diversity [246], while ontology-integrated frameworks connect course selection to career competencies [247]. Overall, these advances mark a transition from static filtering toward adaptive, semantically grounded, and learner-centered recommendation paradigms.

5.2.2 Practical implementation challenges

Despite significant advances in integrating knowledge graphs, reinforcement learning, and large language models, personalized course recommendation systems still face major implementation barriers. First, the heterogeneity of course data across disciplines and institutions complicates the construction of unified semantic representations. Second, the alignment issue persists, as automatically generated recommendations may conflict with curricular structures or accreditation requirements. Third, maintaining scalable personalization across thousands of evolving courses demands intensive computation and learner modeling. Finally, ensuring explainability and fairness is crucial—recommendations must transparently link to learner goals while avoiding bias from uneven data distributions. Future efforts should prioritize curriculum-aware, interpretable, and fairness-driven system design to enable educationally valid and institutionally integrated course recommendation frameworks.

5.2.3 Educational implications

Course recommendation systems do more than suggest the next item in a learner’s queue; they actively shape learners’ planning, motivation, and long-term trajectories. By aligning recommended courses with learners’ prior knowledge, interests, and vocational aspirations, these systems enhance the perceived relevance of learning and foster intrinsic motivation, thereby promoting sustained engagement and informed, autonomous learning. Recent studies substantiate these pedagogical effects: Cha et al. [248] demonstrated that AI-based recommenders reduce uncertainty during course selection and strengthen students’ confidence in their academic decisions, while Zhang et al. [249] showed that graph-based and learning-style-aware modeling improves recommendation accuracy and learning engagement. For educators and institutions, such systems enable more flexible curriculum design and data-informed academic advising. However, to realize these benefits in practice, recommendation mechanisms must remain curriculum-aligned, interpretable to educators and learners, and designed to avoid biases and narrow personalization. When appropriately implemented, course recommendations can support self-directed learning and empower students to make informed, confident choices about their educational journeys.

5.3 Personalized exercise recommendation

Behaviorist Theory, Cognitive Psychology, and sociocultural theory jointly underpin personalized exercise recommendation systems. Behaviorism informs reinforcement-based feedback optimization, Cognitive Psychology supports knowledge tracing through cognitive state modeling, and Sociocultural Theory emphasizes social interaction, inspiring peer- and community-aware recommendation mechanisms. Building on these theoretical insights, the evolution of personalized exercise recommendation systems has exhibited distinct phase-specific characteristics. The initial research phase primarily focused on the structural modeling of knowledge points to establish foundational frameworks for recommendation systems. Subsequent advancements in deep learning and knowledge graph technologies prompted a paradigm shift toward semantic comprehension and dynamic knowledge tracing, thereby enhancing recommendation precision. In recent years, the convergence of multiple technologies (e.g., reinforcement learning, deep knowledge tracing, and knowledge graphs) has further propelled the development of systems with enhanced adaptability and intelligent capabilities. The following three stages reflect the temporal progression of technological evolution, illustrating the transformation of research paradigms from static recommendation approaches to dynamic personalization frameworks. As shown in Fig. 14, we have categorized the relevant literature into four parts based on the aforementioned discussion. The technical approaches, datasets, evaluation metrics, and other elements of related works are summarized in Table S8.

5.3.1 Classification

1) Foundations of knowledge modeling: structuring knowledge points for basic exercise recommendations. Early research has established the importance of modeling the relationships between knowledge points and topics, as well as accounting for variations in learners’ knowledge levels. For instance, Xia et al. [252] identified two critical criteria for personalized exercise recommendations: 1) comprehensive coverage of knowledge points relevant to the learner’s objectives, and 2) exercises with difficulty levels tailored to the learner’s current knowledge proficiency. Building on this, Lv et al. [251] constructed a knowledge graph to explicitly capture dependencies between knowledge points, while Diao et al. [250] utilized a knowledge graph enriched with exercise topics, employing tree search algorithms to generate recommendations.

Despite these advancements, a significant limitation of early studies lies in their superficial exploration of the connections between knowledge points. These approaches often lacked depth in incorporating learners’ dynamic knowledge mastery, which is essential for truly personalized and adaptive exercise recommendations.

2) Semantic understanding and mastery tracking: deep knowledge tracing for context-aware recommendations. The significance of knowledge graphs in modeling the relationships between knowledge points and learners has gained increasing attention, particularly in the context of personalized exercise recommendations powered by deep knowledge tracking and knowledge diagnostic models. To address the limitations of earlier approaches in capturing semantic relationships between exercises and knowledge points, knowledge graphs incorporating extensive practice questions and knowledge points have been developed. These models leverage techniques such as word2vec and graph convolutional networks to extract and represent semantic information [253,261]. Furthermore, to overcome the shortcomings of traditional methods in accurately assessing learners’ dynamic knowledge states, deep knowledge tracking models have been introduced. These models predict a learner’s mastery of specific knowledge concepts based on their interaction history [254–256]. These predictions were subsequently integrated with collaborative filtering or clustering techniques to generate personalized exercise recommendations tailored to individual learners’ needs.

However, several challenges remain unexplored, including the development of more sophisticated deep knowledge tracking models, the integration of advanced technologies, and the real-time adaptation of recommendations to reflect the dynamic learning trajectories of individual students. These areas represent critical directions for future research in the field of personalized exercise recommendations.

3) Dynamic personalization: knowledge graphs, deep tracing, and reinforcement learning. Recent research has significantly advanced personalized exercise recommendations by focusing on the construction of knowledge graphs and the application of deep knowledge tracking models. This includes the development of comprehensive knowledge graphs that integrate knowledge points, student profiles, and exercise topics to create a unified resource framework [261]. Building on this, He et al. [259] and Ren et al. [263] further utilized deep knowledge tracking models to dynamically assess and monitor students’ mastery of knowledge concepts, providing real-time insights into their learning progress. Incorporating principles from constructivist learning theory, Yan et al. [271] combined knowledge structure maps with deep knowledge tracking to identify and recommend diverse, challenging, and innovative topics tailored to individual learners. Additionally, Li et al. [262] enhanced existing knowledge graphs by mining personalized topic information, transforming them into multidimensional structures that better capture the complexity of learner-exercise interactions. Furthermore, Li et al. [260] proposed a syllabus-aligned recommendation method that considers selected knowledge points and expected score ranges, ensuring that exercises are closely aligned with teaching objectives.

Reinforcement learning techniques have also been extensively explored to further enhance exercise personalization. Early applications focused on optimizing core learning dynamics, with reinforcement learning frameworks balancing objectives (review, exploration, smoothness, learner engagement) [257]. This multi-objective approach was further extended to explicitly model exercise difficulty, integrating review schedules, difficulty levels and learning efficacy to recommend topics at suitable challenge levels [258]. Moreover, to go beyond immediate recommendations, research has targeted long-term learning needs, using reinforcement learning to adapt and sequence recommendations over time to sustain learner progress [264].

4) From recommendation to generation: LLM-powered evolution of exercise personalization. Distinct from learning path and course recommendation, where personalization focuses on sequencing and selection, recent studies reveal a conceptual shift in exercise recommendation—from selecting pre-existing tasks to generating new and adaptive exercises using LLMs. A growing trend is the development of exercise generation systems that integrate pedagogical reasoning with automated content creation. For instance, Ta et al. [267] introduced ExGen, which leverages LLMs to automatically generate personalized, ready-to-use programming exercises, while Meißner et al. [268] compared GPT-4 and LaMDA for software engineering tasks, emphasizing that LLMs can greatly reduce authoring effort, though human oversight remains essential. Further, Zeng et al. [269] demonstrated that instruction-tuned LLMs using the Evol-Instruct and Adaptive Curriculum for Exercise Synthesis (ACES) algorithms can enhance the quality and diversity of generated exercises, and Luo et al. [270] applied Bloom’s Taxonomy to prompt design, producing physics problems aligned with cognitive levels and learning goals. Additionally, Li et al. [272] proposed CIKT, a collaborative and iterative knowledge tracing framework that integrates LLMs to enhance both prediction accuracy and interpretability in exercise-based learning environments, providing a bridge between explainable student modeling and adaptive content generation. Collectively, these advancements signify a shift from static exercise recommendations toward intelligent content generation, where LLMs act not only as recommenders but also as creative co-designers capable of generating pedagogically coherent, context-aware, and cognitively aligned exercises.

5.3.2 Practical implementation challenges

Unlike path or course recommendation, exercise personalization operates at the item level and increasingly involves on-the-fly generation, introducing distinct deployment challenges. First, ensuring the quality and validity of generated items remains difficult; errors in question stems, solutions, or distractors can compromise reliability and pedagogy. Second, difficulty calibration and alignment with curriculum standards and cognitive levels (e.g., Bloom’s taxonomy) are nontrivial, as improper mapping undermines mastery estimation. Third, coupling with knowledge tracing and diagnostics is fragile due to noisy metadata and shifting concept taxonomies that propagate errors through adaptive feedback loops. Fourth, academic integrity and provenance demand safeguards against prompt inversion, plagiarism, and data leakage, requiring traceable item generation pipelines. Fifth, real-time efficiency and fairness pose additional constraints—generation must balance latency, accuracy, and inclusivity across linguistic and cultural contexts. Addressing these issues requires generation-aware MLOps that integrate instructor-in-the-loop validation, provenance metadata, fairness auditing, and curriculum-aligned difficulty modeling, ensuring that LLM-driven exercise systems evolve from recommendation toward pedagogically sound, reliable generation at scale.

5.3.3 Educational implications

The evolution from recommending to generating exercises represents a paradigm shift in how students practice and consolidate knowledge. LLM-powered systems can now produce exercises tailored to individual proficiency levels and deliver real-time, adaptive feedback, rendering practice sessions both highly efficient and deeply engaging. This pedagogical potential is increasingly supported by empirical research. For adaptive learning, Cui and Sachan [273] demonstrated that combining knowledge tracing with controlled text generation produces adaptive language exercises aligned with learners’ knowledge states. In programming education, Rytilahti et al. [274] found that LLM-generated programming exercises diversify practice and reduce teachers’ workload. When effectively implemented, such systems do more than bolster conceptual understanding and self-efficacy; they also free educators from routine task creation, allowing them to dedicate more attention to high-level instructional guidance. Thus, automated exercise generation transcends its technical role to become a vital pedagogical bridge—seamlessly connecting targeted practice, personalized feedback, and sustained learning growth.

6 Application case

In recent years, the application of personalized learning in the classroom has been quite extensive. Based on our previous categorization, these real-world application cases can be reviewed from three aspects: cognitive modeling, non-cognitive modeling, and personalized recommendations, depending on the types of personalized learning technologies employed in them.

6.1 Concrete examples

We summarize recent applications of personalized learning in teaching and learning, as shown in Table 3.

Examples focusing on cognitive diagnostics and non-cognitive modeling include [275], who proposes a learner model combining stereotyping, fuzzy logic, and similarity techniques to personalize learning paths based on user interactions (e.g., page visits, mouse clicks), enhancing self-regulation. However, in practice, it relies heavily on questionnaire-based initialization, introducing subjective bias. Moreover, the emotional dimension is limited to two coarse variables (“motivation” and “mood”), which fail to capture nuanced affective states. Yang et al. [276] implemented a personalized learning analytics intervention in an undergraduate accounting course using the “BookRoll” e-book system and recommender tools. Their findings showed improved academic performance and behavioral engagement in the personalized group.

In the recommendation domain, Huang et al. [9] studied AI-driven personalized video recommendations in a Taiwanese flipped classroom, reporting increased engagement and learning outcomes, particularly for moderately motivated students. However, the recommendation process depends on IBM Watson, which suffers from limited interpretability. Moreover, the lack of diversity metrics in the recommendation mechanism raises concerns about potential information silos. Lim et al. [277] applied real-time analytics and rule-based AI to deliver personalized scaffolding to German university students, supporting self-regulated learning via real-time behavioral data analysis. However, because the scaffolding rules are manually defined by experts and relatively sparse, the system exhibits limited adaptability to complex or dynamic learning scenarios. Similarly, another system leverages the Learning Object Metadata Standard for content delivery, with its logic also dependent on preset rules [278]. To enhance adaptability, intelligent systems integrating Case-Based Reasoning with learning style models have been designed to tailor resources to learner preferences. Nevertheless, the limited scale and domain diversity of their case libraries constrain cross-context transferability [279]. Karaoglan Yilmaz et al. [280] investigated learning analytics-based personalized recommendation and feedback in a Turkish flipped classroom, showing that customized feedback improved students’ self-assessment, motivation, and learning attitudes. However, the study relies primarily on subjective satisfaction measures, lacking a quantitative evaluation of actual learning gains. Furthermore, a personalized recommendation framework for K-12 education has been proposed, emphasizing adaptation based on individual characteristics and learning styles while discussing integration challenges in practice. Although this framework provides a valuable design reference, it lacks empirical validation and concrete implementation details, leaving its effectiveness in real-world settings to be verified [281].

Overall, most applications currently focus primarily on behavioral and real-time testing, lacking objective, multidimensional variables such as emotional cognition and sociocultural factors. Moreover, the relatively short duration of experimental studies limits the evaluation of long-term persistence and temporal decay of personalization effects. In addition, the interpretability of recommendation mechanisms and the management of data privacy remain critical yet underexplored dimensions that warrant greater attention in future research.

6.2 Intelligent educational software

Intelligent educational software utilizes modern technology, particularly artificial intelligence and data analytics, to enhance the educational experience by offering personalized teaching and learning support based on students’ habits, performance, and preferences. This software can be categorized into six main types: adaptive learning systems, virtual education assistants, learning analytics tools, online learning platforms, learning management systems, and intelligent educational games. Below, we describe each type, highlighting their features, student modeling methods, personalized recommendation strategies, and implementation examples.

Adaptive learning systems (ALS) dynamically model students’ knowledge states and learning behaviors to enable real-time adaptation of instructional content and difficulty. Knewton [282] is an example that employs advanced algorithms to analyze student behavior and performance, providing customized learning paths that align with individual abilities and interests. It employs knowledge modeling to assess conceptual mastery while integrating personalized recommendation strategies, such as collaborative filtering and reinforcement learning, to generate individualized learning paths. In higher education, Knewton’s University Algebra course at the University of Arizona (involving approximately 2,000 students) reported a 17% increase in pass rates, and a 56% decrease in dropout rates. In K-12 education, a randomized controlled trial (RCT) by RAND with 1,400 middle school students using a Knewton-style ALS achieved moderate learning gains (Cohen’s

d = 0.34

) and a 4.2-percentage-point reduction in dropout rates. Another prominent commercial platform, DreamBox, also illustrates the efficacy of adaptive learning systems at scale. During the 2020–2021 academic year, an impact assessment spanning 8,000 schools and 3 million K-8 students revealed that learners who completed at least five lessons per week (approximately 60 minutes) improved their NWEA MAP percentile scores by an average of 9.9 points. ALS platforms like Knewton and DreamBox are highly representative of demonstrating large-scale efficacy. However, their reliance on well-defined learning objectives and structured assessments limits applicability to open-ended or interdisciplinary learning. Moreover, the limited transparency of their recommendation algorithms raises concerns regarding reproducibility, generalizability, and potential instructional bias.

Virtual educational assistants (VEA) are AI-driven learning support systems that provide personalized resource recommendations and real-time tutoring through natural language interaction and multimodal feedback. These systems integrate both cognitive modeling to monitor knowledge acquisition and learning trajectories and non-cognitive modeling, which analyzes motivation, attention, and engagement behaviors to enable adaptive interventions. The language learning platform Duolingo [283] exemplifies a typical VEA application, employing reinforcement learning and sequence modeling to dynamically adjust content difficulty and feedback, thereby offering a highly personalized language learning experience. Empirical studies have consistently confirmed Duolingo’s effectiveness in language acquisition. For instance, in the transitional preparatory English course, a blended learning approach combining Duolingo and ReadTheory increased CEFR-aligned receptive skill test scores from 8.12 ± 2.16 to 15.17 ± 1.79 (

p < 0.001

), indicating substantial improvement [284]. Similarly, a study involving 48 U.S. college students reported that after three months of continuous use, language proficiency advanced from Novice Low to Novice Mid (

r = 0.61, p < 0.001

), with notable gains in writing and speaking (Cohen’s

d ≈ 1.0

) [285]. These results demonstrate that AI-based VEAs can significantly enhance both language learning outcomes and learner motivation. Yet, effectiveness is less documented for complex STEM subjects or project-based tasks, where real-time adaptation may require deeper cognitive modeling and domain expertise.

Online learning platforms (OLP) such as Coursera, Khan Academy, and Udacity empower students to independently select learning content based on personal interests and needs by offering a wide variety of course resources. These platforms emphasize adaptive pacing and resource allocation to accommodate individual learning trajectories. For example, Coursera [286] provides an extensive range of academic and professional training programs that allow learners to progress at their own pace. Evaluation data indicate that instructional interventions—such as requiring at least one instance of human grading—can improve learner retention by approximately 6%, while stricter control of study duration contributes an additional 5% increase in retention. In K-12 education, multiple randomized controlled trials and large-scale empirical studies involving over 11,000 students in grades 3–6 have shown that Khan Academy significantly enhances students’ mathematical achievement and learning attitudes in foundational mathematics courses. Although the user base of OLP has grown steadily in recent years, with increasing incorporation of artificial intelligence technologies, dropout rates remain high in fully online settings. Furthermore, these platforms often face challenges in delivering personalized learning beyond standard course pathways, and their integration of fine-grained cognitive modeling remains limited compared to ALS.

Learning analytics tools (LAT) systematically analyze student data, such as engagement, performance, and learning progress, to generate actionable insights that support adaptive instruction and personalized feedback. These tools also facilitate early warning systems and data-driven decision support for educators. IBM Watson Education [287] exemplifies this paradigm by leveraging cognitive modeling and advanced analytics to identify learning patterns, predict academic outcomes, and assist instructors in optimizing pedagogical strategies. The system further integrates personalized recommendation techniques, including collaborative filtering and sequence modeling, to deliver targeted learning interventions tailored to individual students’ needs. Nonetheless, dependence on institutional data quality and privacy constraints may hinder broader adoption, and predictive models can propagate existing biases in educational data.

Learning management systems (LMS) provide a centralized platform for creating, managing, and delivering educational content. They support personalized curriculum and learning tracking to help students learn at their own pace. Moodle [288], one of the most representative open-source LMSs, empowers educators to flexibly design instructional materials and evaluation methods while fostering online collaboration and learner engagement. Empirical evidence from a study involving Nigerian distance-learning undergraduates (

n = 113

) [289] revealed that the use of Moodle led to significant improvements in both academic performance (

F (3, 336) = 193.19,

p < 0.001, η 2 = 0.963

) and learning satisfaction (

F (3, 336) = 154.11,

p < 0.001, η 2 = 0.957

), confirming its strong efficacy in enhancing learning outcomes and learner experience. LMSs are highly representative in infrastructure provision and curriculum management. Yet, their adaptability relies heavily on educator input, and personalization capabilities remain limited compared to ALS or VEA.

Educational games (EG) integrate pedagogical objectives with entertainment elements to enhance learner motivation and engagement through interactive and immersive experiences. These systems dynamically adapt task difficulty based on player performance and progression, achieving an optimal balance between enjoyment and educational impact. Minecraft: Education Edition (MCEE) [290] exemplifies this approach by enabling students to engage in inquiry-based learning and collaborative problem-solving within a virtual environment. In an empirical study conducted in Irish primary schools (

N ≈ 166

), 87.5%–96.3% of participating students reported that MCEE improved peer collaboration, while 93.7%–100% indicated that it significantly fostered creativity [291]. These results underscore the effectiveness of educational games in cultivating students’ collaborative, problem-solving, and creative abilities. However, measuring learning outcomes quantitatively is challenging, and long-term academic impact is less well-established. The cost and technical requirements can also restrict scalability.

Table S9 summarizes mainstream intelligent educational software and their key features. These cases collectively demonstrate that educational software varies substantially in both target learning stages and technical adaptability. For example, Knewton is deployed across K-12 and higher education; Duolingo primarily serves higher and adult education; DreamBox and Khan Academy focus on K-12 mathematics and foundational subjects; while Coursera and Udacity are oriented toward higher education, degree programs, and vocational training. This variation reflects fundamental differences in technological objectives, data structures, and instructional models across educational levels. In K-12 education, system design typically prioritizes measurable academic outcomes, such as standardized test improvement, course completion, and dropout reduction, supported by structured curricula and teacher-led implementations. RCTs are widely applied in this context to quantify instructional impact, as exemplified by the evaluation studies of DreamBox and Knewton. Data in K-12 environments are highly structured (e.g., item-level logs, assessment scores), which facilitates fine-grained modeling through CDM and Knowledge Tracing (KT) approaches. However, privacy regulations and parental oversight often constrain large-scale data sharing.

In higher education and adult learning, technological priorities shift toward optimizing course pass rates, learner retention, and time efficiency. Learners in these settings exhibit greater heterogeneity in prior knowledge and motivation, requiring systems with stronger generalization and semantic reasoning capabilities. Consequently, adaptive frameworks increasingly integrate knowledge graphs, LLMs, and personalized recommender systems to support open-ended, project-based, and self-directed learning. At the same time, the technical adaptability of cognitive diagnostic models varies across different educational stages. K-12 education places greater emphasis on precise assessment and cognitive level identification, while higher education focuses more on understanding complex learning behaviors and providing personalized feedback. Although intelligent educational software can deliver targeted learning experiences and provide real-time feedback, enhancing efficiency and interactivity, it remains limited in transparency, cross-domain adaptability, long-term efficacy evaluation, and incorporation of non-cognitive factors. Future work should address these gaps by developing interpretable adaptive algorithms, enhancing longitudinal evaluation, and expanding personalization to complex, interdisciplinary learning contexts.

6.3 Paradigm breakthroughs of LLM in personalized learning

In recent years, LLMs have brought paradigm-level transformations to personalized learning. Unlike traditional models that rely on structured learning logs and handcrafted features, LLM-based systems leverage natural language interaction and contextual reasoning to dynamically capture learners’ cognitive, affective, and behavioral states. This section reviews representative studies and systems to elucidate the core breakthroughs achieved by LLMs in personalized learning and the primary challenges they face.

The pioneering EduChat system exemplifies the deep integration of large-scale educational dialogue with cognitive-affective modeling [292]. Trained on massive instructional corpora, it interprets learners’ utterances and feedback to infer their mastery levels and learning intentions in real time, thereby transforming static learner profiles into dynamic, dialogue-based models. Through natural language interaction, it generates personalized learning paths and exercises aligned with learners’ self-stated goals, thereby redefining the logic of recommendation. Nevertheless, its dependence on substantial computational resources and potential privacy risks in cloud-based deployment remain key challenges. Mitigating these issues requires deploying private education servers and optimizing real-time inference pipelines. Similarly, SocraticLM [293], based on the Socratic Dialogue Model, adopts a multi-agent architecture (Dean–Teacher–Student model) to promote reflective reasoning and structured inquiry rather than providing direct answers. Recent research has extended this approach to dialogue-based knowledge tracing. For instance, Scarlatos et al. [294] introduced a framework that uses LLM prompting to identify knowledge components and misconceptions within tutor–student dialogues, achieving superior predictive accuracy in open-ended conversational learning. By simulating strategy-driven, personalized instruction grounded in constructivist learning theory, these systems identify cognitive misconceptions through multi-turn reasoning, while supervised agents enhance content reliability and mitigate biases.

A second line of research leverages LLMs to enhance diagnostic precision and interpretability. Models such as KCD [295] and MalAlgoPy [296] integrate LLM-based semantic reasoning with quantitative cognitive diagnosis frameworks. Building on this foundation, Wang et al. [297] proposed LLM-KT, a plug-and-play alignment framework that connects LLMs with knowledge tracing (KT) tasks through multi-modal context adapters, achieving state-of-the-art performance across benchmark datasets. Similarly, Yang et al. [298] introduced a difficulty-aware programming knowledge tracing model (DPKT) that extracts both textual and conceptual difficulty dimensions of programming problems to refine student knowledge estimation. Furthermore, the DDKT framework incorporates retrieval-augmented generation and dual-channel difficulty modeling, combining subjective and objective difficulty perceptions to enhance interpretability and mitigate cold-start problems [299]. Complementarily, Lee et al. [300] presented a simpler integration of pre-trained language models (PLMs) into KT, showing that semantic representations from textual data can significantly improve prediction accuracy and interpretability. Collectively, these works mark a paradigm shift from modeling purely observable behavior to developing semantically grounded, interpretable, and context-rich diagnostic frameworks that balance quantitative precision with human-readable reasoning.

Additionally, the study presented in [301] conceptualizes LLMs as diagnostic subjects in educational assessments. By applying Bloom’s Taxonomy to examine their internal knowledge structures, the study reveals that LLMs can exhibit overconfidence and conceptual inconsistencies even in fundamental topics. These findings provide critical insights for calibrating LLM-based student models and preventing biased diagnostic inferences. Meanwhile, frameworks such as LLM-SS [302] leverage prior code attempts and expert examples within a contextual learning paradigm to predict learners’ behavioral trajectories in visual programming tasks, thereby enabling adaptive, task-level feedback generation. Recent meta-analyses such as the systematic review by [303] further synthesize over fifty studies, underscoring LLMs’ impact on learner engagement, affective development, and ethical challenges, and calling for stronger integration between AI-driven personalization and educational validity. However, maintaining temporal consistency in synthesized learning trajectories remains a major challenge. Future research could address this by developing hybrid architectures that combine LLMs with recurrent student modeling networks, thereby enhancing stability and coherence over time.

Collectively, these studies highlight a paradigm shift in personalized learning from explicit feature engineering and static response modeling to language-driven, adaptive, and semantically grounded personalization. LLMs are redefining student modeling through dialogue-based cognitive inference, semantic-behavioral integration, and simulation-based augmentation, while also reshaping recommendation logic via natural language-driven learning path generation. Despite these advances, key challenges persist. First, inherent knowledge update latency in pre-trained models restricts integration of evolving curricula and emerging learning materials, which can be alleviated via Retrieval-Augmented Generation (RAG) and continuous fine-tuning for timely incorporation of up-to-date educational content. Second, guaranteeing generated output reliability and fairness remains a critical issue, addressable through post-hoc verification pipelines and evaluation agents based on objective scoring criteria, deployed for automatic assessment of factual accuracy and pedagogical relevance. Finally, challenges around data privacy, computational efficiency, model explainability, and reconciling generative flexibility with curricular constraints remain prominent. Future research should prioritize developing coherent, interpretable, continuously adaptive LLM frameworks that balance educational validity with the deep personalization required for truly intelligent learning systems.

7 Challenges and future directions

Personalized learning, as a cutting-edge application of intelligent education and a leading research focus in education, has flourished but faces challenges. These challenges primarily manifest in data quality, technology, assessment, and ethics, and increasingly intersect with issues introduced by the rapid adoption of Large Language Models (LLMs), including generative uncertainties, temporal dynamics, and pedagogical implications. This chapter provides a comprehensive analysis of these challenges across four key areas, offering insights into future directions and potential innovations to guide further research in personalized learning.

1) Data Data serves as the foundation of personalized learning, encompassing diverse information derived from learners’ academic history, behaviors, and life experiences. This vast array of data enables systems to assess students’ knowledge states, address individual learning needs, and continuously refine models based on learning styles, emotions, and behavioral patterns. However, existing datasets are often limited in scale, noisy, or lack essential metadata, which can lead to biased conclusions. In student behavior analysis, researchers frequently rely on privately collected datasets that are rarely disclosed, creating challenges in data completeness and standardization. The rise of multimodal data (e.g., text, speech, gaze) further increases the complexity of standardization and fusion. A new challenge with LLMs lies in their reliance on static, often outdated corpora, leading to knowledge cutoffs that lag behind curricula and educational standards. Even fine-tuned models can reproduce obsolete facts or outdated pedagogical concepts, aggravating data-quality and alignment issues. Additionally, continuous logging of learner interactions for adaptive feedback increases privacy and security risks through persistent data collection and implicit profiling. Balancing data utility, standardization, transparency, and privacy, while keeping adaptive models current, representative, and ethically grounded, remains a core challenge for next-generation personalized learning systems.

2) Technology Current personalized learning algorithms primarily focus on specific tasks, such as text assessment and recommendation systems but lack sufficient research on generalization across domains. Achieving holistic personalization requires models that are adaptable, interpretable, and grounded in educational and psychological theory. In learning analytics, effectively analyzing student data amid complexity and uncertainty remains a significant challenge. Despite advances in deep learning, most existing methods struggle to effectively handle multimodal signals and contextual uncertainty. The integration of LLMs introduces both opportunities and challenges [304]. LLMs enable dynamic, dialogue-driven learning support and contextual reasoning, but also raise new issues:

– Knowledge update latency, as static LLMs may rely on outdated information.

– Educational suitability, since generated content is not always pedagogically aligned or age-appropriate.

– High computational and energy costs, which hinder deployment in large-scale educational systems.

– Bias amplification and factual hallucination, which may mislead learners or distort educational fairness.

Overcoming these technological challenges requires deeper research into efficient LLM adaptation, continual fine-tuning with educational data, and the establishment of content-alignment mechanisms to ensure that generated materials support verified learning objectives.

3) Assessment Evaluating the effectiveness of personalized learning remains a complex challenge, as traditional metrics such as Hit Ratio (HR), Normalized Discounted Cumulative Gain (NDCG), and Recall fail to capture comprehensive learning progress and real-world impact. Relying solely on offline assessment methods complicates real-time performance evaluation in dynamic learning environments, delaying user feedback and overlooking a crucial aspect of personalized learning, which is enhancing the student experience and addressing individual needs. Existing metrics often fail to accommodate diverse learning styles, limiting the generalizability of evaluation results, while those derived from individual experiments or subjective judgments lack universal applicability and broad recognition. Moreover, the emergence of AI tutors and LLM-driven systems introduces new evaluation challenges, as learning outcomes become intertwined with conversational dynamics, self-directed exploration, and adaptive reasoning. Traditional quantitative metrics are often insufficient to assess such open-ended, interaction-driven learning processes. In summary, current evaluation approaches remain limited in timeliness, adaptability, and generalizability, relying excessively on static performance metrics while insufficiently capturing the interactive, dynamic nature of personalized and LLM-driven learning processes.

4) Ethics Ethical considerations in personalized learning are crucial due to the variability in model quality and the associated risks. The collection of extensive student data raises significant privacy concerns, while potential biases in training data can lead to unfair customization of learning content. With the rapid rise of LLMs, new ethical risks have surfaced: biased or culturally insensitive content generation, lack of interpretability, and potential dissemination of inaccurate or hallucinated information. Moreover, integrating LLMs into educational infrastructures amplifies privacy and accountability challenges, as personal learning records may inadvertently be exposed during model optimization or fine-tuning. In summary, the ethical landscape of personalized learning is becoming increasingly complex, as emerging AI-driven technologies amplify long-standing concerns about bias, transparency, and data protection.

Navigating the challenges of personalized learning requires ongoing technological advancements and in-depth research, which offer a promising pathway for future development. From a data perspective, efforts should focus on establishing standardized collection protocols, privacy-preserving sharing frameworks, and multimodal fusion techniques to ensure data fairness, diversity, and completeness. From a technological standpoint, research should advance lightweight model architectures, federated learning, and on-device inference to alleviate computational and deployment constraints. Equally important is interdisciplinary collaboration, particularly the integration of technological innovation with educational theory and psychological insight, to construct a more holistic and human-centered personalized learning framework.

Meanwhile, the integration of LLMs presents new opportunities to enhance personalized learning guidance. LLMs hold transformative potential to create interactive, adaptive, and context-aware learning environments that tailor content to individual learners’ styles, preferences, and knowledge gaps. However, realizing this potential requires addressing enduring challenges such as curriculum misalignment, knowledge obsolescence, and the generation of hallucinated or pedagogically inappropriate content. Future research opportunities include generating personalized educational questions, evaluating programming courses, and exploring the transformative impact of LLMs on education. The transition of academic models and algorithms from theory to practical implementation requires collaborative efforts to drive innovation and the real-world application of personalized learning technologies. To maintain accountability and contextual relevance, teacher-in-the-loop paradigms and transparent governance frameworks must be incorporated into AI-assisted educational systems.

In summary, despite its challenges, personalized learning offers significant opportunities for intelligent, flexible systems capable of adapting to diverse fields and subject areas, meeting students’ varied needs, and advancing personalized learning to new heights in the future.

8 Conclusion

This paper provides a comprehensive examination of personalized learning within the domain of intelligent education. We elucidated various definitions of personalized learning, examined its connections with educational theories, and traced its historical development, research motivations, and overarching goals. A systematic analysis is conducted from two key dimensions: student modeling and personalized recommendation methods.

For student modeling, we examined both cognitive and non-cognitive perspectives. For cognitive modeling, we reviewed foundational algorithms and their progression, with a particular focus on cognitive diagnosis, tracing its development from early psychometric models to modern attention-based approaches. Non-cognitive modeling includes learning styles, affect assessment, behavior analysis, and performance prediction. For personalized recommendations, we categorized methods into learning path, course, and practice recommendations, and examined their interplay with student modeling. Beyond theoretical discourse, we presented empirical evidence of real-world applications through case studies and an overview of existing platforms, tools, and software. However, it should be acknowledged that the practical cases reviewed in this study are predominantly concentrated in MOOCs and K-12 settings, with relatively limited coverage of vocational training, lifelong learning, and corporate learning applications. While these examples effectively demonstrate the potential of personalized learning technologies, they may not fully capture the diversity and complexity of implementation across all educational contexts. Future work could expand the scope of analysis by incorporating broader institutional types and diverse learner populations to ensure a more holistic understanding of personalized learning in practice.

Despite significant progress, challenges such as privacy, ethical considerations, model evaluation, and interpretability remain critical barriers to widespread adoption. Looking ahead, Large Language Models in education hold great potential to further enhance personalized learning experiences. This survey aims to provide researchers and practitioners with valuable insights, stimulate academic discourse, and contribute to the continuous advancement of personalized learning in intelligent education.

9 Electronic supplementary material

Supplementary material is available in the online version of this article at journal.hep.com.cn and link.springer.com.

References

Publishing order | Descend order by publishing year | Descend order by cited within

[1]	Bloom B S, Engelhart M D, Furst E J, Hill W H, Krathwohl D R. Taxonomy of Educational Objectives. New York: Longmans, Green, 1964

[2]	Skinner B F . Review lecture-the technology of teaching. Proceedings of the Royal Society B: Biological Sciences, 1965, 162( 989): 427–443

[3]	Brusilovsky P . Methods and techniques of adaptive hypermedia. User Modeling and User-Adapted Interaction, 1996, 6( 2−3): 87–129

[4]	Long P, Siemens G . Penetrating the fog: analytics in learning and education. Educause Review, 2011, 46( 5): 31–40

[5]	Berland M, Baker R S, Blikstein P . Educational data mining and learning analytics: applications to constructionist research. Technology, Knowledge and Learning, 2014, 19( 1−2): 205–220

[6]	Lin C F, Yeh Y C, Hung Y H, Chang R I . Data mining for providing a personalized learning path in creativity: an application of decision trees. Computers & Education, 2013, 68: 199–210

[7]	Zhang R . A personalized course resource recommendation method based on deep learning in an online multi-modal multimedia education cloud platform. International Journal of Information Technologies and Systems Approach, 2023, 16( 2): 1–14

[8]	Xu S . Recommendation of online learning resources for personalized fragmented learning based on mobile devices. International Journal of Emerging Technologies in Learning, 2022, 17( 3): 34–49

[9]	Huang A Y Q, Lu O H T, Yang S J H . Effects of artificial intelligence–enabled personalized recommendations on learners’ learning engagement, motivation, and outcomes in a flipped classroom. Computers & Education, 2023, 194: 104684

[10]	Chaipidech P, Srisawasdi N, Kajornmanee T, Chaipah K . A personalized learning system-supported professional training model for teachers’ TPACK development. Computers and Education: Artificial Intelligence, 2022, 3: 100064

[11]	Wang C, Zhu H, Zhu C, Zhang X, Chen E, Xiong H. Personalized employee training course recommendation with career development awareness. In: Proceedings of the Web Conference 2020. 2020, 1648−1659

[12]	Gómez S, Zervas P, Sampson D G, Fabregat R . Context-aware adaptive and personalized mobile learning delivery supported by UoLmP. Journal of King Saud University-Computer and Information Sciences, 2014, 26( 1): 47–61

[13]	Nabil A, Seyam M, Abou-Elfetouh A . Prediction of students’ academic performance based on courses’ grades using deep neural networks. IEEE Access, 2021, 9: 140731–140746

[14]	FitzGerald E, Kucirkova N, Jones A, Cross S, Ferguson R, Herodotou C, Hillaire G, Scanlon E . Dimensions of personalisation in technology-enhanced learning: a framework and implications for design. British Journal of Educational Technology, 2018, 49( 1): 165–181

[15]	Chen S Y, Wang J H . Individual differences and personalized learning: a review and appraisal. Universal Access in the Information Society, 2021, 20( 4): 833–849

[16]	Shemshack A, Spector J M . A systematic literature review of personalized learning terms. Smart Learning Environments, 2020, 7( 1): 33

[17]	Bernacki M L, Greene M J, Lobczowski N G . A systematic review of research on personalized learning: personalized by whom, to what, how, and for what purpose(s)?. Educational Psychology Review, 2021, 33( 4): 1675–1715

[18]	Essa S G, Celik T, Human-Hendricks N E . Personalized adaptive learning technologies based on machine learning techniques to identify learning styles: a systematic literature review. IEEE Access, 2023, 11: 48392–48409

[19]	Zhong L, Wei Y, Yao H, Deng W, Wang Z, Tong M. Review of deep learning-based personalized learning recommendation. In: Proceedings of the 11th International Conference on E-Education, E-Business, E-Management, and E-Learning. 2020, 145−149

[20]	Tian L, Yang B, Yin X, Su Y. A survey of personalized recommendation based on machine learning algorithms. In: Proceedings of the 4th International Conference on Electronic Information Technology and Computer Engineering. 2020, 602−610

[21]	Ren Y, Chi C, Jintao Z. A survey of personalized recommendation algorithm selection based on meta-learning. In: Xu Z, Choo K K R, Dehghantanha A, Parizi R, Hammoudeh M, eds. Cyber Security Intelligence and Analytics. Cham: Springer, 2019, 1383−1388

[22]	Yun Y, Dai H, Zhang Y P, Shang X Q, Li Z H . State-of-the-art survey on personalized learning path recommendation. Journal of Software, 2022, 33( 12): 4590–4615

[23]	Nkambou R, Bourdeau J, Mizoguchi R. Advances in Intelligent Tutoring Systems. Berlin, Heidelberg: Springer, 2010

[24]	Schmid R, Petko D . Does the use of educational technology in personalized learning environments correlate with self-reported digital skills and beliefs of secondary-school students?. Computers & Education, 2019, 136: 75–86

[25]	U.S . Department of Education, Office of Educational Technology. Reimagining the role of technology in education: 2017 national education technology plan update. Washington: U.S. Department of Education, Office of Educational Technology, 2017

[26]	SRI Education. Using technology to personalize learning in K-12 schools. Menlo Park: SRI Education, 2018

[27]	Dewey J. My pedagogic creed. In: Flinders D J, Thornton S J, eds. Curriculum Studies Reader E2. 2nd ed. New York: Routledge, 2005, 29−35

[28]	Dewey J . Democracy in education. The Elementary School Teacher, 1903, 4( 4): 193–204

[29]	Dewey J . Experience and education. The Educational Forum, 1986, 50( 3): 241–252

[30]	Skinner B F. The Behavior of Organisms: An Experimental Analysis. New York: Appleton-Century-Crofts, Inc., 2019

[31]	Watson J B . Psychology as the behaviorist views it. Psychological Review, 1913, 20( 2): 158–177

[32]	Piaget J. The Origins of Intelligence in Children. Cook M, trans. New York: International Universities Press, 1952

[33]	Vygotsky L S, Cole M, John-Steiner V, Scribner S, Souberman E. Mind in Society: The Development of Higher Psychological Processes. Cambridge: Harvard University Press, 1978

[34]	Ebbinghaus H . Memory: a contribution to experimental psychology. Annals of Neurosciences, 2013, 20( 4): 155–156

[35]	Bruner J S. The Process of Education. Cambridge: Harvard University Press, 2009

[36]	Bandura A. Social Foundations of Thought and Action: A Social Cognitive Theory. Englewood Cliffs: Prentice-Hall, 1986

[37]	Vygotsky L. Interaction between learning and development. Linköpings universitet Linköping, Sweden, 2011

[38]	Anderson J R. How Can the Human Mind Occur in the Physical Universe? Oxford: Oxford University Press, 2009

[39]	VanLehn K . The behavior of tutoring systems. International Journal of Artificial Intelligence in Education, 2006, 16( 3): 227–265

[40]	Traub R E . Classical test theory in historical perspective. Educational Measurement: Issues and Practice, 1997, 16( 4): 8–14

[41]	Embretson S E, Reise S P. Item Response Theory. New York: Psychology Press, 2013

[42]	Reckase M D. Multidimensional item response theory models. In: Reckase M D, ed. Multidimensional Item Response Theory. New York, NY: Springer, 2009, 79−112

[43]	Liang Y, Li S, Yan C, Li M, Jiang C . Explaining the black-box model: a survey of local interpretation methods for deep neural networks. Neurocomputing, 2021, 419: 168–182

[44]	Liu Q, Wu R, Chen E, Xu G, Su Y, Chen Z, Hu G . Fuzzy cognitive diagnosis for modelling examinee performance. ACM Transactions on Intelligent Systems and Technology, 2018, 9( 4): 48

[45]	Ma H, Huang Z, Tang W, Zhu H, Zhang H, Li J . Predicting student performance in future exams via neutrosophic cognitive diagnosis in personalized E-learning environment. IEEE Transactions on Learning Technologies, 2023, 16( 5): 680–693

[46]	Zhou Y, Liu Q, Wu J, Wang F, Huang Z, Tong W, Xiong H, Chen E, Ma J. Modeling context-aware features for cognitive diagnosis in student learning. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 2021, 2420−2428

[47]	Guo L, Liu Z, Ma G, Li Q . Multidimensional latent cognitive diagnosis based on rough concept analysis and item response theory. Applied Soft Computing, 2025, 169: 112563

[48]	Wang F, Liu Q, Chen E, Liu C, Huang Z, Wu J, Wang S. Unified uncertainty estimation for cognitive diagnosis models. In: Proceedings of the ACM Web Conference 2024. 2024, 3545−3554

[49]	de la Torre J . Dina model and parameter estimation: a didactic. Journal of Educational and Behavioral Statistics, 2009, 34( 1): 115–130

[50]	Gao W, Liu Q, Huang Z, Yin Y, Bi H, Wang M C, Ma J, Wang S, Su Y. RCD: relation map driven cognitive diagnosis for intelligent education systems. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2021, 501−510

[51]	Ma H, Li M, Wu L, Zhang H, Cao Y, Zhang X, Zhao X. Knowledge-sensed cognitive diagnosis for intelligent education platforms. In: Proceedings of the 31st ACM International Conference on Information & Knowledge Management. 2022, 1451−1460

[52]	Wang X, Huang C, Cai J, Chen L. Using knowledge concept aggregation towards accurate cognitive diagnosis. In: Proceedings of the 30th ACM International Conference on Information & Knowledge Management. 2021, 2010−2019

[53]	Wang S, Zeng Z, Yang X, Zhang X. Self-supervised graph learning for long-tailed cognitive diagnosis. In: Proceedings of the 37th AAAI Conference on Artificial Intelligence. 2023, 110−118

[54]	Song L, He M, Shang X, Yang C, Liu J, Yu M, Lu Y . A deep cross-modal neural cognitive diagnosis framework for modeling student performance. Expert Systems with Applications, 2023, 230: 120675

[55]	Gao W, Wang H, Liu Q, Wang F, Lin X, Yue L, Zhang Z, Lv R, Wang S. Leveraging transferable knowledge concept graph embedding for cold-start cognitive diagnosis. In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2023, 983−992

[56]	Yang H, Qi T, Li J, Guo L, Ren M, Zhang L, Wang X . A novel quantitative relationship neural network for explainable cognitive diagnosis model. Knowledge-Based Systems, 2022, 250: 109156

[57]	Li S, Guan Q, Fang L, Xiao F, He Z, He Y, Luo W. Cognitive diagnosis focusing on knowledge concepts. In: Proceedings of the 31st ACM International Conference on Information & Knowledge Management. 2022, 3272−3281

[58]	Yao F, Liu Q, Hou M, Tong S, Huang Z, Chen E, Sha J, Wang S. Exploiting non-interactive exercises in cognitive diagnosis. In: Proceedings of the 32nd International Joint Conference on Artificial Intelligence. 2023, 2397−2405

[59]	Su Y, Cheng Z, Wu J, Dong Y, Huang Z, Wu L, Chen E, Wang S, Xie F . Graph-based cognitive diagnosis for intelligent tutoring systems. Knowledge-Based Systems, 2022, 253: 109547

[60]	Pei X, Yang S, Huang J, Xu C. Self-attention gated cognitive diagnosis for faster adaptive educational assessments. In: Proceedings of the IEEE International Conference on Data Mining. 2022, 408−417

[61]	Liu S, Yu X, Ma H, Wang Z, Qin C, Zhang X. Homogeneous cohort-aware group cognitive diagnosis: a multi-grained modeling perspective. In: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management. 2023, 4094−4098

[62]	Shen J, Qian H, Zhang W, Zhou A. Symbolic cognitive diagnosis via hybrid optimization for intelligent education systems. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence. 2024, 14928−14936

[63]	Huang T, Geng J, Yang H, Hu S, Ou X, Hu J, Yang Z . Interpretable neuro-cognitive diagnostic approach incorporating multidimensional features. Knowledge-Based Systems, 2024, 304: 112432

[64]	Gao W, Liu Q, Wang H, Yue L, Bi H, Gu Y, Yao F, Zhang Z, Li X, He Y. Zero-1-to-3: domain-level zero-shot cognitive diagnosis via one batch of early-bird students towards three diagnostic objectives. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence. 2024, 8417−8426

[65]	Ma H, Wang C, Zhu H, Yang S, Zhang X, Zhang X. Enhancing cognitive diagnosis using un-interacted exercises: a collaboration-aware mixed sampling approach. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence. 2024, 8877−8885

[66]	Li J, Liu Q, Wang F, Liu J, Huang Z, Yao F, Zhu L, Su Y. Towards the identifiability and explainability for personalized learner modeling: an inductive paradigm. In: Proceedings of the ACM Web Conference 2024. 2024, 3420−3431

[67]	Yu X, Qin C, Shen D, Ma H, Zhang L, Zhang X, Zhu H, Xiong H . RDGT: enhancing group cognitive diagnosis with relation-guided dual-side graph transformer. IEEE Transactions on Knowledge and Data Engineering, 2024, 36( 7): 3429–3442

[68]	Liu S, Shen J, Qian H, Zhou A. Inductive cognitive diagnosis for fast student learning in web-based intelligent education systems. In: Proceedings of the ACM Web Conference 2024. 2024, 4260−4271

[69]	Ma H, Song S, Qin C, Yu X, Zhang L, Zhang X, Zhu H. DGCD: an adaptive denoising GNN for group-level cognitive diagnosis. In: Proceedings of the 33rd International Joint Conference on Artificial Intelligence. 2024, 2261−2269

[70]	Qian H, Liu S, Li M, Li B, Liu Z, Zhou A. ORCDF: an oversmoothing-resistant cognitive diagnosis framework for student learning in online education systems. In: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2024, 2455−2466

[71]	Lin Y, Gong Y J, Zhang J . An adaptive ant colony optimization algorithm for constructing cognitive diagnosis tests. Applied Soft Computing, 2017, 52: 1–13

[72]	Zhu T, Liu Q, Huang Z, Chen E, Lian D, Su Y, Hu G. MT-MCD: a multi-task cognitive diagnosis framework for student assessment. In: Proceedings of the 23rd International Conference on Database Systems for Advanced Applications. 2018, 318−335

[73]	Liu S, Qian H, Li M, Zhou A. QCCDM: a Q-augmented causal cognitive diagnosis model for student learning. In: Proceedings of the 26th European Conference on Artificial Intelligence. 2023, 1536−1543

[74]	Bi H, Chen E, He W, Wu H, Zhao W, Wang S, Wu J. BETA-CD: a Bayesian meta-learned cognitive diagnosis framework for personalized learning. In: Proceedings of the 37th AAAI Conference on Artificial Intelligence. 2023, 5018−5026

[75]	Li J, Wang F, Liu Q, Zhu M, Huang W, Huang Z, Chen E, Su Y, Wang S. HierCDF: a Bayesian network-based hierarchical cognitive diagnosis framework. In: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2022, 904−913

[76]	Tong S, Liu Q, Yu R, Huang W, Huang Z, Pardos Z A, Jiang W. Item response ranking for cognitive diagnosis. In: Proceedings of the 30th International Joint Conference on Artificial Intelligence. 2021, 1750−1756

[77]	Zhang D, Zhang K, Wu L, Tian M, Hong R, Wang M. Path-specific causal reasoning for fairness-aware cognitive diagnosis. In: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2024, 4143−4154

[78]	Cheng S, Liu Q, Chen E, Huang Z, Huang Z, Chen Y, Ma H, Hu G. DIRT: deep learning enhanced item response theory for cognitive diagnosis. In: Proceedings of the 28th ACM International Conference on Information and Knowledge Management. 2019, 2397−2400

[79]	Huang J, Liu Q, Wang F, Huang Z, Fang S, Wu R, Chen E, Su Y, Wang S. Group-level cognitive diagnosis: a multi-task learning perspective. In: Proceedings of the IEEE International Conference on Data Mining. 2021, 210−219

[80]	Wang F, Liu Q, Chen E, Huang Z, Chen Y, Yin Y, Huang Z, Wang S. Neural cognitive diagnosis for intelligent education systems. In: Proceedings of the 34th AAAI Conference on Artificial Intelligence. 2020, 6153−6161

[81]	Wang F, Liu Q, Chen E, Huang Z, Yin Y, Wang S, Su Y . NeuralCD: a general framework for cognitive diagnosis. IEEE Transactions on Knowledge and Data Engineering, 2023, 35( 8): 8312–8327

[82]	Zhou J, Wu Z, Yuan C, Zeng L. A causality-based interpretable cognitive diagnosis model. In: Proceedings of the 30th International Conference on Neural Information Processing. 2024, 214−226

[83]	Zhang Z, Wu L, Liu Q, Liu J, Huang Z, Yin Y, Zhuang Y, Gao W, Chen E . Understanding and improving fairness in cognitive diagnosis. Science China Information Sciences, 2024, 67( 5): 152106

[84]	Wang S, Zeng Z, Yang X, Xu K, Zhang X. Boosting neural cognitive diagnosis with student’s affective state modeling. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence. 2024, 620−627

[85]	Li X, Guo S, Wu J, Zheng C . An interpretable polytomous cognitive diagnosis framework for predicting examinee performance. Information Processing & Management, 2025, 62( 1): 103913

[86]	Su Y, Shen S, Zhu L, Wu L, Huang Z, Cheng Z, Liu Q, Wang S . Global and local neural cognitive modeling for student performance prediction. Expert Systems with Applications, 2024, 237: 121637

[87]	Wang L, Li X, Luo Z, Hu Z, Yan Q . Multivariate cognitive response framework for student performance prediction on MOOC. IEEE Transactions on Knowledge and Data Engineering, 2024, 36( 3): 1221–1233

[88]	DiBello L V, Roussos L A, Stout W . 31A review of cognitively diagnostic assessment and a summary of psychometric models. Handbook of Statistics, 2006, 26: 979–1030

[89]	Liu Y, Zhang T, Wang X, Yu G, Li T . New development of cognitive diagnosis models. Frontiers of Computer Science, 2023, 17( 1): 171604

[90]	Wang F, Gao W, Liu Q, Li J, Zhao G, Zhang Z, Huang Z, Zhu M, Wang S, Tong W, Chen E. A survey of models for cognitive diagnosis: new developments and future directions. 2024, arXiv preprint arXiv: 2407.05458

[91]	Romero C, Ventura S . Educational data mining: a review of the state of the art. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 2010, 40( 6): 601–618

[92]	Huang R, Liu Z, Zi D, Huang Q, Pan S . A multi-level remedial teaching design based on cognitive diagnostic assessment: taking the electromagnetic induction as an example. Frontiers in Psychology, 2022, 13: 851378

[93]	You X, Li M, Xiao Y, Liu H . The feedback of the Chinese learning diagnosis system for personalized learning in classrooms. Frontiers in Psychology, 2019, 10: 1751

[94]	Fan Z, Gou J, Weng S . A feature importance-based multi-layer CatBoost for student performance prediction. IEEE Transactions on Knowledge and Data Engineering, 2024, 36( 11): 5495–5507

[95]	Khairy D, Alharbi N, Amasha M A, Areed M F, Alkhalaf S, Abougalala R A . Prediction of student exam performance using data mining classification algorithms. Education and Information Technologies, 2024, 29( 16): 21621–21645

[96]	Zhang X, Xue R, Liu B, Lu W, Zhang Y. Grade prediction of student academic performance with multiple classification models. In: Proceedings of the 14th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery. 2018, 1086−1090

[97]	Yağcı M . Educational data mining: prediction of students’ academic performance using machine learning algorithms. Smart Learning Environments, 2022, 9( 1): 11

[98]	Akçapınar G, Altun A, Aşkar P . Using learning analytics to develop early-warning system for at-risk students. International Journal of Educational Technology in Higher Education, 2019, 16( 1): 40

[99]	Anantharaman H, Mubarak A, Shobana B T. Modelling an adaptive e-learning system using LSTM and random forest classification. In: Proceedings of the IEEE Conference on e-Learning, e-Management and e-Services. 2018, 29−34

[100]

Bernard J, Popescu E, Graf S . Improving online education through automatic learning style identification using a multi-step architecture with ant colony system and artificial neural networks. Applied Soft Computing, 2022, 131: 109779

[101]

Bernard J, Chang T W, Popescu E, Graf S . Learning style identifier: improving the precision of learning style identification through computational intelligence algorithms. Expert Systems with Applications, 2017, 75: 94–108

[102]

El Aissaoui O, El Madani Y E A, Oughdir L, El Allioui Y . Combining supervised and unsupervised machine learning algorithms to predict the learners’ learning styles. Procedia Computer Science, 2019, 148: 87–96

[103]

El Aissaoui O, El Alami Y E M, Oughdir L, El Allioui Y. Integrating web usage mining for an automatic learner profile detection: a learning styles-based approach. In: Proceedings of the International Conference on Intelligent Systems and Computer Vision. 2018, 1−6

[104]

Gomede E, de Barros R M, de Souza Mendes L . Use of deep multi-target prediction to identify learning styles. Applied Sciences, 2020, 10( 5): 1756

[105]

Hidayat A, Adi K, Surarso B. Determine Felder Silverman learning style model using literature based and K-means clustering. In: Proceedings of the 6th International Conference on Informatics and Computing. 2021, 1−6

[106]

Hmedna B, El Mezouary A, Baz O . A predictive model for the identification of learning styles in MOOC environments. Cluster Computing, 2020, 23( 2): 1303–1328

[107]

Hmedna B, El Mezouary A, Baz O, Mammass D . Identifying and tracking learning styles in MOOCs: a neural networks approach. International Journal of Innovation and Applied Studies, 2017, 19( 2): 267–275

[108]

Kolekar S V, Pai R M, M M M P . Prediction of learner’s profile based on learning styles in adaptive e-learning system. International Journal of Emerging Technologies in Learning, 2017, 12( 6): 31–51

[109]

Rishard M A M, Jayasekara S L, Ekanayake E M P U, Wickramathilake K M J S, Reyal S, Manathunga K, Wickramarathne J. Adaptivo: a personalized adaptive e-learning system based on learning styles and prior knowledge. In: Proceedings of the 7th International Conference on Informatics and Computing. 2022, 1−9

[110]

Domínguez L G I, Robles-Gómez A, Pastor-Vargas R . A data-driven approach to engineering instruction: exploring learning styles, study habits, and machine learning. IEEE Access, 2025, 13: 10978–11002

[111]

Muhammad B A, Qi C, Wu Z, Ahmad H K . An evolving learning style detection approach for online education using bipartite graph embedding. Applied Soft Computing, 2024, 152: 111230

[112]

Jebbari M, Cherradi B, Hamida S, Raihani A . Identifying learning styles in MOOCs environment through machine learning predictive modeling. Education and Information Technologies, 2024, 29( 16): 20977–21014

[113]

Ayyoub H Y, Al-Kadi O S . Learning style identification using semisupervised self-taught labeling. IEEE Transactions on Learning Technologies, 2024, 17: 1093–1106

[114]

Kuttattu A S, Gokul G S, Prasad H, Murali J, Nair L S. Analysing the learning style of an individual and suggesting field of study using machine learning techniques. In: Proceedings of the International Conference on Communication and Electronics Systems. 2019, 1671−1675

[115]

Nguyen H H, Do Trung K, Duc L N, Hoang L D, Ba P T, Nguyen V A . A model to create a personalized online course based on the student’s learning styles. Education and Information Technologies, 2024, 29( 1): 571–593

[116]

Abomelha F, Newbury P . A VARK learning style-based recommendation system for adaptive e-learning. Annals of Computer Science and Information Systems, 2024, 41: 1–8

[117]

Rasheed F, Wahid A . Learning style detection in E-learning systems using machine learning techniques. Expert Systems with Applications, 2021, 174: 114774

[118]

Ni Q, Mi Y, Wu Y, He L, Xu Y, Zhang B . Design and implementation of the reliable learning style recognition mechanism based on fusion labels and ensemble classification. IEEE Transactions on Learning Technologies, 2024, 17: 241–257

[119]

Hidalgo M, Rodriguez K, Astudillo H. Mapping Kolb’s learning style to roles in software development team. In: Proceedings of the L Latin American Computer Conference. 2024, 1−9

[120]

Coffield F, Moseley D, Hall E, Ecclestone K. Learning styles and pedagogy in post-16 learning: a systematic and critical review. London: Learning and Skills Research Centre, 2004

[121]

Pashler H, McDaniel M, Rohrer D, Bjork R . Learning styles: concepts and evidence. Psychological Science in the Public Interest, 2008, 9( 3): 105–119

[122]

Clinton-Lisell V, Litzinger C . Is it really a neuromyth? A meta-analysis of the learning styles matching hypothesis. Frontiers in Psychology, 2024, 15: 1428732

[123]

El-Sabagh H A . Adaptive e-learning environment based on learning styles and its impact on development students’ engagement. International Journal of Educational Technology in Higher Education, 2021, 18( 1): 53

[124]

Oviedo Ramirez D C, Ramirez Salazar D A, Valderrama Muñoz A M, Quiroz Betancur L M, Fletscher L . Exploring the relationship between learning styles and digital educational resources in adaptive learning systems. Education Sciences, 2025, 15( 8): 1075

[125]

Chanaa A, El Faddouli N E . Sentiment analysis on massive open online courses (MOOCs): multi-factor analysis, and machine learning approach. International Journal of Information and Communication Technology Education, 2022, 18( 1): 1–22

[126]

Hew K F, Hu X, Qiao C, Tang Y . What predicts student satisfaction with MOOCs: a gradient boosting trees supervised machine learning and sentiment analysis approach. Computers & Education, 2020, 145: 103724

[127]

Hixson T . Reactions vs. reality: using sentiment analysis to measure university students’ responses to learning ArcGIS. Journal of Map & Geography Libraries, 2019, 15( 2−3): 263–276

[128]

Jena R K . Sentiment mining in a collaborative learning environment: capitalising on big data. Behaviour & Information Technology, 2019, 38( 9): 986–1001

[129]

Liu Z, Liu S, Liu L, Sun J, Peng X, Wang T . Sentiment recognition of online course reviews using multi-swarm optimization-based selected features. Neurocomputing, 2016, 185: 11–20

[130]

Kasumba R, Neumman M. Practical sentiment analysis for education: the power of student crowdsourcing. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence. 2024, 23110−23118

[131]

Dake D K, Gyimah E . Using sentiment analysis to evaluate qualitative students’ responses. Education and Information Technologies, 2023, 28( 4): 4629–4647

[132]

Kastrati Z, Imran A S, Kurti A . Weakly supervised framework for aspect-based sentiment analysis on students’ reviews of MOOCs. IEEE Access, 2020, 8: 106799–106810

[133]

Li X, Zhang H, Ouyang Y, Zhang X, Rong W. A shallow BERT-CNN model for sentiment analysis on MOOCs comments. In: Proceedings of the IEEE International Conference on Engineering, Technology and Education. 2019, 1−6

[134]

Sindhu I, Daudpota S M, Badar K, Bakhtyar M, Baber J, Nurunnabi M . Aspect-based opinion mining on student’s feedback for faculty teaching performance evaluation. IEEE Access, 2019, 7: 108729–108741

[135]

Yu L C, Lee C W, Pan H I, Chou C Y, Chao P Y, Chen Z H, Tseng S F, Chan C L, Lai K R . Improving early prediction of academic failure using sentiment analysis on self-evaluated comments. Journal of Computer Assisted Learning, 2018, 34( 4): 358–365

[136]

Ren P, Yang L, Luo F . Automatic scoring of student feedback for teaching evaluation based on aspect-level sentiment analysis. Education and Information Technologies, 2023, 28( 1): 797–814

[137]

Shaik T, Tao X, Dann C, Quadrelli C, Li Y, O’Neill S. Educational decision support system adopting sentiment analysis on student feedback. In: Proceedings of the IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology. 2022, 377−383

[138]

Liu T, Hu W, Liu F, Li Y. Sentiment analysis for MOOC course reviews. In: Proceedings of Data Science—7th International Conference of Pioneering Computer Scientists, Engineers and Educators. 2021, 78−87

[139]

Mrhar K, Benhiba L, Bourekkache S, Abik M . A Bayesian CNN-LSTM model for sentiment analysis in massive open online courses MOOCs. International Journal of Emerging Technologies in Learning, 2021, 16( 23): 216–232

[140]

Estrada M L B, Cabada R Z, Bustillos R O, Graff M . Opinion mining and emotion recognition applied to learning environments. Expert Systems with Applications, 2020, 150: 113265

[141]

Baqach A, Battou A . A new sentiment analysis model to classify students’ reviews on MOOCs. Education and Information Technologies, 2024, 29( 13): 16813–16840

[142]

Ashwin T S, Biswas G. Identifying and mitigating algorithmic bias in student emotional analysis. In: Proceedings of the 25th International Conference on Artificial Intelligence in Education. 2024, 89−103

[143]

Shaikh S, Daudpota S M, Yayilgan S Y, Sindhu S. Exploring the potential of large-language models (LLMs) for student feedback sentiment analysis. In: Proceedings of the International Conference on Frontiers of Information Technology. 2023, 214−219

[144]

Liu Z, Yang C, Rüdian S, Liu S, Zhao L, Wang T . Temporal emotion-aspect modeling for discovering what students are concerned about in online course forums. Interactive Learning Environments, 2019, 27( 5−6): 598–627

[145]

Onan A . Mining opinions from instructor evaluation reviews: a deep learning approach. Computer Applications in Engineering Education, 2020, 28( 1): 117–138

[146]

Pong-Inwong C, Songpan W . Sentiment analysis in teaching evaluations using sentiment phrase pattern matching (SPPM) based on association mining. International Journal of Machine Learning and Cybernetics, 2019, 10( 8): 2177–2186

[147]

Rani S, Kumar P . A sentiment analysis system to improve teaching and learning. Computer, 2017, 50( 5): 36–43

[148]

Tubishat M, Al-Obeidat F, Shuhaiber A. Sentiment analysis of using ChatGPT in education. In: Proceedings of the International Conference on Smart Applications, Communications and Networking. 2023, 1−7

[149]

Moreno-Marcos P M, Alario-Hoyos C, Munoz-Merino P J, Estevez-Ayres I, Kloos C D . A learning analytics methodology for understanding social interactions in MOOCs. IEEE Transactions on Learning Technologies, 2019, 12( 4): 442–455

[150]

Chen X, Wang F L, Cheng G, Chow M K, Xie H . Understanding learners’ perception of MOOCs based on review data analysis using deep learning and sentiment analysis. Future Internet, 2022, 14( 8): 218

[151]

Li L, Johnson J, Aarhus W, Shah D . Key factors in MOOC pedagogy based on NLP sentiment analysis of learner reviews: what makes a hit. Computers & Education, 2022, 176: 104354

[152]

Yadegaridehkordi E, Noor N F B M, Ayub M N B, Affal H B, Hussin N B . Affective computing in education: a systematic review and future research. Computers & Education, 2019, 142: 103649

[153]

Bao G, Mei Y, Li G, Wang G. Analysis of students behavior characteristics based on K-mediods + Eclat. In: Proceedings of the 24th International Conference on Computer Supported Cooperative Work in Design. 2021, 1239−1244

[154]

Delgado S, Morán F, San José J C, Burgos D . Analysis of students’ behavior through user clustering in online learning settings, based on self organizing maps neural networks. IEEE Access, 2021, 9: 132592–132608

[155]

Li X, Zhang Y, Cheng H, Zhou F, Yin B . An unsupervised ensemble clustering approach for the analysis of student behavioral patterns. IEEE Access, 2021, 9: 7076–7091

[156]

Shen X, Yuan C . A college student behavior analysis and management method based on machine learning technology. Wireless Communications and Mobile Computing, 2021, 2021: 3126347

[157]

Shi D, Zhou J, Wu F, Wang D, Yang D, Pan Q . Characteristics of students’ learning behavior preferences—an analysis of self-commentary data based on the LDA model. Journal of Intelligent & Fuzzy Systems, 2024, 46( 2): 4495–4509

[158]

Wang T, Xiao B, Ma W . Student behavior data analysis based on association rule mining. International Journal of Computational Intelligence Systems, 2022, 15( 1): 32

[159]

Chen J, Fan F, Jia H, Xu Y, Dong H, Huang X, Li J, Zhang Z. Student behavior analysis and performance prediction based on blended learning data. In: Proceedings of the 15th International Conference on Knowledge Science, Engineering and Management. 2022, 597−609

[160]

Li Y, Li D. University students’ behavior characteristics analysis and prediction method based on combined data mining model. In: Proceedings of the 3rd International Conference on Computers in Management and Business. 2020, 9−13

[161]

Shi Y, Sun F, Zuo H, Peng F . Analysis of learning behavior characteristics and prediction of learning effect for improving college students’ information literacy based on machine learning. IEEE Access, 2023, 11: 50447–50461

[162]

Villalobos E, Pérez-Sanagustín M, Broisin J. From learning actions to dynamics: characterizing students’ individual temporal behavior with sequence analysis. In: Proceedings of the 25th International Conference on Artificial Intelligence in Education. 2024, 3−17

[163]

Abhirami K, Devi M K K . Student behavior modeling for an e-learning system offering personalized learning experiences. Computer Systems Science and Engineering, 2022, 40( 3): 1127–1144

[164]

Cantabella M, Martínez-España R, Ayuso B, Yáñez J A, Muñoz A . Analysis of student behavior in learning management systems through a Big Data framework. Future Generation Computer Systems, 2019, 90: 262–272

[165]

Liu S, Hu Z, Peng X, Liu Z, Cheng H N H, Sun J . Mining learning behavioral patterns of students by sequence analysis in cloud classroom. International Journal of Distance Education Technologies, 2017, 15( 1): 15–27

[166]

Kamberovic M, Krivic S, Delic A, Szedmak S, Ljubovic V. Personalized learning systems for computer science students: analyzing and predicting learning behaviors using programming error data. In: Proceedings of the 31st ACM Conference on User Modeling, Adaptation and Personalization. 2023, 86−91

[167]

Yu Z, Xie M, Gao J, Liu T, Fu Y. From raw video to pedagogical insights: a unified framework for student behavior analysis. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence. 2024, 23241−23249

[168]

Appleton J J, Christenson S L, Furlong M J . Student engagement with school: critical conceptual and methodological issues of the construct. Psychology in the Schools, 2008, 45( 5): 369–386

[169]

Baker R S, Martin T, Rossi L M. Educational data mining and learning analytics. In: Rupp A A, Leighton J P, eds. The Wiley Handbook of Cognition and Assessment: Frameworks, Methodologies, and Applications. Chichester: John Wiley & Sons, Inc., 2017, 379−396

[170]

Feng G, Fan M, Chen Y . Analysis and prediction of students’ academic performance based on educational data mining. IEEE Access, 2022, 10: 19558–19571

[171]

Verma U, Garg C, Bhushan M, Samant P, Kumar A, Negi A. Prediction of students’ academic performance using machine learning techniques. In: Proceedings of the International Mobile and Embedded Technology Conference. 2022, 151−156

[172]

Ni Q, Zhu Y, Zhang L, Lu X, Zhang L . Leverage learning behaviour data for students’ learning performance prediction and influence factor analysis. IEEE Transactions on Artificial Intelligence, 2024, 5( 5): 2422–2433

[173]

Xu X, Wang J, Peng H, Wu R . Prediction of academic performance associated with internet usage behaviors using machine learning algorithms. Computers in Human Behavior, 2019, 98: 166–173

[174]

Priyambada S A, Usagawa T, Mahendrawathi E R . Two-layer ensemble prediction of students’ performance using learning behavior and domain knowledge. Computers and Education: Artificial Intelligence, 2023, 5: 100149

[175]

Asselman A, Khaldi M, Aammou S . Enhancing the prediction of student performance based on the machine learning XGBoost algorithm. Interactive Learning Environments, 2023, 31( 6): 3360–3379

[176]

Joshi A, Saggar P, Jain R, Sharma M, Gupta D, Khanna A . CatBoost—An ensemble machine learning model for prediction and classification of student academic performance. Advances in Data Science and Adaptive Analysis, 2021, 13( 3−4): 2141002

[177]

Kukkar A, Mohana R, Sharma A, Nayyar A . Prediction of student academic performance based on their emotional wellbeing and interaction on various e-learning platforms. Education and Information Technologies, 2023, 28( 8): 9655–9684

[178]

Xu J, Han Y, Marcu D, van der Schaar M. Progressive prediction of student performance in college programs. In: Proceedings of the 31st AAAI Conference on Artificial Intelligence. 2017, 1604−1610

[179]

Fan Z, Gou J, Weng S . Complementary CatBoost based on residual error for student performance prediction. Pattern Recognition, 2025, 161: 111265

[180]

Baig M A, Shaikh S A, Khatri K K, Shaikh M A, Khan M Z, Rauf M A . Prediction of students performance level using integrated approach of ML algorithms. International Journal of Emerging Technologies in Learning, 2023, 18( 1): 216–234

[181]

Giannakas F, Troussas C, Voyiatzis I, Sgouropoulou C . A deep learning classification framework for early prediction of team-based academic performance. Applied Soft Computing, 2021, 106: 107355

[182]

Kusumawardani S S, Alfarozi S A I . Transformer encoder model for sequential prediction of student performance based on their log activities. IEEE Access, 2023, 11: 18960–18971

[183]

Yang Y, Shi J, Li M, Fujita H . Framelet-based dual hypergraph neural networks for student performance prediction. International Journal of Machine Learning and Cybernetics, 2024, 15( 9): 3863–3877

[184]

Oh C, Park M, Lim S, Song K . Language model-guided student performance prediction with multimodal auxiliary information. Expert Systems with Applications, 2024, 250: 123960

[185]

Ni L, Wang S, Zhang Z, Li X, Zheng X, Denny P, Liu J. Enhancing student performance prediction on learnersourced questions with SGNN-LLM synergy. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence. 2024, 23232−23240

[186]

Hlosta M, Herodotou C, Papathoma T, Gillespie A, Bergamin P . Predictive learning analytics in online education: a deeper understanding through explaining algorithmic errors. Computers and Education: Artificial Intelligence, 2022, 3: 100108

[187]

Cobos R . Self-regulated learning and active feedback of MOOC learners supported by the intervention strategy of a learning analytics system. Electronics, 2023, 12( 15): 3368

[188]

Huang H, Jew L, Qi D . Take a MOOC and then drop: a systematic review of MOOC engagement pattern and dropout factor. Heliyon, 2023, 9( 4): e15220

[189]

Urdaneta-Ponte M C, Mendez-Zorrilla A, Oleagordia-Ruiz I . Recommendation systems for education: systematic review. Electronics, 2021, 10( 14): 1611

[190]

da Silva F L, Slodkowski B K, da Silva K K A, Cazella S C . A systematic literature review on educational recommender systems for teaching and learning: research trends, limitations and opportunities. Education and Information Technologies, 2023, 28( 3): 3289–3328

[191]

Bin Q, Zuhairi M F, Morcos J . A comprehensive study on personalized learning recommendation in e-learning system. IEEE Access, 2024, 12: 100446–100482

[192]

G M D, Goudar R H, Kulkarni A A, Rathod V N, Hukkeri G S . A digital recommendation system for personalized learning to enhance online education: a review. IEEE Access, 2024, 12: 34019–34041

[193]

Krauss C, Salzmann A, Merceron A. Branched learning paths for the recommendation of personalized sequences of course items. In: Proceedings of the DeLFI Workshops. 2018

[194]

Zhou Y, Huang C, Hu Q, Zhu J, Tang Y . Personalized learning full-path recommendation model based on LSTM neural networks. Information Sciences, 2018, 444: 135–152

[195]

Supic H. Case-based reasoning model for personalized learning path recommendation in example-based learning activities. In: Proceedings of the 27th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises. 2018, 175−178

[196]

Li W, Zhang L. Personalized learning path generation based on network embedding and learning effects. In: Proceedings of the 10th International Conference on Software Engineering and Service Science. 2019, 316−319

[197]

Mansur A B F, Yusof N, Basori A H . Personalized learning model based on deep learning algorithm for student behaviour analytic. Procedia Computer Science, 2019, 163: 125–133

[198]

Nabizadeh A H, Jorge A M, Leal J P . Estimating time and score uncertainty in generating successful learning paths under time constraints. Expert Systems, 2019, 36( 2): e12351

[199]

Vanitha V, Krishnan P, Elakkiya R . Collaborative optimization algorithm for learning path construction in E-learning. Computers & Electrical Engineering, 2019, 77: 325–338

[200]

Xia M, Sun M, Wei H, Chen Q, Wang Y, Shi L, Qu H, Ma X. PeerLens: peer-inspired interactive learning path planning in online question pool. In: Proceedings of 2019 CHI Conference on Human Factors in Computing Systems. 2019, 634

[201]

Liu H, Li X . Learning path combination recommendation based on the learning networks. Soft Computing, 2020, 24( 6): 4427–4439

[202]

Liu Z, Dong L, Wu C . Research on personalized recommendations for students’ learning paths based on big data. International Journal of Emerging Technologies in Learning, 2020, 15( 8): 40–56

[203]

Shi D, Wang T, Xing H, Xu H . A learning path recommendation model based on a multidimensional knowledge graph framework for e-learning. Knowledge-Based Systems, 2020, 195: 105618

[204]

Sun Y, Liang J, Niu P. Personalized recommendation of English learning based on knowledge graph and graph convolutional network. In: Proceedings of the 7th International Conference on Artificial Intelligence and Security. 2021, 157−166

[205]

Li X, Xu H, Zhang J, Chang H H . Optimal hierarchical learning path design with reinforcement learning. Applied Psychological Measurement, 2021, 45( 1): 54–70

[206]

Chen Z, Wu Z, Tang Y, Zhou J. TGKT-based personalized learning path recommendation with reinforcement learning. In: Proceedings of the 16th International Conference on Knowledge Science, Engineering and Management. 2023, 332−346

[207]

Krahn T, Kuo R, Chang M. Personalized study guide: a Moodle plug-in generating personal learning path for students. In: Proceedings of the 19th International Conference on Augmented Intelligence and Intelligent Tutoring Systems. 2023, 333−341

[208]

Li Q, Xia W, Yin L, Shen J, Rui R, Zhang W, Chen X, Tang R, Yu Y. Graph enhanced hierarchical reinforcement learning for goal-oriented learning path recommendation. In: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management. 2023, 1318−1327

[209]

Zhang X, Liu S, Wang H . Personalized learning path recommendation for e-learning based on knowledge graph and graph convolutional network. International Journal of Software Engineering and Knowledge Engineering, 2023, 33( 1): 109–131

[210]

Li T, Wang X, Zhang S, Yang F, Lu W. A personalized learning path recommendation method for learning objects with diverse coverage levels. In: Proceedings of the 24th International Conference on Artificial Intelligence in Education. 2023, 714−719

[211]

Frej J, Shah N, Knezevic M, Nazaretsky T, Käser T. Finding paths for explainable MOOC recommendation: a learner perspective. In: Proceedings of the 14th Learning Analytics and Knowledge Conference. 2024, 426−437

[212]

Abu-Rasheed H, Weber C, Fathi M. Knowledge graphs as context sources for LLM-based explanations of learning recommendations. In: Proceedings of the IEEE Global Engineering Education Conference. 2024, 1−5

[213]

Yekollu R K, Bhimraj Ghuge T, Sunil Biradar S, Haldikar S V, Farook Mohideen Abdul Kader O. AI-driven personalized learning paths: enhancing education through adaptive systems. In: Asokan R, Ruiz D P, Piramuthu S, eds. Smart Data Intelligence. Singapore: Springer, 2024, 507−517

[214]

Xiao Z, Hu W, Huang X. Highlighting learning pathways for enhanced course recommendation using large language models. In: Proceedings of the 10th International Conference on Systems and Informatics. 2024, 1−8

[215]

Raj N S, Renumol V G . An improved adaptive learning path recommendation model driven by real-time learning analytics. Journal of Computers in Education, 2024, 11( 1): 121–148

[216]

Ng C, Fung Y. Educational personalized learning path planning with large language models. 2024, arXiv preprint arXiv: 2407.11773

[217]

Ouyang Z . Self-regulated learning and engagement as serial mediators between AI-driven adaptive learning platform characteristics and educational quality: a psychological mechanism analysis. Frontiers in Psychology, 2025, 16: 1646469

[218]

Hariyanto , Kristianingsih F X D, Maharani R . Artificial intelligence in adaptive education: a systematic review of techniques for personalized learning. Discover Education, 2025, 4( 1): 458

[219]

du Plooy E, Casteleijn D, Franzsen D . Personalized adaptive learning in higher education: a scoping review of key characteristics and impact on academic performance and engagement. Heliyon, 2024, 10( 21): e39630

[220]

Hou Y, Zhou P, Xu J, Wu D O. Course recommendation of MOOC with big data support: a contextual online learning approach. In: Proceedings of the IEEE Conference on Computer Communications Workshops. 2018, 106−111

[221]

Zhang H, Huang T, Lv Z, Liu S, Zhou Z . MCRS: a course recommendation system for MOOCs. Multimedia Tools and Applications, 2018, 77( 6): 7051–7069

[222]

Symeonidis P, Malakoudis D . Multi-modal matrix factorization with side information for recommending massive open online courses. Expert Systems with Applications, 2019, 118: 261–271

[223]

Yang X, Jiang W. Dynamic online course recommendation based on course network and user network. In: Proceedings of the 7th International Conference on Smart City and Informatization. 2019, 180−196

[224]

Zhang J, Hao B, Chen B, Li C, Chen H, Sun J. Hierarchical reinforcement learning for course recommendation in MOOCs. In: Proceedings of the 33rd AAAI Conference on Artificial Intelligence. 2019, 435−442

[225]

Zhu Y, Lu H, Qiu P, Shi K, Chambua J, Niu Z . Heterogeneous teaching evaluation network based offline course recommendation with graph learning and tensor factorization. Neurocomputing, 2020, 415: 84–95

[226]

Tan J, Chang L, Liu T, Zhao X. Attentional autoencoder for course recommendation in MOOC with course relevance. In: Proceedings of the International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery. 2020, 190−196

[227]

Ma D, Wang Y, Chen M, Shen J. SRACR: semantic and relationship-aware online course recommendation. In: Proceedings of the IEEE International Conference on Engineering, Technology & Education. 2021, 367−374

[228]

Tian X, Liu F . Capacity tracing-enhanced course recommendation in MOOCs. IEEE Transactions on Learning Technologies, 2021, 14( 3): 313–321

[229]

Xu G, Jia G, Shi L, Zhang Z . Personalized course recommendation system fusing with knowledge graph and collaborative filtering. Computational Intelligence and Neuroscience, 2021, 2021: 9590502

[230]

Zhao Z, Yang Y, Li C, Nie L . GuessUNeed: recommending courses via neural attention network and course prerequisite relation embeddings. ACM Transactions on Multimedia Computing, Communications, and Applications, 2020, 16( 4): 132

[231]

Ban Q, Wu W, Hu W, Lin H, Zheng W, He L. Knowledge-enhanced multi-task learning for course recommendation. In: Proceedings of the 27th International Conference on Database Systems for Advanced Applications. 2022, 85−101

[232]

Chen W, Ma W, Jiang Y, Fan X. GADN: GCN-based attentive decay network for course recommendation. In: Proceedings of the 15th International Conference on Knowledge Science, Engineering and Management. 2022, 529−541

[233]

Jung H, Jang Y, Kim S, Kim H. KPCR: knowledge graph enhanced personalized course recommendation. In: Proceedings of the 34th Australasian Joint Conference on Artificial Intelligence. 2022, 739−750

[234]

Lin Y, Lin F, Yang L, Zeng W, Liu Y, Wu P . Context-aware reinforcement learning for course recommendation. Applied Soft Computing, 2022, 125: 109189

[235]

Hao P, Li Y, Bai C . Meta-relationship for course recommendation in MOOCs. Multimedia Systems, 2023, 29( 1): 235–246

[236]

Lin Y, Lin F, Zeng W, Xiahou J, Li L, Wu P, Liu Y, Miao C . Hierarchical reinforcement learning with dynamic recurrent mechanism for course recommendation. Knowledge-Based Systems, 2022, 244: 108546

[237]

Zhang H, Shen X, Yi B, Wang W, Feng Y . KGAN: knowledge grouping aggregation network for course recommendation in MOOCs. Expert Systems with Applications, 2023, 211: 118344

[238]

Zhang Y, Yu M, Sun J, Zhang T, Yu G. MG-CR: factor memory network and graph neural network based personalized course recommendation. In: Proceedings of the 28th International Conference on Database Systems for Advanced Applications. 2023, 547−562

[239]

Amin S, Uddin M I, Alarood A A, Mashwani W K, Alzahrani A O, Alzahrani H A . An adaptable and personalized framework for top-N course recommendations in online learning. Scientific Reports, 2024, 14( 1): 10382

[240]

Li S, Zhao Y, Guo L, Ren M, Li J, Zhang L, Li K . Quantification and prediction of engagement: applied to personalized course recommendation to reduce dropout in MOOCs. Information Processing & Management, 2024, 61( 1): 103536

[241]

Gharahighehi A, Ghinis A, Venturini M, Cornillie F, Vens C. Enhancing collaborative filtering-based course recommendations by exploiting time-to-event information with survival analysis. 2025, arXiv preprint arXiv: 2503.00072

[242]

Wang X J, Lee C P, Mutlu B. LearnMate: enhancing online education with LLM-powered personalized learning plans and support. In: Proceedings of the Extended Abstracts of the CHI Conference on Human Factors in Computing Systems. 2025, 373

[243]

Van Deventer H, Mills M, Evrard A. From interests to insights: an LLM approach to course recommendations using natural language queries. 2024, arXiv preprint arXiv: 2412.19312

[244]

Li J, Lyu Q, Qiu W, Khong A W H. Improving course recommendation systems with explainable AI: LLM-based frameworks and evaluations. In: Proceedings of the 18th International Conference on Educational Data Mining. 2025

[245]

Jin X, Kamarthi S. Personalized learning paths: LLM-based course recommendations in manufacturing education. In: Proceedings of the ASEE Annual Conference & Exposition. 2025

[246]

Ma B, Khan M A Z, Yang T, Polyzou A, Konomi S. How good are large language models for course recommendation in MOOCs? 2025, arXiv preprint arXiv: 2504.08208

[247]

Beutling N, Spahic-Bogdanovic M. Personalised course recommender: linking learning objectives and career goals through competencies. In: Proceedings of 2024 AAAI Spring Symposium Series. 2024, 72−81

[248]

Cha S, Loeser M, Seo K . The impact of AI-based course-recommender system on students’ course-selection decision-making process. Applied Sciences, 2024, 14( 9): 3672

[249]

Zhang G, Gao X, Ye H, Zhu J, Lin W, Wu Z, Zhou H, Ye Z, Ge Y, Baghban A . Optimizing learning paths: course recommendations based on graph convolutional networks and learning styles. Applied Soft Computing, 2025, 175: 113083

[250]

Diao X, Zeng Q, Duan H, Lu F, Zhou C . Personalized exercise recommendation driven by learning objective within e-learning systems. International Journal of Performability Engineering, 2018, 14( 10): 2532–2544

[251]

Lv P, Wang X, Xu J, Wang J. Utilizing knowledge graph and student testing behavior data for personalized exercise recommendation. In: Proceedings of the ACM Turing Celebration Conference-China. 2018, 53−59

[252]

Xia J, Li G, Cao Z . Personalized exercise recommendation algorithm combining learning objective and assignment feedback. Journal of Intelligent & Fuzzy Systems, 2018, 35( 3): 2965–2973

[253]

Zhu L, Liu Y, Hei X, Wang Y, Meng H, Jiao J, Pan L. A study on exercise recommendation method using Knowledge Graph for computer network course. In: Proceedings of the International Conference on Networking and Network Applications. 2020, 436−442

[254]

Wu Z, Li M, Tang Y, Liang Q . Exercise recommendation based on knowledge concept prediction. Knowledge-Based Systems, 2020, 210: 106481

[255]

Zhou L, Wang C . Research on recommendation of personalized exercises in English learning based on data mining. Scientific Programming, 2021, 2021: 5042286

[256]

Li Z, Hu H, Xia Z, Zhang J, Li X, Shi J, Li H, Li X. Exercise recommendation algorithm based on improved collaborative filtering. In: Proceedings of the International Conference on Advanced Learning Technologies. 2021, 47−49

[257]

Huang Z, Liu Q, Zhai C, Yin Y, Chen E, Gao W, Hu G. Exploring multi-objective exercise recommendations in online education systems. In: Proceedings of the 28th ACM International Conference on Information and Knowledge Management. 2019, 1261−1270

[258]

Chuang A C, Huang N F, Tzeng J W, Lee C A, Huang Y X, Huang H H. MOOCERS: exercise recommender system in MOOCs based on reinforcement learning algorithm. In: Proceedings of the 8th International Conference on Soft Computing & Machine Intelligence. 2021, 186−190

[259]

He Y, Wang H, Pan Y, Zhou Y, Sun G . Exercise recommendation method based on knowledge tracing and concept prerequisite relations. CCF Transactions on Pervasive Computing and Interaction, 2022, 4( 4): 452–464

[260]

Li Sr Z, Xu F. A personalized exercise recommendation method for teaching objectives. In: Proceedings of the International Conference on Computer Application and Information Security. 2022, 122601Q

[261]

Guan Q, Xiao F, Cheng X, Fang L, Chen Z, Chen G, Luo W. KG4Ex: an explainable knowledge graph-based approach for exercise recommendation. In: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management. 2023, 597−607

[262]

Li L, Wang Z . Knowledge graph-enhanced intelligent tutoring system based on exercise representativeness and informativeness. International Journal of Intelligent Systems, 2023, 2023: 2578286

[263]

Ren Y, Liang K, Shang Y, Zhang Y . MulOER-SAN: 2-layer multi-objective framework for exercise recommendation with self-attention networks. Knowledge-Based Systems, 2023, 260: 110117

[264]

Wu S, Wang J, Zhang W . Contrastive personalized exercise recommendation with reinforcement learning. IEEE Transactions on Learning Technologies, 2024, 17: 691–703

[265]

Lai X, Chen J, Lai Y, Huang S, Cai Y, Sun Z, Wang X, Pan K, Gao Q, Huang C . Using large language models to enhance exercise recommendations and physical activity in clinical and healthy populations: scoping review. JMIR Medical Informatics, 2025, 13: e59309

[266]

Li T, Bian C, Chen K, Lu W. Large language models for zero-shot exercise recommendation in adaptive learning. In: Proceedings of the 26th International Conference on Artificial Intelligence in Education. 2025, 140−147

[267]

Ta N B D, Nguyen H G P, Gottipati S. ExGen: ready-to-use exercise generation in introductory programming courses. In: Proceedings of the 31st International Conference on Computers in Education. 2023, 104−113

[268]

Meißner N, Speth S, Becker S. Automated programming exercise generation in the era of large language models. In: Proceedings of the 36th International Conference on Software Engineering Education and Training. 2024, 1−5

[269]

Zeng G, Xue Q, Lu X. Instruction tuning with LLMs for programming exercise generation. In: Proceedings of the 21st International Conference on Web Information Systems and Applications. 2024, 377−385

[270]

Luo H, Weng J, Lu X, Yu Z. BPE: exploring the prompt framework of physics exercises generated from bloom’s taxonomy in LLM. In: Proceedings of the 5th International Conference on Machine Learning and Computer Application. 2024, 26−29

[271]

Yan Z, Du H, Lin Z, Jianhua Z . Personalization exercise recommendation framework based on knowledge concept graph. Computer Science and Information Systems, 2023, 20( 2): 857–878

[272]

Li R, Wu S, Wang J, Zhang W. CIKT: a collaborative and iterative knowledge tracing framework with large language models. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing. 2025, 19321−19334

[273]

Cui P, Sachan M. Adaptive and personalized exercise generation for online language learning. In: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2023, 10184−10198

[274]

Rytilahti J, Weerakoon O, Kaila E. Exploring AI-driven programming exercise generation. In: Proceedings of the 20th International CDIO Conference. 2024, 708−716

[275]

Bellarhmouch Y, Jeghal A, Tairi H, Benjelloun N . A proposed architectural learner model for a personalized learning environment. Education and Information Technologies, 2023, 28( 4): 4243–4263

[276]

Yang C C Y, Ogata H . Personalized learning analytics intervention approach for enhancing student learning achievement and behavioral engagement in blended learning. Education and Information Technologies, 2023, 28( 3): 2509–2528

[277]

Lim L, Bannert M, van der Graaf J, Singh S, Fan Y, Surendrannair S, Rakovic M, Molenaar I, Moore J, Gašević D . Effects of real-time analytics-based personalized scaffolds on students’ self-regulated learning. Computers in Human Behavior, 2023, 139: 107547

[278]

Raj N S, Renumol V G. A rule-based approach for adaptive content recommendation in a personalized learning environment: an experimental analysis. In: Proceedings of the 10th International Conference on Technology for Education. 2019, 138−141

[279]

Maraza-Quispe B, Alejandro-Oviedo O, Cisneros-Chavez B, Cuentas-Toledo M, Cuadros-Paz L, Fernandez-Gambarini W, Quispe-Flores L, Caytuiro-Silva N. Model to personalize the teaching-learning process in virtual environments using case-based reasoning. In: Proceedings of the 11th International Conference on Education Technology and Computers. 2019, 105−110

[280]

Karaoglan Yilmaz F G, Yilmaz R . Student opinions about personalized recommendation and feedback based on learning analytics. Technology, Knowledge and Learning, 2020, 25( 4): 753–768

[281]

Zayet T M A, Ismail M A, Almadi S H S, Zawia J M H, Mohamad Nor A . What is needed to build a personalized recommender system for K-12 students’ E-learning? recommendations for future systems and a conceptual framework. Education and Information Technologies, 2023, 28( 6): 7487–7508

[282]

Upbin B. Knewton is building the world’s smartest tutor. Forbes Magazine, 2012

[283]

Vesselinov R, Grego J. Duolingo effectiveness study. New York: City University of New York, 2012

[284]

Hajira B, Hamid S, ur Rehman N, Ahmed S N, ur Rehman S . Enhancing speaking and listening skils through mobile-assisted language and learning in a pre-university context. Journal of Political Stability Archive, 2025, 3( 3): 1147–1173

[285]

Smith B, Jiang X, Peters R . The effectiveness of Duolingo in developing receptive and productive language knowledge and proficiency. Language Learning & Technology, 2024, 28( 1): 1–26

[286]

Knox J, Bayne S, Ross J, Macleod H, Sinclair C . MOOC pedagogy: the challenges of developing for Coursera. Association for Learning Technology (ALT) Online Newsletter, 2012, ( 28): 1–12

[287]

Rajeshwari M, Krishna Prasad K . IBM Watson industry cognitive education methods. International Journal of Case Studies in Business, IT and Education, 2020, 4( 1): 38–50

[288]

Al-Ajlan A, Zedan H. Why Moodle. In: Proceedings of the 12th IEEE International Workshop on Future Trends of Distributed Computing Systems. 2008, 58−64

[289]

Mohammed I A, Ekpo C G, Olatunde-Aiyedun T G, Zakari A Y, Ogar S I . The effect of Moodle LMS on distance learning undergraduates’ performance in environmental education. International Journal of Education and Teaching Zone, 2025, 4( 1): 1–20

[290]

Kuhn J . Minecraft: education edition. CALICO Journal, 2018, 35( 2): 214–223

[291]

Slattery E J, Butler D, O’Leary M, Marshall K . Primary school students’ experiences using Minecraft education during a national project-based initiative: an Irish study. TechTrends, 2023, 1–13

[292]

Dan Y, Lei Z, Gu Y, Li Y, Yin J, Lin J, Ye L, Tie Z, Zhou Y, Wang Y, Zhou A, Zhou Z, Chen Q, Zhou J, He L, Qiu X. EduChat: a large-scale language model-based chatbot system for intelligent education. 2023, arXiv preprint arXiv: 2308.02773

[293]

Liu J, Huang Z, Xiao T, Sha J, Wu J, Liu Q, Wang S, Chen E. SocraticLM: exploring Socratic personalized teaching with large language models. In: Proceedings of the 38th International Conference on Neural Information Processing Systems. 2024, 2721

[294]

Scarlatos A, Baker R S, Lan A. Exploring knowledge tracing in tutor-student dialogues using LLMs. In: Proceedings of the 15th International Learning Analytics and Knowledge Conference. 2025, 249−259

[295]

Dong Z, Chen J, Wu F. Knowledge is power: harnessing large language models for enhanced cognitive diagnosis. In: Proceedings of the 39th AAAI Conference on Artificial Intelligence. 2025, 164−172

[296]

Sonkar S, Chen X, Liu N, Baraniuk R G, Sachan M. LLM-based cognitive models of students with misconceptions. 2024, arXiv preprint arXiv: 2410.12294

[297]

Wang Z, Zhou J, Chen Q, Zhang M, Jiang B, Zhou A, Bai Q, He L. LLM-KT: aligning large language models with knowledge tracing using a plug-and-play instruction. 2025, arXiv preprint arXiv: 2502.02945

[298]

Yang L, Sun X, Li H, Xu R, Wei X . Difficulty aware programming knowledge tracing via large language models. Scientific Reports, 2025, 15( 1): 11475

[299]

Cen J, Lin J, Zhou D, Zhong W, Chen J, Yang A, Zhou Y. LLM-driven effective knowledge tracing by integrating dual-channel difficulty. In: Proceedings of the IEEE International Conference on Big Data (BigData). 2025, 647−652

[300]

Lee U, Bae J, Kim D, Lee S, Park J, Ahn T, Lee G, Stratton D, Kim H. Language model can do knowledge tracing: simple but effective method to integrate language model and knowledge tracing task. 2024, arXiv preprint arXiv: 2406.02893

[301]

Zhang Z, Yu J, Li J, Hou L. Exploring the cognitive knowledge structure of large language models: an educational diagnostic assessment approach. In: Proceedings of the Findings of the Association for Computational Linguistics. 2023, 1643−1650

[302]

Nguyen M H, Tschiatschek S, Singla A. Large language models for in-context student modeling: synthesizing student’s behavior in visual programming. In: Proceedings of the 17th International Conference on Educational Data Mining. 2024

[303]

Sharma S, Mittal P, Kumar M, Bhardwaj V . The role of large language models in personalized learning: a systematic review of educational impact. Discover Sustainability, 2025, 6( 1): 243

[304]

Milano S, McGrane J A, Leonelli S . Large language models challenge the future of higher education. Nature Machine Intelligence, 2023, 5( 4): 333–334

RIGHTS & PERMISSIONS

The Author(s) 2026. This article is published with open access at link.springer.com and journal.hep.com.cn