Nov 2025, Volume 19 Issue 11
    

  • Select all
    Artificial Intelligence
  • RESEARCH ARTICLE
    Mengyi YAN, Yaoshu WANG, Xiaohan JIANG, Haoyi ZHOU, Jianxin LI

    Entity Resolution (ER) is vital for data integration and knowledge graph construction. Despite the advancements made by deep learning (DL) methods using pre-trained language models (PLMs), these approaches often struggle with unstructured, long-text entities (ULE) in real-world scenarios, where critical information is scattered across the text, and existing DL methods require extensive human labeling and computational resources. To tackle these issues, we propose a Few-shot Uncertainty-calibrated Structural data Enrichment method for ER (FUSER). FUSER applies unsupervised pairwise enrichment to extract structural attributes from unstructured entities via Large Language Models (LLMs), and integrates an uncertainty-based calibration module to reduce hallucination issues with minimal additional inference cost. It also implements a lightweight ER pipeline that efficiently performs both blocking and matching tasks with as few as 50 labeled positive samples. FUSER was evaluated on six ER benchmark datasets featuring ULE entities, outperforming state-of-the-art methods and significantly boosting the performance of existing ER approaches through its data enrichment component, with a 10× speedup in uncertainty quantification for LLMs compared to baseline methods, demonstrating its efficiency and effectiveness in real-world applications.

  • REVIEW ARTICLE
    Jiaqi HAN, Jiacheng CEN, Liming WU, Zongzhao LI, Xiangzhe KONG, Rui JIAO, Ziyang YU, Tingyang XU, Fandi WU, Zihe WANG, Hongteng XU, Zhewei WEI, Deli ZHAO, Yang LIU, Yu RONG, Wenbing HUANG

    Geometric graphs are a special kind of graph with geometric features, which are vital to model many scientific problems. Unlike generic graphs, geometric graphs often exhibit physical symmetries of translations, rotations, and reflections, making them ineffectively processed by current Graph Neural Networks (GNNs). To address this issue, researchers proposed a variety of geometric GNNs equipped with invariant/equivariant properties to better characterize the geometry and topology of geometric graphs. Given the current progress in this field, it is imperative to conduct a comprehensive survey of data structures, models, and applications related to geometric GNNs. In this paper, based on the necessary but concise mathematical preliminaries, we formalize geometric graph as the data structure, on top of which we provide a unified view of existing models from the geometric message passing perspective. Additionally, we summarize the applications as well as the related datasets to facilitate later research for methodology development and experimental evaluation. We also discuss the challenges and future potential directions of geometric GNNs at the end of this survey.

  • LETTER
    Di ZHOU, Ying GAO, Hui LI, Xiaoya LIU, Qinghua LIN
  • LETTER
    Hehao BAO, Keying DU, Xinyi SU, Jianqi FAN, Jing HUANG, Wenlong DONG, Lu LU, Kun LI
  • RESEARCH ARTICLE
    Tiankai HANG, Shuyang GU, Dong CHEN, Xin GENG, Baining GUO

    This paper presents a novel generative model, Collaborative Competitive Agents (CCA), which leverages the capabilities of multiple Large Language Models (LLMs) based agents to execute complex tasks. Drawing inspiration from Generative Adversarial Networks (GANs), the CCA system employs two equal-status generator agents and a discriminator agent. The generators independently process user instructions and generate results, while the discriminator evaluates the outputs, and provides feedback for the generator agents to further reflect and improve the generation results. Unlike the previous generative model, our system can obtain the intermediate steps of generation. This allows each generator agent to learn from other successful executions due to its transparency, enabling a collaborative competition that enhances the quality and robustness of the system’s results. The primary focus of this study is image editing, demonstrating the CCA’s ability to handle intricate instructions robustly. The paper’s main contributions include the introduction of a multi-agent-based generative model with controllable intermediate steps and iterative optimization, a detailed examination of agent relationships, and comprehensive experiments on image editing.

  • LETTER
    Yi XU, Luoyi FU, Xinbing WANG
  • RESEARCH ARTICLE
    Dongzi WANG, Lilan HUANG, Muning WEN, Yuanxi PENG, Minglong LI, Teng LI

    Symmetry is prevalent in multi-agent systems. The presence of symmetry, coupled with the misuse of absolute coordinate systems, often leads to a large amount of redundant representation space, significantly increasing the search space for learning policies and reducing learning efficiency. Effectively utilizing symmetry and extracting symmetry-invariant representations can significantly enhance multi-agent systems’ learning efficiency and overall performance by compressing the model’s hypothesis space and improving sample efficiency. The issue of rotational symmetry in multi-agent reinforcement learning has received little attention in previous research and is the primary focus of this paper. To address this issue, we propose a rotation-invariant network architecture for continuous action space tasks. This architecture utilizes relative coordinates between agents, eliminating dependence on absolute coordinate systems, and employs a hypernetwork to enhance the model’s fitting capability, enabling it to model MDPs with more complex dynamics. It can be used for both predicting actions and evaluating action values/utilities. In benchmark tasks, experimental results validate the impact of rotational symmetry on multi-agent decision systems and demonstrate the effectiveness of our method. The code of RDHNet has been available at the website of github.com/wang88256187/RDHNet.

  • RESEARCH ARTICLE
    Pengyang SHAO, Kun ZHANG, Chen GAO, Lei CHEN, Miaomiao CAI, Le WU, Yong LI, Meng WANG

    Educational Cognitive Diagnosis (CD) aims to provide students’ mastery levels on different concepts. One common observation is that students often conduct many exercises but engage with a small subset of concepts, leading to a sparsity barrier. Current CD models mostly adopt mastery levels on all concepts as student modeling, overlooking the sparsity barrier. If a student does not interact with all concepts, we can not ensure that each dimension of mastery levels on concepts can be well-trained. In this paper, we propose a novel Enhancing Student Representations in Cognitive Diagnosis (ESR-CD), which combines application abilities and comprehension degrees for mastery levels on concepts. To model application ability, we propose a sparsity-based mask module that solely depends on the dense student-concept entries. Simultaneously, to further enhance comprehension degrees, we propose two layers: a matrix factorization layer and a relation refinement layer. Extensive experiments on two real-world datasets demonstrate the effectiveness of ESR-CD.

  • REVIEW ARTICLE
    Yuemei XU, Ling HU, Jiayi ZHAO, Zihan QIU, Kexin XU, Yuqi YE, Hanwen GU

    Based on the foundation of Large Language Models (LLMs), Multilingual LLMs (MLLMs) have been developed to address the challenges faced in multilingual natural language processing, hoping to achieve knowledge transfer from high-resource languages to low-resource languages. However, significant limitations and challenges still exist, such as language imbalance, multilingual alignment, and inherent bias. In this paper, we aim to provide a comprehensive analysis of MLLMs, delving deeply into discussions surrounding these critical issues. First of all, we start by presenting an overview of MLLMs, covering their evolutions, key techniques, and multilingual capacities. Secondly, we explore the multilingual training corpora of MLLMs and the multilingual datasets oriented for downstream tasks that are crucial to enhance the cross-lingual capability of MLLMs. Thirdly, we survey the state-of-the-art studies of multilingual representations and investigate whether the current MLLMs can learn a universal language representation. Fourthly, we discuss bias on MLLMs, including its categories, evaluation metrics, and debiasing techniques. Finally, we discuss existing challenges and point out promising research directions of MLLMs.

  • Networks and Communication
  • LETTER
    Shengyuan QI, Lin YANG, Linru MA, Yuyang ZHOU, Shanqing JIANG, Lianxiao MENG, Guang CHENG
  • RESEARCH ARTICLE
    Dongming LUAN, En WANG, Wenbin LIU, Yongjian YANG, Jing DENG

    Mobile CrowdSensing (MCS) has become a powerful sensing paradigm for information collection recently. As sensing becomes more complicated, it is beneficial to deploy edge servers between users and the cloud center with a so-called mobile edge computing. Instead of directly offloading the sensing data to the cloud center, mobile users offload the sensing data to the edge servers. Then, the edge server processes and transmits the data to the cloud center in a distributed and parallel manner. It’s however critically important to balance cost, such as energy consumption, and the stability of the queues on both mobile users and edge servers. Therefore, to minimize the data offloading cost while maintaining system stability, we should carefully design the sensing data offloading strategy for edge-based crowdsensing. To this end, we formulate a double-queue Lyapunov optimization problem and propose a sensing data offloading strategy. We analyze the upper bounds of the total offloading cost and queue backlog. We further formulate the heterogeneous sensing data problem as the minimum weight bipartite graph matching problem and develop an approach that is based on Kuhn-Munkres algorithm. Finally, we conduct simulations based on three mobility sets. Simulation results show that the proposed techniques outperform several state-of-art algorithms in overall cost, system stability, and other performance metrics.

  • RESEARCH ARTICLE
    Yangyang LIU, Jingyu HUA, Boyang ZHOU, Zhiqiang RU, Sheng ZHONG

    Software Defined Networking (SDN) offers Traffic Engineering (TE) great flexibility by decoupling the control and data plane. As network services become more diverse, the single-domain control architecture is no longer sufficient to meet scalability requirements, and the multi-domain and multi-level distributed control architecture is gaining popularity. However, traffic engineering across multiple domains poses challenges, particularly when each administrative domain is unwilling to disclose its network topology and resource information due to privacy concerns. To address this issue, this paper adopts the concept of differential privacy and perturbs the domain information to achieve bandwidth indistinguishable TE. Unfortunately, perturbations may decrease the accuracy of the TE algorithm’s resource allocation, negatively affecting performance. To mitigate this problem, we propose BI-TE, which utilizes a GNN-based bandwidth utilization prediction model to assist the controller in selecting the optimal forwarding path, thereby enhancing TE efficiency. Experimental results demonstrate that compared to abstraction-based hierarchical TE, BI-TE can reduce the processing time by nearly 24.35% while ensuring network bandwidth utilization close to 90%. Additionally, the fairness of allocation is also guaranteed.

  • Interdisciplinary
  • LETTER
    Heyang HUA, Sijie LI, Haitian LIANG, Shengquan CHEN