Frontiers of Computer Science

RESEARCH ARTICLE

HACMony: automatically detecting hopping-related audio-stream conflict issues on HarmonyOS

Jinlong HE, Binru HUANG, Changwei XIA, Hengqin YANG, Jiwei YAN, Jun YAN

2027, 21 (6): 2106201. https://doi.org/10.1007/s11704-025-50681-w


HarmonyOS is emerging as a popular distributed operating system for diverse mobile devices. One of its standout features is app-hopping, which allows users to switch apps seamlessly across different HarmonyOS devices. However, when apps that are playing audio streams hop between devices, they can easily trigger Hopping-related Audio-stream Conflict ($HAC$) scenarios. Improper resolution of $HAC$ scenarios leads to significant $HAC$ issues, which are hard to detect comprehensively because the semantics of HarmonyOS’s app-hopping mechanism are unclear and effective multi-app hopping testing methods are lacking. To fill this gap, this paper introduces an automated and efficient approach to detecting $HAC$ issues. We formalize, for the first time, the operational semantics of HarmonyOS’s app-hopping mechanism for audio streams. Leveraging this formalization, we design an Audio-stream-aware State Transition Graph ($ASTG$) to model the behaviors of audio streams during window transitions and propose a model-based approach to detect $HAC$ issues automatically. Our techniques are implemented in a tool, $HACMony$, and evaluated on 20 real-world HarmonyOS apps. Experimental results reveal that 12 of the 20 apps exhibit $HAC$ issues. Among the 53 $HAC$ issues detected, 18 unique $HAC$ issues are manually confirmed. Additionally, we summarize the detected issues into two typical types, namely $MoD$ and $MoR$, and analyze their characteristics to assist and guide both app and OS developers.

RESEARCH ARTICLE

TestBench: evaluating class-level test case generation capability of large language models

Quanjun ZHANG, Ye SHANG, Chunrong FANG, Siqi GU, Shengcheng YU, Jianyi ZHOU, Zhenyu CHEN

2027, 21 (6): 2106202. https://doi.org/10.1007/s11704-025-50078-9


Software testing is a crucial phase in the software life cycle, helping identify potential risks and reduce maintenance costs. With the advancement of Large Language Models (LLMs), researchers have proposed an increasing number of LLM-based software testing techniques, particularly in the area of test case generation. Despite the growing interest, limited efforts have been made to thoroughly evaluate the actual capabilities of LLMs in this task. In this paper, we introduce TestBench, a benchmark for class-level LLM-based test case generation. We construct a dataset of 108 Java programs from nine real-world, large-scale projects on GitHub, each representing a different thematic domain. We then design three distinct types of prompts based on context descriptions: self-contained context, full context, and simple context. In addition, we propose a fine-grained evaluation framework that considers five aspects of test cases: syntactic correctness, compilation correctness, test correctness, code coverage rate, and defect detection rate. Furthermore, we propose a heuristic algorithm to repair erroneous test cases generated by LLMs. We evaluate CodeLlama-13b, GPT-3.5, and GPT-4 on TestBench, and our experimental results indicate that larger models demonstrate a greater ability to utilize contextual information effectively, which leads them to generate higher-quality test cases. Smaller models may struggle with the noise introduced by the extensive information contained within the full context. However, when they are given the simple context, a simplified version derived from the full context via abstract syntax tree analysis, the performance of these models improves significantly. Our analysis highlights the current progress and pinpoints future directions to further enhance the effectiveness of models by handling contextual information for test case generation.
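
As a rough illustration of how the five evaluation aspects named in the abstract could be aggregated over a set of generated tests, consider the minimal sketch below. It is not TestBench's implementation: the `GeneratedTest` record, the `summarize` function, the use of averaged line coverage, and the use of killed mutants as a proxy for defect detection are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class GeneratedTest:
    """Hypothetical per-test record; field names are assumptions."""
    parses: bool           # test source is syntactically valid
    compiles: bool         # test compiles against the class under test
    passes: bool           # test runs and passes
    line_coverage: float   # fraction of lines covered, in [0, 1]
    mutants_killed: int    # assumed proxy for detected defects
    mutants_total: int


def summarize(tests: list[GeneratedTest]) -> dict[str, float]:
    """Aggregate the five aspects into rates over all generated tests."""
    if not tests:
        raise ValueError("no tests to summarize")
    n = len(tests)
    killed = sum(t.mutants_killed for t in tests)
    total = sum(t.mutants_total for t in tests)
    return {
        "syntactic_correctness": sum(t.parses for t in tests) / n,
        "compilation_correctness": sum(t.compiles for t in tests) / n,
        "test_correctness": sum(t.passes for t in tests) / n,
        "code_coverage_rate": sum(t.line_coverage for t in tests) / n,
        "defect_detection_rate": killed / total if total else 0.0,
    }


if __name__ == "__main__":
    sample = [
        GeneratedTest(True, True, True, 0.8, 3, 4),
        GeneratedTest(True, False, False, 0.0, 0, 4),
    ]
    for name, value in summarize(sample).items():
        print(f"{name}: {value:.3f}")
```

Reporting each aspect as a separate rate, rather than one combined score, keeps failures at an early stage (for example, compilation) visible instead of hiding them inside a low coverage number.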

RESEARCH ARTICLE

Linking C and assembly for denotation-based verified compositional compilation

Zhang CHENG, Jiyang WU, Qinxiang CAO

2027, 21 (6): 2106203. https://doi.org/10.1007/s11704-025-50455-4


Modern software often integrates assembly code into C programs for performance-critical tasks, such as big integer computation in the GMP library and cryptographic algorithms in the OpenSSL library. Though the formal semantics of C and assembly are well-studied, linking these languages for realistic compiler verification remains challenging due to their abstraction gap. In this article, we address the problem of linking C with assembly by introducing a denotation-based framework, which we apply to support verified compositional compilation (VCC) for C and x86 assembly in CompCert. Specifically, we propose a novel semantic transformation operator that systematically interprets the denotational semantics of assembly into that of C. This semantic transformation bridges the interaction differences between the two languages, enabling their semantic linking to be effectively implemented under a unified language interface. Furthermore, we demonstrate the soundness of such semantic linking by applying it to the context of VCC, where two key properties are proved: 1) the transformed C semantics is faithfully preserved by the original assembly semantics, and 2) this preservation is compositional with the compilation correctness of other open modules. As a result, our approach advances the use of denotational semantics for VCC when both C and assembly are involved. All results in this article are formally verified in the Coq proof assistant.

LETTER

WABench: a cross-language benchmark suite for WebAssembly performance evaluation

Yanan WANG, Zhexiong LI, Deze ZENG, Lin GU, Quan CHEN

2027, 21 (6): 2106204. https://doi.org/10.1007/s11704-025-51069-6


LETTER

PhyLS: an AI-driven physically aware synthesis platform

Hongyang PAN, Cunqing LAN, Zhiang WANG, Keren ZHU

2027, 21 (6): 2106205. https://doi.org/10.1007/s11704-025-51861-4


RESEARCH ARTICLE

From chaos to clarity: log-based kernel panic root cause analysis for large-scale cloud services

Tianyu CUI, Yang ZHANG, Shenglin ZHANG, Xin WU, Yicheng SUI, Liangyan PENG, Yuhe JI, Feng WANG, Changchang LIU, Zeyu CHE, Xiaozhou LIU, Yongqian SUN, Yu ZHANG

2027, 21 (6): 2106206. https://doi.org/10.1007/s11704-025-50788-0


Operating system (OS) kernel panics, which are triggered by unrecoverable fatal errors, pose serious threats to the stability and reliability of ByteDance’s large-scale cloud services. Diagnosing such failures through log analysis is essential for identifying root causes and preventing recurrence. However, root cause analysis (RCA) for kernel panics faces two key challenges. First, only a small portion of logs explicitly indicate the kernel panic, making relevant signals difficult to extract. Second, there exist complex and long-range dependencies across logs, making it difficult to pinpoint root causes effectively. To address these challenges, we propose LogSage, a novel log-based framework for kernel panic RCA in large-scale cloud environments. LogSage combines unsupervised clustering techniques with large language models (LLMs) to extract fault-indicating log snippets, and further employs a graph-based RCA module that integrates Graph Neural Networks (GNNs) for structured log representation and active learning for efficient label utilization. We evaluate LogSage on three real-world datasets. Experimental results show that LogSage achieves high performance, with F1-scores of 92.2%, 95.3%, and 96.3%, respectively. These scores exceed those of the strongest baseline methods by 15.5%, 20.3%, and 20.1%, respectively. In addition, LogSage has been deployed in ByteDance’s cloud infrastructure for over six months, where it has successfully assisted engineers in real-world RCA tasks.

LETTER

Bridging graph learning and federated optimization for recommendations

Chunxu ZHANG, Zonghan WU, Honglei ZHANG, Jiaxu CUI, Bo YANG

2027, 21 (6): 2106336. https://doi.org/10.1007/s11704-026-51383-7


LETTER

User-differentiated federated unlearning algorithm inspired by collaborative memory modulation

Mengyao LI, Lubing SUN, Zhenhao LIU, Songquan LI, Lu LIU, John PANNEERSELVAM, Rongbo ZHU

2027, 21 (6): 2106337. https://doi.org/10.1007/s11704-026-50984-6


LETTER

BrainCanvas: cross-subject brain decoding with imageset diffusion and domain invariance

Xin ZHOU, Tianyang DONG, Yiqiang LIAO, Wenyuan YING, Wentao DAI, Jing FAN

2027, 21 (6): 2106338. https://doi.org/10.1007/s11704-026-51973-5


RESEARCH ARTICLE

Towards preferred diagnoses through local search MaxSAT

Huisi ZHOU, Pengxu CHEN, Ran TAI, Dantong OUYANG, Liwei WANG, Xinyu ZHANG, Wei HU

2027, 21 (6): 2106401. https://doi.org/10.1007/s11704-025-50829-8


Model-based diagnosis (MBD) is a principled approach for identifying the possible causes of unexpected behavior caused by system malfunctions and is crucial for enhancing the reliability of modern intelligent systems. Previous MBD methods often generate numerous candidate diagnoses without ensuring their accuracy in identifying faulty components. In this paper, we introduce a novel Enhanced Model-Based Diagnosis (EMBD) method, which refines candidate component sets through iterative applications of our fault localization model and our proposed local search MaxSAT algorithm, achieving precise identification of faulty components. First, we establish a formal fault localization model using Weighted Conjunctive Normal Form (WCNF). The model encodes the fault localization problem as a MaxSAT problem by duplicating the original circuit and inserting XOR gates between corresponding gate pairs to enable output comparison and discrepancy detection. Second, building upon this model, we develop NuFPS, a local search MaxSAT algorithm that identifies the maximum number of components with output logic variations across different observations, effectively pruning fault-free components from the candidate diagnoses. Experimental evaluations demonstrate that EMBD achieves significantly more accurate candidate diagnoses. Compared with state-of-the-art methods such as HSD, CMMO, DiagDO, and DPDN, EMBD achieves increases in diagnosis performance of 673.6%, 254.2%, 211.5%, and 283.2%, respectively.
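
To make the XOR-based comparison step more concrete, the sketch below shows the standard CNF (Tseitin-style) encoding of a single XOR gate whose output is true exactly when two corresponding gate outputs differ. This is a generic textbook encoding, not the paper's actual WCNF model: the function names, the variable numbering, and the brute-force check are assumptions made for illustration, and the weighting of soft clauses is deliberately left out because the abstract does not specify it.

```python
from itertools import product


def xor_gate_clauses(a: int, b: int, z: int) -> list[list[int]]:
    """Return CNF clauses (DIMACS-style signed ints) enforcing z <-> (a XOR b).

    a and b stand for the outputs of a gate and its duplicate;
    z is the discrepancy indicator produced by the inserted XOR gate.
    """
    return [
        [-a, -b, -z],  # a and b both true   -> z false
        [a, b, -z],    # a and b both false  -> z false
        [a, -b, z],    # a false, b true     -> z true
        [-a, b, z],    # a true, b false     -> z true
    ]


def satisfies(clauses: list[list[int]], assignment: dict[int, bool]) -> bool:
    """Check whether a full assignment satisfies every clause."""
    return all(
        any(assignment[abs(lit)] == (lit > 0) for lit in clause)
        for clause in clauses
    )


def check_xor_encoding() -> bool:
    """Brute-force check that the clauses hold exactly when z == (a != b)."""
    clauses = xor_gate_clauses(1, 2, 3)
    for a, b, z in product([False, True], repeat=3):
        expected = z == (a != b)
        if satisfies(clauses, {1: a, 2: b, 3: z}) != expected:
            return False
    return True


if __name__ == "__main__":
    print(check_xor_encoding())
```

In a MaxSAT formulation, clauses like these would typically be hard constraints describing the circuit, while preferences over the discrepancy variables would be expressed as weighted soft clauses.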
