DRMSpell: dynamically reweighting multimodality for Chinese spelling correction

Yinghao LI , Heyan HUANG , Baojun WANG , Yang GAO

Front. Inform. Technol. Electron. Eng ›› 2025, Vol. 26 ›› Issue (3) : 354 -366.

PDF (501KB)
Front. Inform. Technol. Electron. Eng ›› 2025, Vol. 26 ›› Issue (3) : 354 -366. DOI: 10.1631/FITEE.2300816

DRMSpell: dynamically reweighting multimodality for Chinese spelling correction

Author information +
History +
PDF (501KB)

Abstract

Chinese spelling correction (CSC) is a task that aims to detect and correct the spelling errors that may occur in Chinese texts. However, the Chinese language exhibits a high degree of complexity, characterized by the presence of multiple phonetic representations known as pinyin, which possess distinct tonal variations that can correspond to various characters. Given the complexity inherent in the Chinese language, the CSC task becomes imperative for ensuring the accuracy and clarity of written communication. Recent research has included external knowledge into the model using phonological and visual modalities. However, these methods do not effectively target the utilization of modality information to address the different types of errors. In this paper, we propose a multimodal pretrained language model called DRMSpell for CSC, which takes into consideration the interaction between the modalities. A dynamically reweighting multimodality (DRM) module is introduced to reweight various modalities for obtaining more multimodal information. To fully use the multimodal information obtained and to further strengthen the model, an independent-modality masking strategy (IMS) is proposed to independently mask three modalities of a token in the pretraining stage. Our method achieves state-of-the-art performance on most metrics constituting widely used benchmarks. The findings of the experiments demonstrate that our method is capable of modeling the interactive information between modalities and is also robust to incorrect modal information.

Keywords

Chinese spelling correction / Multimodality / Masking strategy

Cite this article

Download citation ▾
Yinghao LI, Heyan HUANG, Baojun WANG, Yang GAO. DRMSpell: dynamically reweighting multimodality for Chinese spelling correction. Front. Inform. Technol. Electron. Eng, 2025, 26(3): 354-366 DOI:10.1631/FITEE.2300816

登录浏览全文

4963

注册一个新账户 忘记密码

References

RIGHTS & PERMISSIONS

Zhejiang University Press

AI Summary AI Mindmap
PDF (501KB)

Supplementary files

FITEE-0354-24003-YGL_suppl_1

FITEE-0354-24003-YGL_suppl_2

185

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/