Supervised topic models with weighted words: multi-label document classification

Yue-peng ZOU , Ji-hong OUYANG , Xi-ming LI

Front. Inform. Technol. Electron. Eng ›› 2018, Vol. 19 ›› Issue (4) : 513 -523.

PDF (533KB)
Front. Inform. Technol. Electron. Eng ›› 2018, Vol. 19 ›› Issue (4) : 513 -523. DOI: 10.1631/FITEE.1601668
Orginal Article
Orginal Article

Supervised topic models with weighted words: multi-label document classification

Author information +
History +
PDF (533KB)

Abstract

Supervised topic modeling algorithms have been successfully applied to multi-label document classification tasks. Representative models include labeled latent Dirichlet allocation (L-LDA) and dependency-LDA. However, these models neglect the class frequency information of words (i.e., the number of classes where a word has occurred in the training data), which is significant for classification. To address this, we propose a method, namely the class frequency weight (CF-weight), to weight words by considering the class frequency knowledge. This CF-weight is based on the intuition that a word with higher (lower) class frequency will be less (more) discriminative. In this study, the CF-weight is used to improve L-LDA and dependency-LDA. A number of experiments have been conducted on real-world multi-label datasets. Experimental results demonstrate that CF-weight based algorithms are competitive with the existing supervised topic models.

Keywords

Supervised topic model / Multi-label classification / Class frequency / Labeled latent Dirichlet allocation (L-LDA) / Dependency-LDA

Cite this article

Download citation ▾
Yue-peng ZOU, Ji-hong OUYANG, Xi-ming LI. Supervised topic models with weighted words: multi-label document classification. Front. Inform. Technol. Electron. Eng, 2018, 19(4): 513-523 DOI:10.1631/FITEE.1601668

登录浏览全文

4963

注册一个新账户 忘记密码

References

RIGHTS & PERMISSIONS

Zhejiang University and Springer-Verlag GmbH Germany, part of Springer Nature

AI Summary AI Mindmap
PDF (533KB)

Supplementary files

FITEE-0513-18005-YPZ_suppl_1

FITEE-0513-18005-YPZ_suppl_2

4045

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/