Emerging topic identification from app reviews via adaptive online biterm topic modeling

Wan ZHOU; Yong WANG; Cuiyun GAO; Fei YANG

doi:10.1631/FITEE.2100465

PDF(1130 KB)

Front. Inform. Technol. Electron. Eng ›› 2022, Vol. 23 ›› Issue (5) : 678-691. DOI: 10.1631/FITEE.2100465

Orginal Article

Emerging topic identification from app reviews via adaptive online biterm topic modeling

Wan ZHOU¹ ,
Yong WANG¹^,² ,
Cuiyun GAO³ ,
Fei YANG⁴

Author information +

History +

Abstract

Emerging topics in app reviews highlight the topics (e.g., software bugs) with which users are concerned during certain periods. Identifying emerging topics accurately, and in a timely manner, could help developers more effectively update apps. Methods for identifying emerging topics in app reviews based on topic models or clustering methods have been proposed in the literature. However, the accuracy of emerging topic identification is reduced because reviews are short in length and offer limited information. To solve this problem, an improved emerging topic identification (IETI) approach is proposed in this work. Specifically, we adopt natural language processing techniques to reduce noisy data, and identify emerging topics in app reviews using the adaptive online biterm topic model. Then we interpret the implicature of emerging topics through relevant phrases and sentences. We adopt the official app changelogs as ground truth, and evaluate IETI in six common apps. The experimental results indicate that IETI is more accurate than the baseline in identifying emerging topics, with improvements in the F1 score of 0.126 for phrase labels and 0.061 for sentence labels. Finally, we release the codes of IETI on Github (https://github.com/wanizhou/IETI).

Keywords

App reviews / Emerging topic identification / Topic model / Natural language processing

Cite this article

EndNote

Ris (Procite)

Bibtex

Download citation ▾

Wan ZHOU, Yong WANG, Cuiyun GAO, Fei YANG. Emerging topic identification from app reviews via adaptive online biterm topic modeling. Front. Inform. Technol. Electron. Eng, 2022, 23(5): 678‒691 https://doi.org/10.1631/FITEE.2100465