Automatic traceability link recovery via active learning

Tian-bao DU; Guo-hua SHEN; Zhi-qiu HUANG; Yao-shen YU; De-xiang WU

doi:10.1631/FITEE.1900222

PDF(495 KB)

Front. Inform. Technol. Electron. Eng ›› 2020, Vol. 21 ›› Issue (8) : 1217-1225. DOI: 10.1631/FITEE.1900222

Orginal Article

Automatic traceability link recovery via active learning

Tian-bao DU¹ ,
Guo-hua SHEN¹^,²^,³ ,
Zhi-qiu HUANG¹^,²^,³ ,
Yao-shen YU¹ ,
De-xiang WU¹

Author information +

History +

Abstract

Traceability link recovery (TLR) is an important and costly software task that requires humans establish relationships between source and target artifact sets within the same project. Previous research has proposed to establish traceability links by machine learning approaches. However, current machine learning approaches cannot be well applied to projects without traceability information (links), because training an effective predictive model requires humans label too many traceability links. To save manpower, we propose a new TLR approach based on active learning (AL), which is called the AL-based approach. We evaluate the AL-based approach on seven commonly used traceability datasets and compare it with an information retrieval based approach and a state-ofthe-art machine learning approach. The results indicate that the AL-based approach outperforms the other two approaches in terms of F-score.

Keywords

Automatic / Traceability link recovery / Manpower / Active learning

Cite this article

EndNote

Ris (Procite)

Bibtex

Download citation ▾

Tian-bao DU, Guo-hua SHEN, Zhi-qiu HUANG, Yao-shen YU, De-xiang WU. Automatic traceability link recovery via active learning. Front. Inform. Technol. Electron. Eng, 2020, 21(8): 1217‒1225 https://doi.org/10.1631/FITEE.1900222