Afeature selection approach based on a similarity measure for software defect prediction

Qiao YU; Shu-juan JIANG; Rong-cun WANG; Hong-yang WANG

doi:10.1631/FITEE.1601322

PDF(605 KB)

Front. Inform. Technol. Electron. Eng ›› 2017, Vol. 18 ›› Issue (11) : 1744-1753. DOI: 10.1631/FITEE.1601322

Article

Afeature selection approach based on a similarity measure for software defect prediction

Author information +

History +

Abstract

Software defect prediction is aimed to find potential defects based on historical data and software features. Software features can reflect the characteristics of software modules. However, some of these features may be more relevant to the class (defective or non-defective), but others may be redundant or irrelevant. To fully measure the correlation between different features and the class, we present a feature selection approach based on a similarity measure (SM) for software defect prediction. First, the feature weights are updated according to the similarity of samples in different classes. Second, a feature ranking list is generated by sorting the feature weights in descending order, and all feature subsets are selected from the feature ranking list in sequence. Finally, all feature subsets are evaluated on a k-nearest neighbor (KNN) model and measured by an area under curve (AUC) metric for classification performance. The experiments are conducted on 11 National Aeronautics and Space Administration (NASA) datasets, and the results show that our approach performs better than or is comparable to the compared feature selection approaches in terms of classification performance.

Keywords

Software defect prediction / Feature selection / Similarity measure / Feature weights / Feature ranking list

Cite this article

EndNote

Ris (Procite)

Bibtex

Download citation ▾

Qiao YU, Shu-juan JIANG, Rong-cun WANG, Hong-yang WANG. Afeature selection approach based on a similarity measure for software defect prediction. Front. Inform. Technol. Electron. Eng, 2017, 18(11): 1744‒1753 https://doi.org/10.1631/FITEE.1601322