Flexible Factor Model for Handling Missing Data in Supervised Learning

Andriette Bekker, Farzane Hashemi, Mohammad Arashi

Communications in Mathematics and Statistics, 2023, Vol. 11, Issue 2: 477-501.

DOI: 10.1007/s40304-021-00260-9

Abstract

This paper presents an extension of the factor analysis model based on the normal mean–variance mixture of the Birnbaum–Saunders distribution in the presence of nonresponses and missing data. The model is a flexible tool for capturing non-normal features in data, such as strongly skewed and heavy-tailed noise. Missing data may arise from operator error or incomplete data capture, and therefore cannot be ignored in factor analysis modeling. We implement an EM-type algorithm for maximum likelihood estimation and propose single imputation of possible missing values under a missing at random mechanism. The potential and applicability of the proposed method are illustrated by analyzing both simulated and real datasets.
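
To make the single-imputation idea concrete, the sketch below fills missing entries with their conditional mean given the observed entries. It is only an illustration under strong assumptions: it uses an ordinary Gaussian factor model with covariance B Bᵀ + diag(D) in place of the paper's normal mean–variance mixture of the Birnbaum–Saunders distribution, and it assumes the parameters have already been estimated (for example by an EM-type algorithm). The function name `impute_conditional_mean` and the parameter names `mu`, `B`, `D` are hypothetical and do not come from the paper.

```python
import numpy as np


def impute_conditional_mean(x, mu, B, D):
    """Fill NaN entries of x with E[x_mis | x_obs] under x ~ N(mu, B B^T + diag(D)).

    Assumption: Gaussian factor model with known parameters; this is a
    simplified stand-in, not the model proposed in the paper.
    """
    x = np.asarray(x, dtype=float)
    mu = np.asarray(mu, dtype=float)
    B = np.asarray(B, dtype=float)
    D = np.asarray(D, dtype=float)

    miss = np.isnan(x)
    if not miss.any():
        return x.copy()
    obs = ~miss
    if not obs.any():
        # Nothing observed: the conditional mean is the marginal mean.
        return mu.copy()

    sigma = B @ B.T + np.diag(D)        # implied covariance matrix
    s_oo = sigma[np.ix_(obs, obs)]      # observed-observed block
    s_mo = sigma[np.ix_(miss, obs)]     # missing-observed block

    out = x.copy()
    # E[x_mis | x_obs] = mu_mis + S_mo S_oo^{-1} (x_obs - mu_obs)
    out[miss] = mu[miss] + s_mo @ np.linalg.solve(s_oo, x[obs] - mu[obs])
    return out


# Toy example: one factor, three variables, the second value missing.
mu = np.zeros(3)
B = np.ones((3, 1))
D = np.ones(3)
x = np.array([1.0, np.nan, 3.0])
x_imputed = impute_conditional_mean(x, mu, B, D)
print(x_imputed)
```

Under a missing at random mechanism, conditioning only on the observed entries is what justifies this kind of imputation; a skewed, heavy-tailed model would replace the Gaussian conditional mean with the corresponding expectation under that model.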

Keywords

Automobile dataset / Asymmetry / ECME algorithm / Factor analysis model / Heavy tails / Incomplete data / Liver disorders dataset

Cite this article

Andriette Bekker, Farzane Hashemi, Mohammad Arashi. Flexible Factor Model for Handling Missing Data in Supervised Learning. Communications in Mathematics and Statistics, 2023, 11(2): 477-501. DOI: 10.1007/s40304-021-00260-9

Funding

National Research Foundation, South Africa (120839)

National Research Foundation, South Africa (71199)

Ferdowsi University of Mashhad (2/54034)
