Flexible Factor Model for Handling Missing Data in Supervised Learning
Andriette Bekker , Farzane Hashemi , Mohammad Arashi
Communications in Mathematics and Statistics ›› 2023, Vol. 11 ›› Issue (2) : 477 -501.
Flexible Factor Model for Handling Missing Data in Supervised Learning
This paper presents an extension of the factor analysis model based on the normal mean–variance mixture of the Birnbaum–Saunders in the presence of nonresponses and missing data. This model can be used as a powerful tool to model non-normal features observed from data such as strongly skewed and heavy-tailed noises. Missing data may occur due to operator error or incomplete data capturing therefore cannot be ignored in factor analysis modeling. We implement an EM-type algorithm for maximum likelihood estimation and propose single imputation of possible missing values under a missing at random mechanism. The potential and applicability of our proposed method are illustrated through analyzing both simulated and real datasets.
Automobile dataset / Asymmetry / ECME algorithm / Factor analysis model / Heavy tails / Incomplete data / Liver disorders dataset
| [1] |
Anderson, T.W.: An introduction to multivariate statistical analysis (Wiley Series in Probability and Statistics), 3 edn. (2003) |
| [2] |
|
| [3] |
Basilevsky, A.T.: Statistical factor analysis and related methods: theory and applications, New York, Wiley (2009) |
| [4] |
|
| [5] |
Fokoué, E., Titterington, D.: Mixtures of factor analysers. Bayesian estimation and inference by stochastic simulation. Machine Learning 50(1), 73–94 (2003) |
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
|
| [24] |
|
| [25] |
Schafer, J.L.: Analysis of incomplete multivariate data. CRC Press (1997) |
| [26] |
|
| [27] |
Villasenor Alva, J.A., Estrada, E.G.: A generalization of shapiro-wilk’s test for multivariate normality. Communications in Statistics-Theory and Methods 38(11), 1870–1883 (2009) |
| [28] |
|
| [29] |
|
/
| 〈 |
|
〉 |