Error Bounds of Median-of-Means Estimators with VC-Dimension
Yuxuan Wang , Yiming Chen , Hanchao Wang , Lixin Zhang
Communications in Mathematics and Statistics ›› : 1 -31.
We obtain the upper error bounds of robust estimators for mean vector, using the median-of-means (MOM) method. The method is designed to handle data with heavy tails and contamination, with only a finite second moment, which is weaker than many others, relying on the VC-dimension rather than the Rademacher complexity to measure statistical complexity. This allows us to implement MOM in covariance estimation, without imposing conditions such as
VC-dimension / Robustness / Median of means / Heavy tails / 62G35 / 62H12
| [1] |
|
| [2] |
Boucheron, S., Lugosi, G., Bousquet, O.: Concentration inequalities. In: Summer School on Machine Learning, pp. 208–240. Springer (2003) |
| [3] |
|
| [4] |
Catoni, O., Giulini, I.: Dimension-free pac-Bayesian bounds for matrices, vectors, and linear least squares regression. arXiv preprint arXiv:1712.02747 (2017) |
| [5] |
|
| [6] |
Chan, T.M.: An optimal randomized algorithm for maximum Tukey depth. In: Proceedings of the 15th Annual ACM-SIAM Symposium on Discrete Algorithms. SODA ’04, pp. 430–436. Society for Industrial and Applied Mathematics, USA (2004) |
| [7] |
|
| [8] |
Cheng, Y., Diakonikolas, I., Ge, R.: High-dimensional robust mean estimation in nearly-linear time. In: Proceedings of the 30th Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 2755–2771. SIAM (2019) |
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
Lei, Z., Luh, K., Venkat, P., Zhang, F.: A fast spectral algorithm for mean estimation with sub-gaussian rates. In: Proceedings of 33rd Conference on Learning Theory. Proceedings of Machine Learning Research, vol. 125, pp. 2598–2612. PMLR (2020) |
| [20] |
Lerasle, M., Oliveira, R.I.: Robust empirical mean estimators. arXiv preprint arXiv:1112.3914 (2011) |
| [21] |
Lugosi, G., Mendelson, S.: Multivariate mean estimation with direction-dependent accuracy. arXiv preprint arXiv:2010.11921 (2020) |
| [22] |
|
| [23] |
|
| [24] |
|
| [25] |
|
| [26] |
Mendelson, S.: An optimal unrestricted learning procedure. arXiv preprint arXiv:1707.05342 (2017) |
| [27] |
|
| [28] |
|
| [29] |
|
| [30] |
|
| [31] |
|
| [32] |
|
| [33] |
|
| [34] |
|
| [35] |
Tukey, J.W.: Mathematics and the picturing of data. In: Proceedings of the International Congress of Mathematicians, Vancouver, vol. 2, pp. 523–531 (1975) |
| [36] |
|
| [37] |
|
| [38] |
Vershynin, R.: High-Dimensional Probability: An Introduction with Applications in Data Science, vol. 47. Cambridge University Press, Cambridge (2018) |
| [39] |
Wei, X., Minsker, S.: Estimation of the covariance structure of heavy-tailed distributions. In: Advances in Neural Information Processing Systems vol. 30 (2017) |
| [40] |
Zwald, L., Blanchard, G.: On the convergence of eigenspaces in kernel principal component analysis. In: Advances in Neural Information Processing Systems, vol. 18, pp. 1649–1656 (2005) |
School of Mathematical Sciences, University of Science and Technology of China and Springer-Verlag GmbH Germany, part of Springer Nature
/
| 〈 |
|
〉 |