Max-margin non-negative matrix factorization with flexible spatial constraints based on factor analysis
Dakun LIU, Xiaoyang TAN
Max-margin non-negative matrix factorization with flexible spatial constraints based on factor analysis
Non-negative matrix factorization (NMF) is a popular feature encoding method for image understanding due to its non-negative properties in representation, but the learnt basis images are not always local due to the lack of explicit constraints in its objective. Various algebraic or geometric local constraints are hence proposed to shape the behaviour of the original NMF. Such constraints are usually rigid in the sense that they have to be specified beforehand instead of learning from the data. In this paper, we propose a flexible spatial constraint method for NMF learning based on factor analysis. Particularly, to learn the local spatial structure of the images, we apply a series of transformations such as orthogonal rotation and thresholding to the factor loading matrix obtained through factor analysis. Then we map the transformed loading matrix into a Laplacian matrix and incorporate this into a max-margin non-negative matrix factorization framework as a penalty term, aiming to learn a representation space which is non-negative, discriminative and localstructure-preserving. We verify the feasibility and effectiveness of the proposed method on several real world datasets with encouraging results.
non-negative matrix factorization / factor analysis / loading matrix / flexible spatial constraints
[1] |
Bishop C M, Nasrabadi N M. Pattern Recognition and Machine Learning. New York: Springer, 2006
|
[2] |
Turk M, Pentland A. Eigenfaces for recognition. Journal of Cognitive Neuroscience, 1991, 3(1): 71–86
CrossRef
Google scholar
|
[3] |
Tipping M E, Bishop C M. Probabilistic principal component analysis. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 1999, 61(3): 611–622
CrossRef
Google scholar
|
[4] |
Zou H, Hastie T, Tibshirani R. Sparse principal component analysis. Journal of Computational and Graphical Statistics, 2006, 15(2): 265–286
CrossRef
Google scholar
|
[5] |
Lee D D, Seung H S. Learning the parts of objects by non-negative matrix factorization. Nature, 1999, 401(6755): 788–791
CrossRef
Google scholar
|
[6] |
Seung D, Lee L. Algorithms for non-negative matrix factorization. Advances in Neural Information Processing Systems, 2001, 13: 556–562
|
[7] |
Ross D A, Zemel R S. Learning parts-based representations of data. The Journal of Machine Learning Research, 2006, 7: 2369–2397
|
[8] |
Lemme A, Reinhart R F, Steil J J. Online learning and generalization of parts-based image representations by non-negative sparse autoencoders. Neural Networks, 2012, 33: 194–203
CrossRef
Google scholar
|
[9] |
Wang S, Uchida S, Liwicki M, Feng Y. Part-based methods for handwritten digit recognition. Frontiers of Computer Science, 2013, 7(4): 514–525
CrossRef
Google scholar
|
[10] |
Zhang Y, Chen L, Jia J, Zhao Z. Multi-focus image fusion based on non-negative matrix factorization and difference images. Signal Processing, 2014, 105: 84–97
CrossRef
Google scholar
|
[11] |
Du H, Hu Q, Zhang X, Hou Y. Image feature extraction via graph embedding regularized projective non-negative matrix factorization. Pattern Recognition, 2014, 483: 196–209
CrossRef
Google scholar
|
[12] |
Wu Y, Shen B, Ling H. Visual tracking via online nonnegative matrix factorization. IEEE Transactions on Circuits and Systems for Video Technology, 2014, 24(3): 374–383
CrossRef
Google scholar
|
[13] |
Wang X, Wang B, Bai X, Liu W, Tu Z. Max-margin multiple-instance dictionary learning. In: Proceedings of the 30th International Conference on Machine Learning. 2013, 846–854
|
[14] |
Wang Y, Jia Y. Fisher non-negative matrix factorization for learning local features. In: Proceedings of Asian Conference on Computer Vision. 2004
|
[15] |
Zafeiriou S, Tefas A, Buciu I, Pitas I. Exploiting discriminant information in nonnegative matrix factorization with application to frontal face verification. IEEE Transactions on Neural Networks, 2006, 17(3): 683–695
CrossRef
Google scholar
|
[16] |
Li X, Fukui K. Fisher non-negative matrix factorization with pairwise weighting. In: Proceedings of MVA. 2007, 380–383
|
[17] |
Kotsia I, Zafeiriou S, Pitas I. A novel discriminant non-negative matrix factorization algorithm with applications to facial image characterization problems. IEEE Transactions on Information Forensics and Security, 2007, 2(3): 588–595
CrossRef
Google scholar
|
[18] |
Nieto O, Jehan T. Convex non-negative matrix factorization for automatic music structure identification. In: Proceedings of the 2013 IEEE International Conference on Acoustics, Speech and Signal Processing. 2013, 236–240
CrossRef
Google scholar
|
[19] |
Huang K, Sidiropoulos N D, Swami A. Non-negative matrix factorization revisited: Uniqueness and algorithm for symmetric decomposition. IEEE Transactions on Signal Processing, 2014, 62(1): 211–224
CrossRef
Google scholar
|
[20] |
Yanez F, Bach F. Primal-dual algorithms for non-negative matrix factorization with the kullback-leibler divergence. arXiv preprint arXiv:1412.1788, 2014
|
[21] |
Wang J J Y, Gao X. Max–min distance nonnegative matrix factorization. Neural Networks, 2015, 61: 75–84
CrossRef
Google scholar
|
[22] |
Kumar B G, Kotsia I, Patras I. Max-margin non-negative matrix factorization. Image and Vision Computing, 2012, 30(4): 279–291
CrossRef
Google scholar
|
[23] |
Kumar B G, Patras I, Kotsia I.Max-margin semi-NMF. In: Proceedings of the 22nd British Machine Vision Conference. 2011
CrossRef
Google scholar
|
[24] |
Donoho D, Stodden V. When does non-negative matrix factorization give a correct decomposition into parts? In: Proceedings of the Neural Information Processing Systems Conference. 2003, 1141–1148
|
[25] |
Hoyer P O. Non-negative matrix factorization with sparseness constraints. The Journal of Machine Learning Research, 2004, 5: 1457–1469
|
[26] |
Tan X, Triggs B. Enhanced local texture feature sets for face recognition under difficult lighting conditions. IEEE Transactions on Image Processing, 2010, 19(6): 1635–1650
CrossRef
Google scholar
|
[27] |
Wang Y, Liu J, Tang X. Robust 3D face recognition by local shape difference boosting. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2010, 32(10): 1858–1870
CrossRef
Google scholar
|
[28] |
Wang X, Ling H, Xu X. Parts-based face super-resolution via nonnegative matrix factorization. Computers & Electrical Engineering, 2014, 40(8): 130–141
CrossRef
Google scholar
|
[29] |
Sharma G, Jurie F, Pérez P. EPML: expanded parts based metric learning for occlusion robust face verification. In: Proceedings of the 12th Asian Conference on Computer Vision. 2014, 1–15
|
[30] |
Tang Z, Zhang X, Zhang S. Robust perceptual image hashing based on ring partition and nmf. IEEE Transactions on Knowledge and Data Engineering, 2014, 26(3): 711–724
CrossRef
Google scholar
|
[31] |
Tian Q, Chen S, Tan X. Comparative study among three strategies of incorporating spatial structures to ordinal image regression. Neurocomputing,2014, 136: 152–161
CrossRef
Google scholar
|
[32] |
Li S Z, Hou X W, Zhang H J, Cheng Q S. Learning spatially localized, parts-based representation. In: Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2001
CrossRef
Google scholar
|
[33] |
Jiang B, Zhao H, Tang J, Luo B. A sparse nonnegative matrix factorization technique for graph matching problems. Pattern Recognition, 2014, 47(2): 736–747
CrossRef
Google scholar
|
[34] |
Zeng K, Yu J, Li C, You J, Jin T. Image clustering by hyper-graph regularized non-negative matrix factorization. Neurocomputing, 2014, 138: 209–217
CrossRef
Google scholar
|
[35] |
Zheng W S, Lai J, Liao S, He R. Extracting non-negative basis images using pixel dispersion penalty. Pattern Recognition, 2012, 45(8): 2912–2926
CrossRef
Google scholar
|
[36] |
Chen X, Li C, Cai D. Spatially correlated nonnegative matrix factorization for image analysis. In: Proceedings of the 3rd Sino-foreigninterchange Workshop on Intelligent Science and Intelligent Data Engineering. 2012, 148–157
|
[37] |
Chen X, Li C, Liu H, Cai D. Spatially correlated nonnegative matrix factorization. Neurocomputing, 2014, 139: 15–21
CrossRef
Google scholar
|
[38] |
Wu J, Qu W, Hu H, Li Z, Xu Y, Tao Y. A discriminative spatial bagof-word scheme with distinct patch. In: Proceedings of the 2014 International Conference on Audio, Language and Image Processing. 2014, 266–271
|
[39] |
Mu Y, Ding W, Tao D. Local discriminative distance metrics ensemble learning. Pattern Recognition, 2013, 46(8): 2337–2349
CrossRef
Google scholar
|
[40] |
Lawton W H, Sylvestre E A. Self modeling curve resolution. Technometrics, 1971, 13(3): 617–633
CrossRef
Google scholar
|
[41] |
Paatero P, Tapper U. Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values. Environmetrics, 1994, 5(2): 111–126
CrossRef
Google scholar
|
[42] |
Chen X, Tong Z, Liu H, Cai D. Metric learning with two-dimensional smoothness for visual analysis. In: Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition. 2012, 2533–2538
CrossRef
Google scholar
|
[43] |
Cai D, He X, Wu X, Han J. Non-negative matrix factorization on manifold. In: Proceedings of the 8th IEEE International Conference on Data Mining. 2008, 63–72
CrossRef
Google scholar
|
[44] |
Cai D, He X, Han J, Huang T S. Graph regularized nonnegative matrix factorization for data representation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011, 33(8): 1548–1560
CrossRef
Google scholar
|
[45] |
Ando R K, Zhang T. Learning on graph with laplacian regularization. Advances in Neural Information Processing Systems, 2007, 19–25
|
[46] |
Fidler S, Skocaj D, Leonardis A. Combining reconstructive and discriminative subspace methods for robust classification and regression by subsampling. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2006, 28(3): 337–350
CrossRef
Google scholar
|
[47] |
Basilevsky A T. Statistical Factor Analysis and Related Methods: Theory and Applications. New York: John Wiley & Sons, 2009
|
[48] |
Martínez A M, Kak A C. PCA versus IDA. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2001, 23(2): 228–233
CrossRef
Google scholar
|
[49] |
Georghiades A S, Belhumeur P N, Kriegman D J. From few to many: illumination cone models for face recognition under variable lighting and pose. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2001, 23(6): 643–660
CrossRef
Google scholar
|
[50] |
Hull J J. A database for handwritten text recognition research. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1994, 16(5): 550–554
CrossRef
Google scholar
|
[51] |
Schuldt C, Laptev I, Caputo B. Recognizing human actions: a local svm approach. In: Proceedings of the 17th International Conference on Pattern Recognition. 2004, 32–36
CrossRef
Google scholar
|
[52] |
Hido S, Tsuboi Y, Kashima H, Sugiyama M, Kanamori T. Inlier-based outlier detection via direct density ratio estimation. In: Proceedings of the 8th IEEE International Conference on Data Mining. 2008, 223–232
CrossRef
Google scholar
|
[53] |
Dalal BN. T. Histograms of oriented gradients for human detection. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2005, 886–893
CrossRef
Google scholar
|
/
〈 | 〉 |