Robust object tracking with RGBD-based sparse learning
Zi-ang MA, Zhi-yu XIANG
Robust object tracking has been an important and challenging research area in computer vision for decades. With the increasing popularity of affordable depth sensors, range data is widely used in visual tracking because it provides robustness to varying illumination and occlusions. In this paper, a novel tracker based on RGBD data and sparse learning is proposed. The range data is integrated into the sparse learning framework in three respects. First, a depth view is added to the color-image-based visual features as an independent view for robust appearance modeling. Second, a special occlusion template set is designed to supplement the existing dictionary for handling various occlusion conditions. Finally, a depth-based occlusion detection method is proposed to efficiently and accurately determine when to update the templates. Extensive experiments on both the KITTI and Princeton data sets demonstrate that the proposed tracker outperforms state-of-the-art tracking algorithms, including both sparse learning and RGBD-based methods.
Object tracking / Sparse learning / Depth view / Occlusion templates / Occlusion detection