1 Introduction
Fig.1 Comparison of the proposed combined methodology and two conventional frameworks. (a) Matrix decomposition (MD) way; (b) matrix factorization (MF) way; (c) the proposed JMDF: we simultaneously take FG model, BG model, and noise model into account by extending the fuzzy membership degree |
2 Related work
3 Proposed method
3.1 Overall model
Fig.2 Flowchart of the proposed framework. First, the video BG is modeled by fuzzy factorization. Second, background subtraction is conducted, and then the Spatio-temporal constraints are applied to obtain the FG component. After that, the Gaussian noise in residual and the FG component update the membership degree together. Finally, the above process is iterated until convergence |
3.2 Fuzzy factorization of BG
3.3 Spatio-temporal constraints of FG
Fig.4 On the left, the matrix represents the image to be processed, where the label is the index of each pixel. The sub-graph on the right top shows the conventional neighborhood of pixels. The right bottom illustrates some potential group neighborhood patterns located on the different positions of the image under different circular neighborhood radius |
3.4 JMDF model formulation and solution
4 Experiments
4.1 FG estimation on I2R
Tab.1 Comparison of foreground detection results on i2r dataset |
Video | PCP [9] | DECOLOR [15] | E-LSD [43] | ROUTE [12] | OMoGMF+TV [6] | GSTO [62] | JMDF01 | JMDF02 | JMDF03 |
---|---|---|---|---|---|---|---|---|---|
Bootstrap | 0.639 | 0.581 | 0.685 | 0.641 | 0.669 | 0.711 | 0.673 | 0.690 | 0.693 |
Campus | 0.444 | 0.767 | 0.784 | 0.409 | 0.813 | 0.810 | 0.763 | 0.826 | 0.828 |
Curtain | 0.692 | 0.781 | 0.832 | 0.785 | 0.836 | 0.812 | 0.883 | 0.892 | 0.914 |
Escalator | 0.572 | 0.724 | 0.715 | 0.588 | 0.651 | 0.733 | 0.660 | 0.691 | 0.734 |
Fountain | 0.683 | 0.833 | 0.831 | 0.727 | 0.825 | 0.855 | 0.871 | 0.881 | 0.884 |
Hall | 0.520 | 0.643 | 0.671 | 0.615 | 0.652 | 0.673 | 0.671 | 0.683 | 0.690 |
ShopMall | 0.692 | 0.671 | 0.744 | 0.707 | 0.701 | 0.747 | 0.742 | 0.741 | 0.748 |
WaterSurface | 0.781 | 0.836 | 0.887 | 0.853 | 0.902 | 0.901 | 0.929 | 0.938 | 0.940 |
Lobby | 0.652 | 0.607 | 0.742 | 0.711 | 0.787 | 0.813 | 0.835 | 0.844 | 0.834 |
Average | 0.631 | 0.716 | 0.766 | 0.671 | 0.760 | 0.784 | 0.781 | 0.798 | 0.807 |
4.2 FG Estimation on CDnet2014
Tab.2 Comparison of foreground detection results on cdnet2014 dataset |
Sequence | Video | PCP [9] | DECOLOR [15] | E-LSD [43] | ROUTE [12] | OMoGMF+TV [6] | GSTO [62] | JMDF01 | JMDF02 | JMDF03 |
---|---|---|---|---|---|---|---|---|---|---|
Baseline | highway | 0.791 | 0.881 | 0.963 | 0.752 | 0.935 | 0.937 | 0.982 | 0.992 | 0.979 |
office | 0.643 | 0.843 | 0.924 | 0.882 | 0.845 | 0.906 | 0.876 | 0.886 | 0.903 | |
pedestrians | 0.952 | 0.701 | 0.990 | 0.953 | 0.982 | 0.943 | 0.987 | 0.994 | 0.984 | |
PETS2006 | 0.690 | 0.908 | 0.812 | 0.675 | 0.806 | 0.842 | 0.861 | 0.873 | 0.833 | |
Average | 0.769 | 0.833 | 0.922 | 0.816 | 0.892 | 0.907 | 0.927 | 0.936 | 0.925 | |
BadWeather | skating | 0.652 | 0.927 | 0.921 | 0.832 | 0.901 | 0.945 | 0.961 | 0.963 | 0.953 |
snowFall | 0.626 | 0.908 | 0.795 | 0.742 | 0.813 | 0.846 | 0.935 | 0.940 | 0.932 | |
blizzard | 0.883 | 0.851 | 0.873 | 0.902 | 0.867 | 0.918 | 0.942 | 0.942 | 0.941 | |
wetSnow | 0.608 | 0.904 | 0.861 | 0.632 | 0.822 | 0.861 | 0.886 | 0.884 | 0.880 | |
Average | 0.692 | 0.898 | 0.863 | 0.777 | 0.851 | 0.893 | 0.931 | 0.932 | 0.927 | |
LowFramerate | port_0_17fps | 0.062 | 0.041 | 0.454 | 0.052 | 0.382 | 0.251 | 0.314 | 0.394 | 0.442 |
tramCrossroad_1fps | 0.756 | 0.782 | 0.864 | 0.761 | 0.854 | 0.832 | 0.844 | 0.843 | 0.842 | |
tunnelExit_0_35fps | 0.583 | 0.678 | 0.622 | 0.545 | 0.559 | 0.742 | 0.783 | 0.761 | 0.734 | |
turnpike_0_5fps | 0.752 | 0.721 | 0.897 | 0.752 | 0.883 | 0.798 | 0.895 | 0.901 | 0.903 | |
Average | 0.541 | 0.556 | 0.709 | 0.528 | 0.670 | 0.656 | 0.709 | 0.725 | 0.730 | |
Turbulence | turbulence0 | 0.692 | 0.344 | 0.754 | 0.632 | 0.728 | 0.581 | 0.826 | 0.882 | 0.883 |
turbulence1 | 0.382 | 0.421 | 0.693 | 0.543 | 0.798 | 0.715 | 0.859 | 0.870 | 0.852 | |
turbulence2 | 0.043 | 0.472 | 0.987 | 0.512 | 0.993 | 0.871 | 0.982 | 0.984 | 0.973 | |
turbulence3 | 0.845 | 0.702 | 0.936 | 0.859 | 0.926 | 0.860 | 0.941 | 0.954 | 0.960 | |
Average | 0.491 | 0.482 | 0.843 | 0.637 | 0.861 | 0.757 | 0.902 | 0.923 | 0.917 | |
Thermal | corridor | 0.402 | 0.973 | 0.881 | 0.771 | 0.962 | 0.923 | 0.991 | 0.993 | 0.992 |
diningRoom | 0.464 | 0.911 | 0.724 | 0.616 | 0.608 | 0.905 | 0.772 | 0.793 | 0.784 | |
lakeSide | 0.673 | 0.722 | 0.435 | 0.684 | 0.322 | 0.776 | 0.680 | 0.662 | 0.604 | |
library | 0.467 | 0.536 | 0.971 | 0.885 | 0.983 | 0.952 | 0.985 | 0.991 | 0.989 | |
park | 0.618 | 0.814 | 0.727 | 0.634 | 0.617 | 0.859 | 0.856 | 0.851 | 0.853 | |
Average | 0.525 | 0.791 | 0.748 | 0.718 | 0.698 | 0.883 | 0.857 | 0.858 | 0.844 | |
IntermittentObjectMotion | abandonedBox | 0.710 | 0.721 | 0.932 | 0.823 | 0.924 | 0.906 | 0.904 | 0.904 | 0.882 |
parking | 0.622 | 0.211 | 0.382 | 0.383 | 0.274 | 0.801 | 0.768 | 0.767 | 0.624 | |
sofa | 0.634 | 0.732 | 0.703 | 0.632 | 0.690 | 0.693 | 0.691 | 0.698 | 0.696 | |
streetLight | 0.421 | 0.643 | 0.614 | 0.440 | 0.591 | 0.633 | 0.631 | 0.653 | 0.614 | |
tramstop | 0.242 | 0.350 | 0.352 | 0.243 | 0.336 | 0.376 | 0.347 | 0.364 | 0.353 | |
winterDriveway | 0.371 | 0.808 | 0.766 | 0.773 | 0.646 | 0.795 | 0.784 | 0.785 | 0.784 | |
Average | 0.500 | 0.578 | 0.625 | 0.549 | 0.577 | 0.701 | 0.688 | 0.695 | 0.659 | |
DynamicBackground | boats | 0.426 | 0.903 | 0.931 | 0.463 | 0.907 | 0.905 | 0.902 | 0.916 | 0.954 |
canoe | 0.121 | 0.264 | 0.822 | 0.434 | 0.801 | 0.778 | 0.887 | 0.933 | 0.923 | |
fall | 0.445 | 0.707 | 0.701 | 0.363 | 0.566 | 0.828 | 0.679 | 0.764 | 0.842 | |
fountain01 | 0.041 | 0.024 | 0.081 | 0.053 | 0.042 | 0.184 | 0.171 | 0.220 | 0.171 | |
fountain02 | 0.722 | 0.726 | 0.727 | 0.744 | 0.805 | 0.824 | 0.854 | 0.832 | 0.784 | |
overpass | 0.492 | 0.845 | 0.793 | 0.711 | 0.802 | 0.872 | 0.858 | 0.875 | 0.871 | |
Average | 0.375 | 0.578 | 0.676 | 0.461 | 0.654 | 0.732 | 0.725 | 0.762 | 0.758 | |
Overall average | 0.556 | 0.674 | 0.769 | 0.641 | 0.743 | 0.790 | 0.820 | 0.833 | 0.823 |
4.3 Noise robustness of JMDF
Fig.8 Robustness to different noises. The source video frame and the result from our JMDF are displayed in the lower right corner of the figure. We add Gaussian noise, speckle noise, salt and pepper noise, and Poisson noise to the source video, respectively. Except for Poisson noise, we fix the noise mean to 0, and change the noise variance, whose values are shown in the first row of each sub-figure. Then, we plot the noisy frame examples in the second row and the results from JMDF in the third row. As the variance increases, the video gradually becomes blurred, which increases the difficulty of the detection task. Our method achieves good performance even in videos with large noise variance |