Collections

Multimedia and Graphics
Quality article selection in Multimedia and Graphics field
Publication years
Loading ...
Article types
Loading ...
  • Select all
  • RESEARCH ARTICLE
    Xiang FENG, Wanggen WAN, Richard Yi Da XU, Haoyu CHEN, Pengfei LI, J. Alfredo SÁNCHEZ
    Frontiers of Computer Science, 2018, 12(4): 798-812. https://doi.org/10.1007/s11704-017-6328-x

    In computer graphics, various processing operations are applied to 3D triangle meshes and these processes of ten involve distortions, which affect the visual quality of surface geometry. In this context, perceptual quality assessment of 3D triangle meshes has become a crucial issue. In this paper, we propose a new objective quality metric for assessing the visual difference between a reference mesh and a corresponding distorted mesh. Our analysis indicates that the overall quality of a distorted mesh is sensitive to the distortion distribution. The proposed metric is based on a spatial pooling strategy and statistical descriptors of the distortion distribution. We generate a perceptual distortion map for vertices in the reference mesh while taking into account the visual masking effect of the human visual system. The proposed metric extracts statistical descriptors from the distortion map as the feature vector to represent the overall mesh quality. With the feature vector as input, we adopt a support vector regression model to predict the mesh quality score.We validate the performance of our method with three publicly available databases, and the comparison with state-of-the-art metrics demonstrates the superiority of our method. Experimental results show that our proposed method achieves a high correlation between objective assessment and subjective scores.

  • RESEARCH ARTICLE
    Juan ZHANG, Fuqing DUAN, Mingquan ZHOU, Dongcan JIANG, Xuesong WANG, Zhongke WU, Youliang HUANG, Guoguang DU, Shaolong LIU, Pengbo ZHOU, Xiangang SHANG
    Frontiers of Computer Science, 2018, 12(4): 777-797. https://doi.org/10.1007/s11704-016-5511-9

    This paper presents a method for simulating surface crack patterns appearing in ceramic glaze, glass, wood and mud. It uses a physically and heuristically combined method to model this type of crack pattern. A stress field is defined heuristically over the triangle mesh of an object. Then, a first-order quasi-static cracking node method (CNM) is used to model deformation. A novel combined stress and energy combined crack criterion is employed to address crack initiation and propagation separately according to physics. Meanwhile, a highest-stress-first rule is applied in crack initiation, and a breadth-first rule is applied in crack propagation. Finally, a local stress relaxation step is employed to evolve the stress field and avoid shattering artifacts. Other related issues are also discussed, such as the elimination of quadrature sub-cells, the prevention of parallel cracks and spurious crack procession. Using this method, a variety of crack patterns observed in the real world can be reproduced by changing a set of parameters. Consequently, our method is robust because the computational mesh is independent of dynamic cracks and has no sliver elements. We evaluate the realism of our results by comparing them with photographs of realworld examples. Further, we demonstrate the controllability of our method by varying different parameters.

  • REVIEW ARTICLE
    Xueming WANG, Zechao LI, Jinhui TANG
    Frontiers of Computer Science, 2018, 12(3): 406-422. https://doi.org/10.1007/s11704-017-6377-1

    With the rapid increase in social websites that has dramatically increased the volume of social media, which includes the use of images and videos, visual understanding has attracted great interest in several areas such as multimedia, computer vision, and pattern recognition. Valuable auxiliary resources available on social websites, such as user-provided tags, aid in the tasks of visual understanding. Therefore, several methods have been proposed for exploring the auxiliary resources for tag refinement, image retrieval, and media summarization. This work conducts a comprehensive survey of recent advances in visual understanding by mining social media in order to discuss their merits and limitations. We then analyze the difficulties and challenges of visual understanding followed by several possible future research directions.

  • PERSPECTIVE
    Kun ZHOU
    Frontiers of Computer Science, 2017, 11(5): 743-745. https://doi.org/10.1007/s11704-017-7901-z
  • RESEARCH ARTICLE
    Zhenxue HE, Limin XIAO, Fei GU, Tongsheng XIA, Shubin SU, Zhisheng HUO, Rong ZHANG, Longbing ZHANG, Li RUAN, Xiang WANG
    Frontiers of Computer Science, 2017, 11(4): 728-742. https://doi.org/10.1007/s11704-016-5259-2

    Although the genetic algorithm has been widely used in the polarity optimization of mixed polarity Reed- Muller (MPRM) logic circuits, few studies have taken into account the polarity conversion sequence. In order to improve the efficiency of polarity optimization of MPRM logic circuits, we propose an efficient and fast polarity optimization approach (FPOA) considering the polarity conversion sequence. The main idea behind the FPOA is that, firstly, the best polarity conversion sequence of the polarity set waiting for evaluation is obtained by using the proposed hybrid genetic algorithm (HGA); secondly, each of polarity in the polarity set is converted according to the best polarity conversion sequence obtained by HGA. Our proposed FPOA is implemented in C and a comparative analysis has been presented for MCNC benchmark circuits. The experimental results show that for the circuits with more variables, the FPOA is highly effective in improving the efficiency of polarity optimization of MPRM logic circuits compared with the traditional polarity optimization approach which neglects the polarity conversion sequence and the improved polarity optimization approach with heuristic technique.

  • RESEARCH ARTICLE
    Sudipta ROY, Debnath BHATTACHARYYA, Samir Kumar BANDYOPADHYAY, Tai-Hoon KIM
    Frontiers of Computer Science, 2017, 11(4): 717-727. https://doi.org/10.1007/s11704-016-5129-y

    This paper propose a computerized method of magnetic resonance imaging (MRI) of brain binarization for the uses of preprocessing of features extraction and brain abnormality identification. One of the main problems of MRI binarization is that many pixels of brain part cannot be correctly binarized due to extensive black background or large variation in contrast between background and foreground of MRI. We have proposed a binarization that uses mean, variance, standard deviation and entropy to determine a threshold value followed by a non-gamut enhancement which can overcome the binarization problem of brain component. The proposed binarization technique is extensively tested with a variety of MRI and generates good binarization with improved accuracy and reduced error. A comparison is carried out among the obtained outcome with this innovative method with respect to other well-known methods.

  • RESEARCH ARTICLE
    Yang-Yen OU, Ta-Wen KUAN, Anand PAUL, Jhing-Fa WANG, An-Chao TSAI
    Frontiers of Computer Science, 2017, 11(3): 429-443. https://doi.org/10.1007/s11704-016-6190-2

    This work presents a spoken dialog summarization system with HAPPINESS/SUFFERING factor recognition. The semantic content is compressed and classified by factor categories from spoken dialog. The transcription of automatic speech recognition is then processed through Chinese Knowledge and Information Processing segmentation system. The proposed system also adopts the part-of-speech tags to effectively select and rank the keywords. Finally, the HAPPINESS/SUFFERING factor recognition is done by the proposed point-wise mutual information. Compared with the original method, the performance is improved by applying the significant scores of keywords. The experimental results show that the average precision rate for factor recognition in outside test can reach 73.5% which demonstrates the possibility and potential of the proposed system.

  • RESEARCH ARTICLE
    Yougen YUAN, Lei XIE, Zhong-Hua FU, Ming XU, Qi CONG
    Frontiers of Computer Science, 2017, 11(3): 419-428. https://doi.org/10.1007/s11704-016-6182-2

    3D audio effects can provide immersive auditory experience, but we often face the so-called in-head localization (IHL) problem in headphone sound reproduction. To address this problem, we propose an effective sound image externalization approach. Specifically, we consider several important factors related to sound propagation, which include image-source model based early reflections with distance decay, wall absorption and air absorption, late reverberation and other dynamic factors like head movement. We apply our sound image externalization approach to a headphone based real-time 3D audio system. Subjective listening tests show that the sound image externalization performance is significantly improved and the sound source direction is preserved as well. A/B preference test further shows that, as compared with a recent popular approach, the proposed approach is mostly preferred by the listeners.

  • RESEARCH ARTICLE
    Zhong-Hua FU
    Frontiers of Computer Science, 2017, 11(3): 408-418. https://doi.org/10.1007/s11704-016-6110-5

    Beamforming using sensor array is widely used in spatial signal processing since it offers better spatial focusing capability than single sensor. However, in practical applications for broadband signal, there always exists a trade-off issue between the directivity capability of an array and its robustness on system errors. In this paper, in order to combine merits of different beamformers instead of trade-off their performances, we propose a constrained minimum-power combination method. We firstly analyze two optimal beamformers that maximize Directivity Factor (DF) and White Noise Gain (WNG) respectively. Then we propose a non-linear combination method, which automatically selects the best beamformer that has the minimum output power, so as to control the unwanted white noise amplification and keep the maximum DF if possible. Two solutions to the proposed combination strategy are given. They do not need to determine the correct trade-off factor used in linear combination method, and avoid challenge estimations on noise and target statistics required in adaptive beamforming. The performance of the proposed beamformer is evaluated in ideal noise fields and complicated noise fields respectively. It is shown that the proposed beamformer integrates merits of different beamformers. It always achieves the best speech quality and biggest noise reduction compared to other popular beamformers.

  • RESEARCH ARTICLE
    Yue XIE,Ye YUAN,Xiang CHEN,Changxi ZHENG,Kun ZHOU
    Frontiers of Computer Science, 2017, 11(2): 332-346. https://doi.org/10.1007/s11704-016-5465-y

    In this paper we propose an optimization framework for interior carving of 3D fabricated shapes. Interior carving is an important technique widely used in industrial and artistic designs to achieve functional purposes by hollowing interior shapes in objects. We formulate such functional purpose as the objective function of an optimization problem whose solution indicates the optimal interior shape. In contrast to previous volumetric methods, we directly represent the boundary of the interior shape as a triangular mesh. We use Eulerian semiderivative to relate the time derivative of the object function to a virtual velocity field and iteratively evolve the interior shape guided by the velocity field with surface tracking. In each iteration, we compute the velocity field guaranteeing the decrease of objective function by solving a linear programming problem. We demonstrate this general framework in a novel application of designing objects floating in fluid and two previously investigated applications, and print various optimized objects to verify its effectiveness.

  • RESEARCH ARTICLE
    Lyes ABADA,Saliha AOUAT
    Frontiers of Computer Science, 2017, 11(2): 320-331. https://doi.org/10.1007/s11704-016-5255-6

    The number of constraints imposed on the surface, the light source, the camera model and in particular the initial information makes shape from shading (SFS) very difficult for real applications. There are a considerable number of approaches which require an initial data about the 3D object such as boundary conditions (BC). However, it is difficult to obtain these information for each point of the object Edge in the image, thus the application of these approaches is limited. This paper shows an improvement of the Global View method proposed by Zhu and Shi [1]. The main improvement is that we make the resolution done automatically without any additional information on the 3D object. The method involves four steps. The first step is to determine the singular curves and the relationship between them. In the second step, we generate the global graph, determine the sub-graphs, and determine the partial and global configuration. The proposed method to determine the convexity and the concavity of the singular curves is applied in the third step. Finally, we apply the Fast-Marching method to reconstruct the 3D object. Our approach is successfully tested on some synthetic and real images. Also, the obtained results are compared and discussed with some previous methods.

  • REVIEW ARTICLE
    Junhua LU,Wei CHEN,Yuxin MA,Junming KE,Zongzhuang LI,Fan ZHANG,Ross MACIEJEWSKI
    Frontiers of Computer Science, 2017, 11(2): 192-207. https://doi.org/10.1007/s11704-016-6028-y

    A wide variety of predictive analytics techniques have been developed in statistics, machine learning and data mining; however, many of these algorithms take a black-box approach in which data is input and future predictions are output with no insight into what goes on during the process. Unfortunately, such a closed system approach often leaves little room for injecting domain expertise and can result in frustration from analysts when results seem spurious or confusing. In order to allow for more human-centric approaches, the visualization community has begun developing methods to enable users to incorporate expert knowledge into the prediction process at all stages, including data cleaning, feature selection, model building and model validation. This paper surveys current progress and trends in predictive visual analytics, identifies the common framework in which predictive visual analytics systems operate, and develops a summarization of the predictive analytics workflow.

  • REVIEW ARTICLE
    Hao ZHU,Yongming NIE,Tao YUE,Xun CAO
    Frontiers of Computer Science, 2017, 11(2): 175-191. https://doi.org/10.1007/s11704-016-5520-8

    The prior knowledge is the significant supplement to image-based 3D modeling algorithms for refining the fragile consistency-based stereo. In this paper, we review the image-based 3D modeling problem according to prior categories, i.e., classical priors and specific priors. The classical priors including smoothness, silhouette and illumination are well studied for improving the accuracy and robustness of the 3D reconstruction. In recent years, various specific priors which take advantage of Manhattan rule, geometry template and trained category features have been proposed to enhance the modeling performance. The advantages and limitations of both kinds of priors are discussed and evaluated in the paper. Finally, we discuss the trend and challenges of the prior studies in the future.