Oct 2020, Volume 21 Issue 10
    

  • Select all
  • Perspective
    Jun WANG, Rong LI, Jian WANG, Yi-qun GE, Qi-fan ZHANG, Wu-xian SHI
    2020, 21(10): 1413-1425. https://doi.org/10.1631/FITEE.1900527

    The applications of artificial intelligence (AI) and machine learning (ML) technologies in wireless communications have drawn significant attention recently. AI has demonstrated real success in speech understanding, image identification, and natural language processing domains, thus exhibiting its great potential in solving problems that cannot be easily modeled. AI techniques have become an enabler in wireless communications to fulfill the increasing and diverse requirements across a large range of application scenarios. In this paper, we elaborate on several typical wireless scenarios, such as channel modeling, channel decoding and signal detection, and channel coding design, in which AI plays an important role in wireless communications. Then, AI and information theory are discussed from the viewpoint of the information bottleneck. Finally, we discuss some ideas about how AI techniques can be deeply integrated with wireless communication systems.

  • Orginal Article
    Jing WANG, Wei-wei LIANG, Yue-hua NIU, Lan GAO, Wei-gong ZHANG
    2020, 21(10): 1426-1441. https://doi.org/10.1631/FITEE.2000089

    The demise of Dennard’s scaling has created both power and utilization wall challenges for computer systems. As transistors operating in the near-threshold region are able to obtain flexible trade-offs between power and performance, it is regarded as an alternative solution to the scaling challenge. A reduction in supply voltage will nevertheless generate significant reliability challenges, while maintaining an error-free system that generates high costs in both performance and energy consumption. The main purpose of research on computer architecture has therefore shifted from performance improvement to complex multi-objective optimization. In this paper, we propose a three-dimensional optimization approach which can effectively identify the best system configuration to establish a balance among performance, energy, and reliability. We use a dynamic programming algorithm to determine the proper voltage and approximate level based on three predictors: system performance, energy consumption, and output quality. We propose an output quality predictor which uses a hardware/software co-design fault injection platform to evaluate the impact of the error on output quality under near-threshold computing (NTC). Evaluation results demonstrate that our approach can lead to a 28% improvement in output quality with a 10% drop in overall energy efficiency; this translates to an approx mately 20% average improvement in accuracy, power, and performance.

  • Orginal Article
    Zhuo-hao LIU, Chang-yu DIAO, Wei XING, Dong-ming LU
    2020, 21(10): 1442-1454. https://doi.org/10.1631/FITEE.1900451

    Generally, the distributed bundle adjustment (DBA) method uses multiple worker nodes to solve the bundle adjustment problems and overcomes the computation and memory storage limitations of a single computer. However, the performance considerably degrades owing to the overhead introduced by the additional block partitioning step and synchronous waiting. Therefore, we propose a low-overhead consensus framework. A partial barrier based asynchronous method is proposed to early achieve consensus with respect to the faster worker nodes to avoid waiting for the slower ones. A scene summarization procedure is designed and integrated into the block partitioning step to ensure that clustering can be performed on the small summarized scene. Experiments conducted on public datasets show that our method can improve the worker node utilization rate and reduce the block partitioning time. Also, sample applications are demonstrated using our large-scale culture heritage datasets.

  • Orginal Article
    Wei-tao YOU, Hao JIANG, Zhi-yuan YANG, Chang-yuan YANG, Ling-yun SUN
    2020, 21(10): 1455-1466. https://doi.org/10.1631/FITEE.1900367

    Images are widely used by companies to advertise their products and promote awareness of their brands. The automatic synthesis of advertising images is challenging because the advertising message must be clearly conveyed while complying with the style required for the product, brand, or target audience. In this study, we proposed a data-driven method to capture individual design attributes and the relationships between elements in advertising images with the aim of automatically synthesizing the input of elements into an advertising image according to a specified style. To achieve this multi-format advertisement design, we created a dataset containing 13 280 advertising images with rich annotations that encompassed the outlines and colors of the elements, in addition to the classes and goals of the advertisements. Using our probabilistic models, users guided the style of synthesized advertisements via additional constraints (e.g., context-based keywords). We applied our method to a variety of design tasks, and the results were evaluated in several perceptual studies, which showed that our method improved users’ satisfaction by 7.1% compared to designs generated by nonprofessional students, and that more users preferred the coloring results of our designs to those generated by the color harmony model and Colormind.

  • Orginal Article
    Shui-wang LI, Qian-bo JIANG, Qi-jun ZHAO, Li LU, Zi-liang FENG
    2020, 21(10): 1467-1484. https://doi.org/10.1631/FITEE.1900507

    Discriminative correlation filters (DCF) are efficient in visual tracking and have advanced the field significantly. However, the symmetry of correlation (or convolution) operator results in computational problems and does harm to the generalized translation equivariance. The former problem has been approached in many ways, whereas the latter one has not been well recognized. In this paper, we analyze the problems with the symmetry of circular convolution and propose an asymmetric one, which as a generalization of the former has a weak generalized translation equivariance property. With this operator, we propose a tracker called the asymmetric discriminative correlation filter (ADCF), which is more sensitive to translations of targets. Its asymmetry allows the filter and the samples to have different sizes. This flexibility makes the computational complexity of ADCF more controllable in the sense that the number of filter parameters will not grow with the sample size. Moreover, the normal matrix of ADCF is a block matrix with each block being a two-level block Toeplitz matrix. With this well-structured normal matrix, we design an algorithm for multiplying an N × N two-level block Toeplitz matrix by a vector with time complexity O(NlogN) and space complexity O(N), instead of O(N2). Unlike DCF-based trackers, introducing spatial or temporal regularization does not increase the essential computational complexity of ADCF. Comparative experiments are performed on a synthetic dataset and four benchmarks, including OTB-2013, OTB-2015, VOT-2016, and Temple-Color, and the results show that our method achieves state-of-the-art visual tracking performance.

  • Orginal Article
    Li XU, Guo HUANG, Qing-li CHEN, Hong-yin QIN, Tao MEN, Yi-fei PU
    2020, 21(10): 1485-1493. https://doi.org/10.1631/FITEE.1900727

    Given that the existing image denoising methods damage the texture details of an image, a new method based on fractional integration is proposed. First, the fractional-order integral formula is deduced by generalizing the Cauchy integral, and then the approximate value of the fractional-order integral operator is estimated by a numerical method. Finally, a fractional-order integral mask operator of any order is constructed in eight pixel directions of the image. Simulation results show that the proposed image denoising method can protect the edge texture information of the image while removing the noise. Moreover, this method can obtain higher image feature values and better image vision after denoising than the existing denoising methods, because a texture protection mechanism is adopted during the iterative processing.

  • Orginal Article
    Rui ZHOU, Yu FENG, Bin DI, Jiang ZHAO, Yan HU
    2020, 21(10): 1494-1503. https://doi.org/10.1631/FITEE.1900617

    We investigate cooperative target tracking of multiple unmanned aerial vehicles (UAVs) with a limited communication range. This is an integration of UAV motion control, target state estimation, and network topology control. We first present the communication topology and basic notations for network connectivity, and introduce the distributed Kalman consensus filter. Then, convergence and boundedness of the estimation errors using the filter are analyzed, and potential functions are proposed for communication link maintenance and collision avoidance. By taking stable target tracking into account, a distributed potential function based UAV motion controller is discussed. Since only the estimation of the target state rather than the state itself is available for UAV motion control and UAV motion can also affect the accuracy of state estimation, it is clear that the UAV motion control and target state estimation are coupled. Finally, the stability and convergence properties of the coupled system under bounded noise are analyzed in detail and demonstrated by simulations.

  • Orginal Article
    Guan-qing LI, Zhi-yong SONG, Qiang FU
    2020, 21(10): 1504-1520. https://doi.org/10.1631/FITEE.1900523

    Current methods for radar target detection usually work on the basis of high signal-to-clutter ratios. In this paper we propose a novel convolutional neural network based dual-activated clutter suppression algorithm, to solve the problem caused by low signal-to-clutter ratios in actual situations on the sea surface. Dual activation has two steps. First, we multiply the activated weights of the last dense layer with the activated feature maps from the upsample layer. Through this, we can obtain the class activation maps (CAMs), which correspond to the positive region of the sea clutter. Second, we obtain the suppression coefficients by mapping the CAM inversely to the sea clutter spectrum. Then, we obtain the activated range-Doppler maps by multiplying the coefficients with the raw range-Doppler maps. In addition, we propose a sampling-based data augmentation method and an effective multiclass coding method to improve the prediction accuracy. Measurement on real datasets verified the effectiveness of the proposed method.

  • Orginal Article
    Cong-ying CAI, Xiao-lan YAO
    2020, 21(10): 1521-1534. https://doi.org/10.1631/FITEE.1900586

    The super giant slalom (Super-G) is a speed event in alpine skiing, in which the skier trajectory has a significant influence on the athletes’ performances. It is a challenging task to determine an optimal trajectory for the skiers along the entire course because of the complexity and difficulty in the convergence of the optimization model. In this study, a trajectory optimization model for alpine skiers competing in the Super-G is established based on the optimal control theory, in which the objective is to minimize the runtime between the starting point and the finish line. The original trajectory optimization problem is converted into a multi-phase nonlinear optimal control problem solved with a pseudospectral method, and the trajectory parameters are optimized to discover the time-optimal trajectory. Using numerical solution carried out by the MATLAB optimization toolbox, the optimal trajectory is obtained under several equality and inequality constraints. Simulation results reveal the effectiveness and rationality of the trajectory optimization model. A test is carried out to show that our code works properly. In addition, several practical proposals are provided to help alpine skiers improve their training and skiing performance.