A hybrid-model optimization algorithm based on the Gaussian process and particle swarm optimization for mixed-variable CNN hyperparameter automatic search

Han YAN, Chongquan ZHONG, Yuhu WU, Liyong ZHANG, Wei LU

PDF(4471 KB)
PDF(4471 KB)
Front. Inform. Technol. Electron. Eng ›› 2023, Vol. 24 ›› Issue (11) : 1557-1573. DOI: 10.1631/FITEE.2200515
Orginal Article
Orginal Article

A hybrid-model optimization algorithm based on the Gaussian process and particle swarm optimization for mixed-variable CNN hyperparameter automatic search

Author information +
History +

Abstract

Convolutional neural networks (CNNs) have been developed quickly in many real-world fields. However, CNN’s performance depends heavily on its hyperparameters, while finding suitable hyperparameters for CNNs working in application fields is challenging for three reasons: (1) the problem of mixed-variable encoding for different types of hyperparameters in CNNs, (2) expensive computational costs in evaluating candidate hyperparameter configuration, and (3) the problem of ensuring convergence rates and model performance during hyperparameter search. To overcome these problems and challenges, a hybrid-model optimization algorithm is proposed in this paper to search suitable hyperparameter configurations automatically based on the Gaussian process and particle swarm optimization (GPPSO) algorithm. First, a new encoding method is designed to efficiently deal with the CNN hyperparameter mixed-variable problem. Second, a hybrid-surrogate-assisted model is proposed to reduce the high cost of evaluating candidate hyperparameter configurations. Third, a novel activation function is suggested to improve the model performance and ensure the convergence rate. Intensive experiments are performed on image-classification benchmark datasets to demonstrate the superior performance of GPPSO over state-of-the-art methods. Moreover, a case study on metal fracture diagnosis is carried out to evaluate the GPPSO algorithm performance in practical applications. Experimental results demonstrate the effectiveness and efficiency of GPPSO, achieving accuracy of 95.26% and 76.36% only through 0.04 and 1.70 GPU days on the CIFAR-10 and CIFAR-100 datasets, respectively.

Keywords

Convolutional neural network / Gaussian process / Hybrid model / Hyperparameter optimization / Mixed-variable / Particle swarm optimization

Cite this article

Download citation ▾
Han YAN, Chongquan ZHONG, Yuhu WU, Liyong ZHANG, Wei LU. A hybrid-model optimization algorithm based on the Gaussian process and particle swarm optimization for mixed-variable CNN hyperparameter automatic search. Front. Inform. Technol. Electron. Eng, 2023, 24(11): 1557‒1573 https://doi.org/10.1631/FITEE.2200515

RIGHTS & PERMISSIONS

2023 Zhejiang University Press
PDF(4471 KB)

Accesses

Citations

Detail

Sections
Recommended

/