Image enhancement in turbid water using multiscale weighted features and attention mechanisms
Hao Zhang , Wenqi Zhang , Hui Yuan , Shaozhou Bai , Yanbing Tian , Zonghua Liu
Intelligent Marine Technology and Systems ›› 2025, Vol. 3 ›› Issue (1) : 35
Image enhancement in turbid water using multiscale weighted features and attention mechanisms
In this study, we propose a method to improve the image quality in turbid water by combining multiscale weighted features and attention mechanisms, effectively addressing issues such as reduced contrast and color casts. Initially, global color correction was employed using the grayscale space algorithm. Then, a preprocessed dataset was input into an improved Shallow-UWnet network for feature learning. The model included a multiscale weighted feature fusion module to extract global and local features and a parallel attention module to enhance key details and suppress noise in images. To simulate natural turbid water, a dataset of underwater images with different turbidities was constructed. Our method effectively corrected the color of images in different turbid water conditions and significantly enhanced image quality. In the evaluation on the custom dataset, performance improved by 13%–25% across standard image quality metrics compared with the second-best method. Additionally, we conducted generalization performance tests on the EUVP Dark, UIEBD, and UFO-120 datasets, validating the excellent enhancement effect of our method across different types of images. These results highlight the promising potential of our method for applications in underwater detection, rescue, and marine research.
Turbid water / Underwater image enhancement / Improved Shallow-UWnet network / Multiscale weighted features / Attention mechanism
| [1] |
|
| [2] |
Drews P, do Nascimento E, Moraes F, Botelho S, Campos M (2013) Transmission estimation in underwater single images. In: Proceedings of the IEEE International Conference on Computer Vision Workshops. IEEE, pp 825–830. https://doi.org/10.1109/ICCVW.2013.113 |
| [3] |
Duarte A, Codevilla F, Gaya JD, Botelho SSC (2016) A dataset to evaluate underwater image restoration methods. In: OCEANS 2016–Shanghai. IEEE, pp 1–6. https://doi.org/10.1109/OCEANSAP.2016.7485524 |
| [4] |
|
| [5] |
Islam MJ, Luo PG, Sattar J (2020a) Simultaneous enhancement and super-resolution of underwater imagery for improved visual perception. Preprint at arXiv:2002.01155 |
| [6] |
|
| [7] |
Jiang JX, Ye T, Bai JB, Chen SX, Chai WH, Jun S et al (2023) Five A+ Network: you only need 9k parameters for underwater image enhancement. Preprint at arXiv:2305.08824 |
| [8] |
Johnson J, Alahi A, Li FF (2016) Perceptual losses for real-time style transfer and super-resolution. In: Leibe B et al (eds) Computer Vision–ECCV 2016. Lecture notes in computer science, vol 9906. Springer, Cham, pp 694–711. https://doi.org/10.1007/978-3-319-46475-6_43 |
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
Shen X, Zhao XN, Luo R (2023) Semantic embedded deep neural network: a generic approach to boost multi-label image classification performance. Preprint at arXiv:2305.05228 |
| [20] |
Song W, Wang Y, Huang DM, Tjondronegoro D (2018) A rapid scene depth estimation model based on underwater light attenuation prior for underwater image restoration. In: Hong R et al (eds) Advances in Multimedia Information Processing–PCM 2018. Lecture notes in computer science, vol 11164. Springer, Cham, pp 678–688. https://doi.org/10.1007/978-3-030-00776-8_62 |
| [21] |
|
| [22] |
|
| [23] |
|
| [24] |
|
| [25] |
Wang QL, Wu BG, Zhu PF, Li PH, Zuo WM, Hu QH (2020) ECA-Net: efficient channel attention for deep convolutional neural networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE, pp 11531–11539. https://doi.org/10.1109/CVPR42600.2020.01155 |
| [26] |
|
| [27] |
|
| [28] |
|
| [29] |
Zhang XH, Zou YX, Shi W (2017) Dilated convolution neural network with LeakyReLU for environmental sound classification. In: 2017 22nd International Conference on Digital Signal Processing (DSP). IEEE, pp 1–5. https://doi.org/10.1109/ICDSP.2017.8096153 |
| [30] |
Zhao HY, Kong XT, He JW, Qiao Y, Dong C (2020) Efficient image super-resolution using pixel attention. In: Bartoli A et al (eds) Computer Vision–ECCV 2020 Workshops. Lecture notes in computer science, vol 12537. Springer, Cham, pp 56–72. https://doi.org/10.1007/978-3-030-67070-2_3 |
| [31] |
|
| [32] |
Zhu JY, Park T, Isola P, Efros AA (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks. In: 2017 IEEE International Conference on Computer Vision (ICCV). IEEE, pp 2242–2251. https://doi.org/10.1109/ICCV.2017.244 |
The Author(s)
/
| 〈 |
|
〉 |