Removing nonrigid refractive distortions for underwater images using an attention-based deep neural network
Tengyue Li , Jiayi Song , Zhiyu Song , Arapat Ablimit , Long Chen
Intelligent Marine Technology and Systems ›› 2024, Vol. 2 ›› Issue (1)
Removing nonrigid refractive distortions for underwater images using an attention-based deep neural network
Refractive distortions in underwater images usually occur when these images are captured through a dynamic refractive water surface, such as unmanned aerial vehicles capturing shallow underwater scenes from the surface of water or autonomous underwater vehicles observing floating platforms in the air. We propose an end-to-end deep neural network for learning to restore real scene images for removing refractive distortions. This network adopts an encoder-decoder architecture with a specially designed attention module. The use of the attention image and the distortion field generated by the proposed deep neural network can restore the exact distorted areas in more detail. Qualitative and quantitative experimental results show that the proposed framework effectively eliminates refractive distortions and refines image details. We also test the proposed framework in practical applications by embedding it into an NVIDIA JETSON TX2 platform, and the results demonstrate the practical value of the proposed framework.
Underwater image / Refractive distortion / Deep neural network / Attention mechanism
| [1] |
|
| [2] |
|
| [3] |
De Greve B (2006) Reflections and refractions in ray tracing. pp 1–6. Retrieved from https://graphics.stanford.edu/courses/cs148-10-summer/docs/2006--degreve--reflection_refraction.pdf |
| [4] |
|
| [5] |
Donate A, Dahme G, Ribeiro E (2006) Classification of textures distorted by waterwaves. In: 18th International Conference on Pattern Recognition (ICPR’06), Hong Kong, pp 421–424. https://doi.org/10.1109/ICPR.2006.371 |
| [6] |
Donate A, Ribeiro E (2007) Improved reconstruction of images distorted by water waves. In: Braz J et al (eds) Advances in computer graphics and computer vision. Springer, Berlin, pp 264–277. https://doi.org/10.1007/978-3-540-75274-5_18 |
| [7] |
Efros A, Isler V, Shi J, Visontai M (2004) Seeing through water. In: NIPS’04: Proceedings of the 17th International Conference on Neural Information Processing Systems, Vancouver, pp 393–400. https://proceedings.neurips.cc/paper_files/paper/2004 |
| [8] |
Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S et al (2014) Generative adversarial nets. In: NIPS’14: Proceedings of the 27th International Conference on Neural Information Processing Systems, Montreal, pp 2672–2680. https://proceedings.neurips.cc/paper_files/paper/2014/file/5ca3e9b122f61f8f06494c97b1afccf3-Paper.pdf |
| [9] |
Isola P, Zhu JY, Zhou TH, Efros AA (2017) Image-to-image translation with conditional adversarial networks. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, pp 5967–5976. https://doi.org/10.1109/CVPR.2017.632 |
| [10] |
James JG, Agrawal P, Rajwade A (2019) Restoration of non-rigidly distorted underwater images using a combination of compressive sensing and local polynomial image representations. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, pp 7839–7848. https://doi.org/10.1109/ICCV.2019.00793 |
| [11] |
|
| [12] |
Ledig C, Theis L, Huszár F, Caballero J, Cunningham A, Acosta A et al (2017) Photo-realistic single image super-resolution using a generative adversarial network. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, pp 105–114. https://doi.org/10.1109/CVPR.2017.19 |
| [13] |
|
| [14] |
Li C, Wand M (2016) Precomputed real-time texture synthesis with Markovian generative adversarial networks. In: 14th European Conference on Computer Vision (ECCV), Amsterdam, pp 702–716. https://doi.org/10.1007/978-3-319-46487-9_43 |
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
Li ZQ, Murez Z, Kriegman D, Ramamoorthi R, Chandraker M (2018) Learning to see through turbulent water. In: 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Tahoe, pp 512–520. https://doi.org/10.1109/WACV.2018.00062 |
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
|
| [24] |
|
| [25] |
Thapa S, Li N, Ye J (2020) Dynamic fluid surface reconstruction using deep neural network. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, pp 21–30. https://doi.org/10.1109/CVPR42600.2020.00010 |
| [26] |
Thapa S, Li N, Ye J (2021) Learning to remove refractive distortions from underwater images. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, pp 5007–5016. https://doi.org/10.1109/ICCV48922.2021.00496 |
| [27] |
|
| [28] |
|
| [29] |
Wang XT, Yu K, Wu SX, Gu JJ, Liu YH, Dong C et al (2018) ESRGAN: enhanced super-resolution generative adversarial networks. In: 15th European Conference on Computer Vision (ECCV), Munich, pp 63–79. https://doi.org/10.1007/978-3-030-11021-5_5 |
| [30] |
|
| [31] |
Wen ZY, Fraser D, Lambert A, Li HD (2007) Reconstruction of underwater image by bispectrum. In: IEEE International Conference on Image Processing, San Antonio, pp 1–4. https://doi.org/10.1109/ICIP.2007.4379367 |
| [32] |
|
| [33] |
Xiong J, Heidrich W (2021) In-the-wild single camera 3D reconstruction through moving water surfaces. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, pp 12558–12567. https://doi.org/10.1109/ICCV48922.2021.01233 |
| [34] |
|
| [35] |
|
| [36] |
|
| [37] |
|
| [38] |
|
/
| 〈 |
|
〉 |