Enhanced Panoramic Image Generation with GAN and CLIP Models

Journal of Beijing Institute of Technology ›› 2025, Vol. 34 ›› Issue (1) : 91 -101.

PDF (7157KB)
Journal of Beijing Institute of Technology ›› 2025, Vol. 34 ›› Issue (1) : 91 -101. DOI: 10.15918/j.jbit1004-0579.2024.091

Enhanced Panoramic Image Generation with GAN and CLIP Models

Author information +
History +
PDF (7157KB)

Abstract

Panoramic images, offering a 360-degree view, are essential in virtual reality (VR) and augmented reality (AR), enhancing realism with high-quality textures. However, acquiring complete and high-quality panoramic textures is challenging. This paper introduces a method using generative adversarial networks (GANs) and the contrastive language-image pretraining (CLIP) model to restore and control texture in panoramic images. The GAN model captures complex structures and maintains consistency, while CLIP enables fine-grained texture control via semantic text-image associations. GAN inversion optimizes latent codes for precise texture details. The resulting low dynamic range (LDR) images are converted to high dynamic range (HDR) using the Blender engine for seamless texture blending. Experimental results demonstrate the effectiveness and flexibility of this method in panoramic texture restoration and generation.

Keywords

panoramic images / environment texture / generative adversarial networks (GANs) / contrastive language-image pretraining(CLIP) model / blender engine / fine-grained control / texture generation

Cite this article

Download citation ▾
null. Enhanced Panoramic Image Generation with GAN and CLIP Models. Journal of Beijing Institute of Technology, 2025, 34(1): 91-101 DOI:10.15918/j.jbit1004-0579.2024.091

登录浏览全文

4963

注册一个新账户 忘记密码

References

AI Summary AI Mindmap
PDF (7157KB)

326

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/