Prompt learning in computer vision: a survey

Yiming LEI, Jingqi LI, Zilong LI, Yuan CAO, Hongming SHAN

PDF(26205 KB)
PDF(26205 KB)
Front. Inform. Technol. Electron. Eng ›› 2024, Vol. 25 ›› Issue (1) : 42-63. DOI: 10.1631/FITEE.2300389
Review

Prompt learning in computer vision: a survey

Author information +
History +

Abstract

Prompt learning has attracted broad attention in computer vision since the large pre-trained visionlanguage models (VLMs) exploded. Based on the close relationship between vision and language information built by VLM, prompt learning becomes a crucial technique in many important applications such as artificial intelligence generated content (AIGC). In this survey, we provide a progressive and comprehensive review of visual prompt learning as related to AIGC. We begin by introducing VLM, the foundation of visual prompt learning. Then, we review the vision prompt learning methods and prompt-guided generative models, and discuss how to improve the efficiency of adapting AIGC models to specific downstream tasks. Finally, we provide some promising research directions concerning prompt learning.

Keywords

Prompt learning / Visual prompt tuning (VPT) / Image generation / Image classification / Artificial intelligence generated content (AIGC)

Cite this article

Download citation ▾
Yiming LEI, Jingqi LI, Zilong LI, Yuan CAO, Hongming SHAN. Prompt learning in computer vision: a survey. Front. Inform. Technol. Electron. Eng, 2024, 25(1): 42‒63 https://doi.org/10.1631/FITEE.2300389

RIGHTS & PERMISSIONS

2024 Zhejiang University Press
PDF(26205 KB)

Accesses

Citations

Detail

Sections
Recommended

/