Training large-scale language models with limited GPU memory: a survey

Yu TANG , Linbo QIAO , Lujia YIN , Peng LIANG , Ao SHEN , Zhilin YANG , Lizhi ZHANG , Dongsheng LI

Front. Inform. Technol. Electron. Eng ›› 2025, Vol. 26 ›› Issue (3) : 309 -331.

PDF (623KB)
Front. Inform. Technol. Electron. Eng ›› 2025, Vol. 26 ›› Issue (3) : 309 -331. DOI: 10.1631/FITEE.2300710
Review

Training large-scale language models with limited GPU memory: a survey

Author information +
History +
PDF (623KB)

Abstract

Large-scale models have gained significant attention in a wide range of fields, such as computer vision and natural language processing, due to their effectiveness across various applications. However, a notable hurdle in training these large-scale models is the limited memory capacity of graphics processing units (GPUs). In this paper, we present a comprehensive survey focused on training large-scale models with limited GPU memory. The exploration commences by scrutinizing the factors that contribute to the consumption of GPU memory during the training process, namely model parameters, model states, and model activations. Following this analysis, we present an in-depth overview of the relevant research work that addresses these aspects individually. Finally, the paper concludes by presenting an outlook on the future of memory optimization in training large-scale language models, emphasizing the necessity for continued research and innovation in this area. This survey serves as a valuable resource for researchers and practitioners keen on comprehending the challenges and advancements in training large-scale language models with limited GPU memory.

Keywords

Training techniques / Memory optimization / Model parameters / Model states / Model activations

Cite this article

Download citation ▾
Yu TANG, Linbo QIAO, Lujia YIN, Peng LIANG, Ao SHEN, Zhilin YANG, Lizhi ZHANG, Dongsheng LI. Training large-scale language models with limited GPU memory: a survey. Front. Inform. Technol. Electron. Eng, 2025, 26(3): 309-331 DOI:10.1631/FITEE.2300710

登录浏览全文

4963

注册一个新账户 忘记密码

References

RIGHTS & PERMISSIONS

Zhejiang University Press

AI Summary AI Mindmap
PDF (623KB)

Supplementary files

FITEE-0309-24001-YT_suppl_1

FITEE-0309-24001-YT_suppl_2

230

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/