Accelerating BERT inference with GPU-efficient exit prediction

{{article.zuoZheEn}}

PDF(14533 KB)
PDF(14533 KB)
Front. Comput. Sci. ›› 2024, Vol. 18 ›› Issue (3) : 183308. DOI: 10.1007/s11704-022-2341-9
RESEARCH ARTICLE

Accelerating BERT inference with GPU-efficient exit prediction

  • {{article.zuoZheEn}}
Author information +
History +

Highlights

{{article.highlightEn}}

Abstract

{{article.abstractEn}}

Author summary

{{article.authorSummayEn}}

Graphical abstract

Keywords

Cite this article

Download citation ▾
{{article.zuoZheEn_L}}. {{article.titleEn}}. Front. Comput. Sci., 2024, 18(3): 183308 https://doi.org/10.1007/s11704-022-2341-9

References

References

{{article.reference}}

RIGHTS & PERMISSIONS

{{article.copyright.year}} {{article.copyright.holder}}
PDF(14533 KB)

Accesses

Citations

Detail

Sections
Recommended

/