Accelerating BERT inference with GPU-efficient exit prediction