Front. Comput. Sci. DOI: 10.3868/fcs-video-037

BAFT: bubble-aware fault-tolerant framework for distributed DNN training with hybrid parallelism

Linked article:

BAFT: bubble-aware fault-tolerant framework for distributed DNN training with hybrid parallelism

Runzhe CHEN, Guandong LU, Yakai WANG, Rui ZHANG, Zheng HU, Yanming MIAO, Zhifang CAI, Jingwen LENG, Minyi GUO

Front. Comput. Sci. 2025, Vol. 19(1): 191102
