Striking the mantissa: how few bits are enough for accurate DNN inference?
Zhiyuan ZHANG , Ping ZHANG , Zhihua FAN , Wenming LI , Xiaochun YE , Xuejun AN
Front. Comput. Sci. ›› 2027, Vol. 21 ›› Issue (5) : 2105105
| [1] |
|
| [2] |
|
| [3] |
Burgess N, Milanovic J, Stephens N, Monachopoulos K, Mansell D. Bfloat16 processing for neural networks. In: Proceedings of the 26th IEEE Symposium on Computer Arithmetic (ARITH). 2019, 88–91 |
| [4] |
NVIDIA Corporation. NVIDIA Hopper architecture. see nvidia.com/en-us/data-center/technologies/hopper-architecture/ website |
Higher Education Press
/
| 〈 |
|
〉 |