Data Format

FP32

Single-precision floating-point format (IEEE 754)

Float_example.svg

FP64

Double-precision floating-point format (IEEE754)

IEEE_754_Double_Floating_Point_Format.svg

FP16

1 bit Sign + 8 bit Exponent + 8 bit Fraction (7 bit explicitly stored)

BF16

Google Brain

commonly used in machine learning

Untitled