binary64

Binary64 is the IEEE 754 64-bit double-precision floating-point format used to represent real numbers in binary. It encodes numbers with 64 bits: 1 sign bit, 11 exponent bits, and 52 fraction bits. The value is (-1)^sign × (1.fraction) × 2^(exponent − 1023) for normal numbers. If the exponent field is zero, the number is subnormal and has the form (-1)^sign × 0.fraction × 2^(−1022). When the exponent field is all ones (2047), the representation denotes either infinity (fraction zero) or NaN (fraction nonzero).
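The field layout described above can be inspected directly. As a minimal sketch in Python (whose `float` is binary64), this reinterprets a value's 64 bits with the standard `struct` module and extracts the three fields; the helper name `decode_binary64` is our own, not a standard API:

```python
import struct

def decode_binary64(x: float):
    """Split a binary64 value into its sign, biased-exponent, and fraction fields."""
    bits = struct.unpack(">Q", struct.pack(">d", x))[0]  # raw 64-bit pattern
    sign = bits >> 63                  # 1 sign bit
    exponent = (bits >> 52) & 0x7FF    # 11-bit biased exponent
    fraction = bits & ((1 << 52) - 1)  # 52-bit fraction field
    return sign, exponent, fraction

# -6.5 = (-1)^1 × 1.625 × 2^2, so the stored exponent is 2 + 1023 = 1025
# and the fraction field encodes 0.625 × 2^52.
print(decode_binary64(-6.5))  # → (1, 1025, 2814749767106560)
```

Reconstructing `(-1)**sign * (1 + fraction / 2**52) * 2**(exponent - 1023)` from these fields recovers the original normal value exactly.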

With 52 fraction bits plus the implicit leading 1, binary64 provides 53 bits of precision, roughly 15 to 17 significant decimal digits. The finite range spans about 5.0×10^−324 to 1.8×10^308; the largest finite value is around 1.7976931348623157×10^308, the smallest positive normal is ~2.2250738585072014×10^−308, and the smallest positive subnormal is ~4.9406564584124654×10^−324.

Usage and implications: Binary64 is the default floating-point format in many languages, including C, C++, Java, and Python.
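In CPython, these limits are exposed through `sys.float_info`, so the constants quoted above can be checked directly; a short sketch:

```python
import sys

# CPython's float is binary64, so sys.float_info reports its limits.
print(sys.float_info.max)       # largest finite value ≈ 1.7976931348623157e+308
print(sys.float_info.min)       # smallest positive normal = 2**-1022
print(sys.float_info.mant_dig)  # 53 bits of precision
print(5e-324)                   # smallest positive subnormal = 2**-1074
```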
It supports the standard IEEE rounding modes, typically round-to-nearest, ties-to-even. Because only 53 bits of precision are available, integers larger than 2^53 cannot all be represented exactly. Arithmetic can produce rounding errors, overflow to infinity, underflow to subnormals, or NaN results from invalid operations. The format is a cornerstone of numeric computing, balancing range with precision for a wide variety of applications.
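Each of these failure modes is easy to reproduce; a small Python demonstration, again relying on `float` being binary64:

```python
import math
import sys

# Above 2**53, consecutive integers are spaced more than 1 apart:
# 2**53 + 1 rounds back to 2**53 under ties-to-even.
print(float(2**53) == float(2**53 + 1))  # → True

# Rounding error: 0.1 and 0.2 are not exactly representable in binary.
print(0.1 + 0.2 == 0.3)                  # → False

# Overflow saturates to infinity; invalid operations yield NaN.
print(sys.float_info.max * 2)            # → inf
print(math.inf - math.inf)               # → nan
```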