TFRecord

TFRecord is a binary file format used by TensorFlow to store a sequence of records. Each record is a serialized protocol buffer message, usually a tf.train.Example. An Example contains a Features map that associates feature names with lists of values. The value types supported are int64_list, float_list, and bytes_list, allowing a compact representation of heterogeneous data such as images, labels, and metadata.
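As an illustration of the Features map described above, the following minimal sketch builds one tf.train.Example that uses all three value types and serializes it. The feature names ("image", "label", "score"), the helper make_example, and the placeholder byte string are hypothetical choices made for this example, not part of any fixed schema.

```python
import tensorflow as tf


def make_example(image_bytes: bytes, label: int, score: float) -> tf.train.Example:
    """Pack one record into a tf.train.Example (feature names are hypothetical)."""
    feature = {
        # bytes_list holds raw byte strings, e.g. an encoded image.
        "image": tf.train.Feature(bytes_list=tf.train.BytesList(value=[image_bytes])),
        # int64_list holds integers, e.g. a class label.
        "label": tf.train.Feature(int64_list=tf.train.Int64List(value=[label])),
        # float_list holds 32-bit floats, e.g. a score or other metadata.
        "score": tf.train.Feature(float_list=tf.train.FloatList(value=[score])),
    }
    return tf.train.Example(features=tf.train.Features(feature=feature))


example = make_example(b"\x89PNG placeholder", label=3, score=0.5)
serialized = example.SerializeToString()  # bytes, suitable for writing as one record
```

Each feature value is a list, so a single name can hold several values of the same type.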

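TFRecord files are designed for efficient storage and streaming of large datasets. Being binary, they are generally more space-efficient than equivalent text-based representations.

Common workflows involve converting raw data to tf.train.Example messages and writing them with TFRecordWriter, then reading the files with tf.data.TFRecordDataset and parsing the records with tf.io.parse_single_example or tf.io.parse_example.

Advantages of TFRecord include a compact binary representation and compatibility with efficient data pipelines on large-scale datasets. Drawbacks include the need for a mapping/parsing step when reading and a lack of self-description: the reader has to supply the expected feature names and types when parsing.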
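The write-then-read workflow (TFRecordWriter for writing, tf.data.TFRecordDataset plus tf.io.parse_single_example for reading) might look like the following sketch. It assumes TensorFlow 2 with eager execution. The schema in FEATURE_SPEC, the helpers write_records and read_records, and the file name demo.tfrecord are hypothetical names introduced for illustration.

```python
import os
import tempfile

import tensorflow as tf

# Hypothetical schema: one integer label and one raw byte string per record.
# The reader must supply this spec, since parsing needs the names and types.
FEATURE_SPEC = {
    "label": tf.io.FixedLenFeature([], tf.int64),
    "payload": tf.io.FixedLenFeature([], tf.string),
}


def write_records(path, rows):
    """Serialize (label, payload) pairs as tf.train.Example records."""
    with tf.io.TFRecordWriter(path) as writer:
        for label, payload in rows:
            example = tf.train.Example(features=tf.train.Features(feature={
                "label": tf.train.Feature(
                    int64_list=tf.train.Int64List(value=[label])),
                "payload": tf.train.Feature(
                    bytes_list=tf.train.BytesList(value=[payload])),
            }))
            writer.write(example.SerializeToString())


def read_records(path):
    """Stream records from disk and parse each one into a dict of tensors."""
    dataset = tf.data.TFRecordDataset(path)
    return dataset.map(
        lambda record: tf.io.parse_single_example(record, FEATURE_SPEC))


path = os.path.join(tempfile.mkdtemp(), "demo.tfrecord")
write_records(path, [(0, b"first"), (1, b"second")])
parsed = [(int(r["label"].numpy()), r["payload"].numpy())
          for r in read_records(path)]
```

Here each record is parsed individually. An alternative is to batch the serialized records first and apply tf.io.parse_example with the same spec to the whole batch.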