dataDNA
dataDNA is a conceptual framework used in information science to describe a DNA-inspired encoding of digital data. It treats a data object as a single sequence of symbols that carries the payload together with its metadata and provenance information.
In dataDNA, data payload and metadata are interleaved or co-encoded using a compact alphabet (for example, four-symbol
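As a minimal sketch of what a four-symbol encoding could look like, the Python below maps each byte to four symbols at two bits per symbol. The A/C/G/T symbol names, the bit-pair mapping, and the function names encode and decode are illustrative assumptions, not part of any published dataDNA specification.

```python
# Hypothetical four-symbol alphabet; the A/C/G/T naming and the 2-bit mapping
# are illustrative assumptions, not taken from a dataDNA specification.
ALPHABET = "ACGT"
SYMBOL_TO_BITS = {symbol: index for index, symbol in enumerate(ALPHABET)}


def encode(data: bytes) -> str:
    """Map each byte to four symbols, most significant bit pair first."""
    symbols = []
    for byte in data:
        for shift in (6, 4, 2, 0):
            symbols.append(ALPHABET[(byte >> shift) & 0b11])
    return "".join(symbols)


def decode(sequence: str) -> bytes:
    """Invert encode(); the sequence length must be a multiple of four."""
    if len(sequence) % 4 != 0:
        raise ValueError("sequence length must be a multiple of 4")
    out = bytearray()
    for start in range(0, len(sequence), 4):
        byte = 0
        for symbol in sequence[start:start + 4]:
            byte = (byte << 2) | SYMBOL_TO_BITS[symbol]
        out.append(byte)
    return bytes(out)
```

Under this mapping, encode(b"A") yields "CAAC", and decode reverses it. A two-bit mapping quadruples the symbol count relative to the byte count, which is one concrete form of encoding overhead.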
A typical dataDNA model comprises a data payload segment, a metadata layer with version history, timestamps,
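The payload-plus-metadata structure can be sketched as a small record type that serializes into one byte sequence. This is only an illustration under stated assumptions: the class name DataDnaRecord, its field names, and the length-prefixed JSON header layout are hypothetical choices, and only the payload, version history, and timestamp components named above are modeled.

```python
import json
import struct
from dataclasses import dataclass, field


@dataclass
class DataDnaRecord:
    """Hypothetical record: a payload segment plus a metadata layer."""
    payload: bytes
    version_history: list = field(default_factory=list)
    timestamps: list = field(default_factory=list)

    def to_sequence(self) -> bytes:
        """Serialize as: 4-byte big-endian header length, JSON header, payload."""
        header = json.dumps(
            {
                "version_history": self.version_history,
                "timestamps": self.timestamps,
            },
            sort_keys=True,
        ).encode("utf-8")
        return struct.pack(">I", len(header)) + header + self.payload

    @classmethod
    def from_sequence(cls, sequence: bytes) -> "DataDnaRecord":
        """Parse a sequence produced by to_sequence() back into a record."""
        (header_len,) = struct.unpack(">I", sequence[:4])
        header = json.loads(sequence[4:4 + header_len].decode("utf-8"))
        return cls(
            payload=sequence[4 + header_len:],
            version_history=header["version_history"],
            timestamps=header["timestamps"],
        )
```

A record built as DataDnaRecord(b"hello", ["v1", "v2"], ["2024-01-01T00:00:00Z"]) round-trips through to_sequence() and from_sequence() unchanged. The length prefix lets a reader separate the metadata layer from the payload without scanning for a delimiter.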
Potential benefits include improved reproducibility, more detailed provenance tracking, compact archiving, and built-in integrity guarantees. It can facilitate
However, dataDNA faces challenges such as encoding overhead, lack of standardized formats, performance trade-offs, and privacy
Origin and status: introduced in theoretical discussions and some experimental studies in data management and storage
See also: DNA data storage, data provenance, metadata standards, content-addressable storage.