Dataatlike
Dataatlike is a term used in data engineering and information management to describe a class of data objects or datasets that embed descriptive metadata and versioning within the data entity itself, enabling self-describing and evolvable data models. It is not a single standardized format but a family of approaches that integrate metadata with content to support schema evolution, provenance tracking, and governance across heterogeneous storage systems.
The term blends the notions of data, attributes, and adaptability, and is used to discuss strategies for
- Self-describing data objects that carry their own schema hints and metadata.
- Versioned records with lineage information to trace origins and transformations.
- Schema-on-read compatibility and flexible evolution that tolerate schema changes.
- Built-in access controls and governance hooks to support data security and compliance.
- Interoperability with data catalogs, metadata stores, and lineage tools.
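Because Dataatlike is not a standardized format, the following is only a minimal sketch of how the first three characteristics might fit together. The class `SelfDescribingRecord`, its fields, and the helper `read_with_schema` are hypothetical names chosen for illustration, not part of any existing library or specification. The record carries its own schema hints, a version counter, and a lineage trail, and a reader projects it onto the fields it expects, falling back to defaults for fields that older versions lack.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Any


@dataclass(frozen=True)
class SelfDescribingRecord:
    """Hypothetical envelope: a payload plus the metadata needed to interpret it."""

    payload: dict[str, Any]
    schema: dict[str, str]            # field name -> type hint, e.g. {"id": "int"}
    schema_version: int = 1
    lineage: tuple[str, ...] = ()     # ordered names of the steps that produced this record

    def evolve(self, step: str, added: dict[str, tuple[str, Any]]) -> SelfDescribingRecord:
        """Return a new version with extra fields; the original record is not modified."""
        schema = dict(self.schema)
        payload = dict(self.payload)
        for name, (type_hint, value) in added.items():
            schema[name] = type_hint
            payload[name] = value
        return SelfDescribingRecord(
            payload=payload,
            schema=schema,
            schema_version=self.schema_version + 1,
            lineage=self.lineage + (step,),
        )


def read_with_schema(record: SelfDescribingRecord, wanted: dict[str, Any]) -> dict[str, Any]:
    """Schema-on-read: keep only the fields the reader asks for, using the
    supplied default when a record version does not contain a field."""
    return {name: record.payload.get(name, default) for name, default in wanted.items()}


v1 = SelfDescribingRecord(
    payload={"id": 7, "name": "sensor-a"},
    schema={"id": "int", "name": "str"},
    lineage=("ingest",),
)
v2 = v1.evolve("add_unit", {"unit": ("str", "celsius")})

reader_view = {"id": 0, "unit": "unknown"}   # field -> default expected by the reader
print(read_with_schema(v1, reader_view))     # {'id': 7, 'unit': 'unknown'}
print(read_with_schema(v2, reader_view))     # {'id': 7, 'unit': 'celsius'}
print(v2.schema_version, v2.lineage)         # 2 ('ingest', 'add_unit')
```

Making each version an immutable value keeps earlier versions available for lineage queries. A real system would more likely delegate schema resolution and lineage capture to dedicated tooling than hand-roll them as shown here.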
Use cases and context: data lakes and lakehouse architectures, streaming platforms, feature stores for machine learning,
Limitations: Dataatlike approaches can introduce complexity, require specialized tooling, and may impact performance and interoperability with systems