dataversioner
Dataversioner is a general concept describing systems, practices, and tooling that track changes to data over time. It encompasses methods for assigning version identifiers to datasets and data files, storing historical snapshots, and recording metadata such as provenance, schema, and checksums. The primary aims are to enable reproducibility, auditability, and governance in data-intensive workflows.
Core concepts include immutability of stored versions, snapshotting or time-travel, and the ability to branch and
Common features involve versioned datasets and files, diffing to identify changes, lineage tracking, and integration with
Applications for dataversioning span data science and machine learning experiments, data governance and regulatory compliance, reproducible