Provformat
Provformat is a term for formats that encode provenance information: the metadata describing the origins, history, and transformations of data products, computations, and experiments. It is not a single standardized specification but a family of serialization approaches intended to interoperate with established provenance models, particularly the W3C PROV data model. Provformat aims to provide representations that are both human-readable and machine-processable, supporting storage in archives and exchange between systems.
Core concepts commonly captured by Provformat include entities (data items, files, results), activities (steps in a
Implementation varies by community; some Provformat variants use line-based, more compact syntax, while others adopt JSON
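As a rough illustration of the contrast between a line-based and a JSON-based serialization, the following Python sketch renders one small provenance record both ways. It is not taken from any Provformat specification: the record layout, the function names to_lines and to_json, and the ex: identifiers are all hypothetical, and the two outputs are only loosely modeled on PROV-N and PROV-JSON (namespace declarations and the document wrapper are omitted).

```python
import json

# Hypothetical record: one activity that uses one entity and generates another.
# The "ex:" identifiers are illustrative only.
record = {
    "entities": ["ex:raw.csv", "ex:clean.csv"],
    "activities": ["ex:cleaning"],
    "used": [("ex:cleaning", "ex:raw.csv")],            # (activity, entity)
    "generated": [("ex:clean.csv", "ex:cleaning")],     # (entity, activity)
}


def to_lines(rec):
    """Render the record as compact line-based statements (PROV-N-like).

    The trailing "-" marks an omitted time argument.
    """
    lines = [f"entity({e})" for e in rec["entities"]]
    lines += [f"activity({a})" for a in rec["activities"]]
    lines += [f"used({a}, {e}, -)" for a, e in rec["used"]]
    lines += [f"wasGeneratedBy({e}, {a}, -)" for e, a in rec["generated"]]
    return "\n".join(lines)


def to_json(rec):
    """Render the same record as a JSON document (PROV-JSON-like)."""
    doc = {
        "entity": {e: {} for e in rec["entities"]},
        "activity": {a: {} for a in rec["activities"]},
        "used": {
            f"_:u{i}": {"prov:activity": a, "prov:entity": e}
            for i, (a, e) in enumerate(rec["used"])
        },
        "wasGeneratedBy": {
            f"_:g{i}": {"prov:entity": e, "prov:activity": a}
            for i, (e, a) in enumerate(rec["generated"])
        },
    }
    return json.dumps(doc, indent=2)


if __name__ == "__main__":
    print(to_lines(record))
    print(to_json(record))
```

Both renderings carry the same statements; the line-based form is shorter to read and diff, while the JSON form can be loaded directly by generic tooling.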
Applications of Provformat appear in scientific data management, workflow systems, data journals, repositories, and reproducibility initiatives.
See also: PROV data model, PROV-O, PROV-N, data provenance, reproducibility.