contentidentifiers

Content identifiers are values assigned to digital content that identify the item by its data rather than by a storage location. In content-addressable systems, the identifier often takes the form of a cryptographic hash of the content, sometimes with additional structure to describe how the hash was produced and what type of content is being identified. The key properties of content identifiers are immutability (changing the content yields a different identifier), data integrity (the identifier can be recomputed to verify content has not been altered), and facilitation of deduplication (identical content yields the same identifier across storage nodes).

A prominent example is the Content Identifier (CID) used in IPFS and related projects. A CID is

Advantages of content identifiers include improved data integrity verification, easier provenance tracking, and more efficient caching

Limitations involve the need for tooling to generate and resolve identifiers, potential overhead from hashing large

a

self-describing

representation.

a

offline-to-online

a