Identsets
Identsets are a concept used in data management and information systems to designate sets of attributes that can be used to uniquely identify a record in a dataset. An identset is a subset P of attributes such that the projection of the dataset onto P is injective; in other words, no two distinct records share the same values on P. This formalizes the idea of identifying a row by a combination of fields.
Minimal identsets, where no proper subset is itself an identset, are often called identset keys. These correspond
Examples illustrate the idea. Consider a table with attributes (ssn, email, name, dob). The identset {ssn} typically
Applications and properties. Identsets are used in identity resolution, deduplication, and schema design to determine which
Relation to related concepts. Identsets are closely related to primary keys, superkeys, and functional dependencies in