ididx
Ididx is a cross-language indexing framework designed to store, normalize, and query identifiers across large datasets. It focuses on performance, scalability, and robust deduplication, making it suitable for data integration, catalog management, and identity resolution tasks.
Overview of architecture: a modular core consisting of an index engine, a normalization layer, and a pluggable
Key features: normalization and canonicalization of identifiers; exact and range queries; support for composite keys; incremental
Use cases: identity matching across systems, deduplication of customer records, linkage of product identifiers, data governance
History and licensing: ididx was first released in 2020 by the ididx project and has since evolved