SUBstrayt
SUBstrayt is a modular open-source library designed to perform fast substring search and text retrieval over large collections. It aims to be embeddable in other software and to support multi-language text through Unicode, with an emphasis on predictable latency, scalable indexing, and a flexible query interface.
The architecture of SUBstrayt centers on an indexing layer that blends suffix-based structures—such as suffix arrays
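As an illustration of the suffix-array idea named above, the following is a minimal Python sketch of exact substring lookup over a single text. It is not SUBstrayt's implementation or API: the function names `build_suffix_array` and `find_occurrences` are hypothetical, and the naive sort-based construction is chosen for readability rather than speed.

```python
from typing import List


def build_suffix_array(text: str) -> List[int]:
    """Return the start offsets of all suffixes of `text`, in sorted order.

    Assumption: naive construction by sorting materialized suffixes. This is
    quadratic or worse in the text length; production indexes use linear or
    near-linear construction algorithms instead.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_occurrences(text: str, sa: List[int], pattern: str) -> List[int]:
    """Return the sorted start offsets at which `pattern` occurs in `text`."""
    m = len(pattern)

    # Lower bound: first suffix whose m-character prefix is >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo

    # Upper bound: first suffix whose m-character prefix is > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid

    # Every suffix in sa[start:lo] begins with the pattern.
    return sorted(sa[start:lo])


if __name__ == "__main__":
    text = "banana"
    sa = build_suffix_array(text)
    print(find_occurrences(text, sa, "ana"))
```

Because all suffixes that share a prefix are adjacent in sorted order, each query needs only two binary searches over the array, which is why lookup cost depends on the pattern length and the logarithm of the text length rather than on a full scan.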
Query processing in SUBstrayt supports exact substring matches, pattern-based queries with wildcards, and approximate matching using
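The text does not say here which technique backs approximate matching, so the sketch below is only one possible reading: it assumes edit distance (insertions, deletions, and substitutions) and uses the classic dynamic-programming scan in which a match may begin at any position in the text. The function name `approximate_matches` and its parameters are hypothetical and are not part of any documented SUBstrayt interface.

```python
from typing import List, Tuple


def approximate_matches(text: str, pattern: str, max_edits: int) -> List[Tuple[int, int]]:
    """Return (end_offset, distance) pairs for approximate occurrences.

    A pair is reported for each end position in `text` (exclusive offset) where
    some substring ending there is within `max_edits` edits of `pattern`.
    Assumption: unit-cost edit distance; the source does not specify a metric.
    """
    m = len(pattern)
    # prev[i] = best distance between pattern[:i] and a substring of the text
    # ending at the previous text position. Before any text is read, matching
    # pattern[:i] against the empty string costs i.
    prev = list(range(m + 1))
    hits: List[Tuple[int, int]] = []

    for j, ch in enumerate(text, start=1):
        cur = [0] * (m + 1)  # cur[0] = 0 lets a match start at any position
        for i in range(1, m + 1):
            cost = 0 if pattern[i - 1] == ch else 1
            cur[i] = min(
                prev[i - 1] + cost,  # match or substitution
                prev[i] + 1,         # extra character in the text
                cur[i - 1] + 1,      # pattern character missing from the text
            )
        if cur[m] <= max_edits:
            hits.append((j, cur[m]))
        prev = cur

    return hits


if __name__ == "__main__":
    print(approximate_matches("abcdef", "bxd", 1))
```

This scan touches every text character, so it costs time proportional to the text length times the pattern length. An index-backed system would more plausibly use the index to narrow the candidate regions first and run a check like this only on those regions.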
Use cases for SUBstrayt span digital archives, code search, log analysis, large-scale literature collections, and even
In discussion and documentation, SUBstrayt is presented as a conceptual design for efficient substring search rather