Unitsoccurs
Unitsoccurs is a term used in the study of symbolic sequences to describe the presence of a predefined unit within a larger sequence. It is used in areas such as formal languages, text processing, bioinformatics, and pattern mining to formalize questions about whether and where a unit appears.
Formally, let A be a finite alphabet and let U be a set of units, where each
Example: with alphabet {a, b}, U = {"ab", "ba"} and s = "ababca", the unit "ab" occurs at positions
Applications include designing and analyzing text search algorithms, implementing regular-expression-like pattern matchers, scanning DNA or protein