Digrams
A digram, also called a bigram in many applications, is a pair of adjacent elements in a sequence. The elements can be any two consecutive units, such as letters, phonemes, or words; for example, the word "the" contains the letter digrams "th" and "he". In statistical text analysis, digrams serve as second-order units for studying the structure and patterns of a language.
In language analysis, two common forms are letter-level digrams and word-level digrams. Letter-level digrams are pairs
Applications of digrams span several fields. In natural language processing, digrams are foundational to n-gram models
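As an illustration of how digrams can feed an n-gram model, the sketch below estimates the probability of each word given the word before it by dividing a digram's count by the count of its first element. This is a plain, unsmoothed relative-frequency estimate, not a description of any particular system; the function name `bigram_probabilities` and the sample sentence are hypothetical.

```python
from collections import Counter, defaultdict

def bigram_probabilities(tokens):
    """Estimate P(second | first) from relative digram frequencies (hypothetical helper)."""
    # Count every adjacent pair of tokens.
    pair_counts = Counter(zip(tokens, tokens[1:]))
    # Count how often each token appears as the first element of a pair.
    first_counts = Counter(tokens[:-1])
    probs = defaultdict(dict)
    for (first, second), n in pair_counts.items():
        probs[first][second] = n / first_counts[first]
    return dict(probs)

# Illustrative input only.
model = bigram_probabilities("the cat saw the dog".split())
```

In this toy input, "the" is followed once by "cat" and once by "dog", so each continuation receives half of the probability mass. Pairs that never occur receive no entry at all, which is one reason practical models usually add some form of smoothing.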
Computation typically involves sliding a two-element window across the text, tallying occurrences of each digram, and
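The sliding-window tally described above can be sketched in a few lines of Python. This is a minimal illustration rather than a reference implementation; the helper name `count_digrams` and the sample inputs are assumptions made for the example.

```python
from collections import Counter

def count_digrams(units):
    """Tally each pair of adjacent elements in a sequence (hypothetical helper)."""
    # Pairing the sequence with itself shifted by one position yields
    # every two-element window, in order.
    return Counter(zip(units, units[1:]))

# Letter-level digrams: a string is a sequence of characters.
letters = count_digrams("banana")

# Word-level digrams: split the text into tokens first.
words = count_digrams("the cat saw the cat".split())
```

The same function covers both levels because it only requires a sequence that supports slicing. No normalization is applied here, so choices such as lowercasing, stripping punctuation, or counting pairs across word boundaries are left to the caller and will change the resulting counts.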
Limitations include data sparsity for longer or less common digrams, sensitivity to preprocessing choices, and dependence