Thanfrequency
Thanfrequency is a quantitative measure used in corpus linguistics to describe the frequency and distribution of the word "than" within a text or collection of texts, with particular emphasis on comparative constructions. It is typically reported as occurrences per million words or as a relative frequency within a subcorpus, which allows comparison across genres, time periods, or languages.
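The per-million normalization mentioned above can be illustrated with a minimal Python sketch. The function name than_per_million and the regex-based tokenizer are assumptions made for illustration only, not an established tool or standard method. The sketch counts every token "than" and makes no attempt to separate comparative uses from other uses.

```python
import re


def than_per_million(text: str) -> float:
    """Return occurrences of 'than' per million word tokens (hypothetical helper)."""
    # Naive tokenization (an assumption): lowercase alphabetic runs,
    # allowing internal apostrophes as in "don't".
    tokens = re.findall(r"[a-z]+(?:'[a-z]+)*", text.lower())
    if not tokens:
        return 0.0  # avoid division by zero on empty input
    # Count exact matches only, so "thank" or "Nathan" are not counted.
    hits = sum(1 for tok in tokens if tok == "than")
    # Normalize the raw count to a rate per one million tokens.
    return hits / len(tokens) * 1_000_000


sample = "She is taller than him. Rather than wait, we left earlier than planned."
print(than_per_million(sample))
```

Because the result is a rate and not a raw count, values computed this way for corpora of different sizes can be compared directly. A very short text such as the sample yields an inflated figure, so the measure is only meaningful on corpora of reasonable size.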
Calculation of thanfrequency involves tokenizing the corpus and identifying instances of "than". This often requires part-of-speech
Applications of thanfrequency include tracking shifts in the use of explicit comparison in written and spoken
Limitations should be noted: accurate measurement depends on reliable disambiguation of "than", which can be challenging