corpuswide

Corpuswide is a term used in corpus linguistics and related fields to describe analyses, statistics, or models that are computed over an entire text corpus rather than over individual documents or smaller subsets. A corpuswide approach describes the corpus as a whole, capturing aggregate patterns that hold across its texts, genres, or time periods.
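The difference between a corpuswide statistic and a per-document one can be illustrated with a minimal sketch. The two-document toy corpus, the naive tokenizer, and the helper names `tokenize` and `type_token_ratio` are assumptions made for illustration only; real corpus work would normally use a proper tokenizer and a much larger collection.

```python
from collections import Counter

PUNCT = ".,;:!?\"'()"

def tokenize(text):
    """Naive tokenizer (illustrative assumption): split on whitespace,
    strip surrounding punctuation, lowercase."""
    stripped = (w.strip(PUNCT).lower() for w in text.split())
    return [w for w in stripped if w]

def type_token_ratio(tokens):
    """Number of distinct word types divided by number of tokens."""
    return len(set(tokens)) / len(tokens) if tokens else 0.0

# Hypothetical toy corpus of two short documents.
docs = {
    "doc_a": "The cat sat on the mat.",
    "doc_b": "The dog sat on the log.",
}

# Per-document view: each text is analysed separately.
per_doc_tokens = {name: tokenize(text) for name, text in docs.items()}
per_doc_ttr = {name: type_token_ratio(toks) for name, toks in per_doc_tokens.items()}

# Corpuswide view: all tokens are pooled before counting.
corpus_tokens = [tok for toks in per_doc_tokens.values() for tok in toks]
corpus_freq = Counter(corpus_tokens)
corpus_ttr = type_token_ratio(corpus_tokens)

print(corpus_freq.most_common(3))
print(per_doc_ttr, corpus_ttr)
```

In this toy corpus each document has a type-token ratio of 5/6, while the pooled corpus has 7/12, because words shared between the documents count as a single type once the texts are combined. The corpuswide figure is therefore not simply the average of the per-document figures.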

Common corpuswide analyses include computing overall word frequency distributions, type-token ratios across the complete corpus, dispersion

Applications of corpuswide analysis span dictionary development and lexicography (estimating word frequencies and lexical coverage), vocabulary

Limitations include dependence on the representativeness and quality of the underlying corpus. Corpuswide results can obscure

characteristics

Methodologically,

a

interpretation.