ordhyppighed
Ordhyppighed, in linguistics, refers to how often words occur within a text or corpus. It can be expressed as absolute frequency (the count) or relative frequency (the proportion or per-million words). Frequency can be measured for tokens (each actual word form) or types (distinct word forms). A frequency list orders words from most to least frequent and is a common resource in lexicography, language teaching, and natural language processing.
Frequency data come from corpora—text collections intended to represent a language or domain. Results depend on
Word-frequency distributions typically follow Zipf's law: a small set of function words accounts for a large
Applications include lexicography (prioritizing entries), language teaching (focusing on high-frequency vocabulary), and natural language processing tasks
Limitations include sampling bias, domain differences, and diachronic change. Frequency lists describe a corpus, not a