worldcount
Worldcount is a term for the total number of words in a body of text or across a collection of texts. In writing and publishing, it serves as a size metric for gauging length and managing production. In computational linguistics and text analytics, worldcount can refer to the aggregate word count of a corpus, as distinct from per-document word counts.
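The distinction between a corpus-level total and per-document counts can be illustrated with a minimal sketch. The function name corpus_word_counts and the sample file names are hypothetical, and the sketch assumes plain whitespace splitting as the counting rule; other rules would give other totals.

```python
def corpus_word_counts(documents: dict[str, str]) -> tuple[dict[str, int], int]:
    """Return per-document word counts and their aggregate (hypothetical helper)."""
    # Assumption: a "word" is any run of non-whitespace characters.
    per_document = {name: len(text.split()) for name, text in documents.items()}
    # The aggregate is simply the sum of the per-document counts.
    aggregate = sum(per_document.values())
    return per_document, aggregate


# Illustrative documents only, not real data.
documents = {
    "a.txt": "the quick brown fox",
    "b.txt": "jumps over the lazy dog",
}
per_document, aggregate = corpus_word_counts(documents)
print(per_document)
print(aggregate)
```

Keeping both values lets a caller report the corpus total while still inspecting individual documents.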
Counting methods vary. A simple approach tokenizes on whitespace, counting each sequence of characters separated by whitespace as one word.
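The whitespace approach can be sketched in a few lines of Python. The function name whitespace_word_count is an assumption made for illustration. Under this rule, punctuation attached to a word stays part of that word and does not add to the count.

```python
def whitespace_word_count(text: str) -> int:
    """Count runs of non-whitespace characters (hypothetical helper)."""
    # str.split() with no argument treats any run of spaces, tabs, or
    # newlines as a single delimiter and ignores leading/trailing whitespace.
    return len(text.split())


sample = "  Counting   methods\tvary.\nA simple approach works.  "
print(whitespace_word_count(sample))
```

Because repeated or mixed whitespace collapses into one delimiter, irregular spacing in the input does not inflate the result.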
Applications include editing, where target word counts guide revisions; estimation of reading time and cost; and natural language processing.
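Reading-time and cost estimates typically reduce to simple arithmetic on a word count. The sketch below is illustrative only: the function names, the default of 200 words per minute, and the per-word rate are assumptions, not values taken from any standard.

```python
import math


def estimate_reading_minutes(word_count: int, words_per_minute: int = 200) -> int:
    """Estimate reading time in whole minutes, rounded up (hypothetical helper)."""
    if words_per_minute <= 0:
        raise ValueError("words_per_minute must be positive")
    # Assumption: a constant reading speed across the whole text.
    return math.ceil(word_count / words_per_minute)


def estimate_cost(word_count: int, rate_per_word: float) -> float:
    """Estimate a per-word cost, rounded to two decimals (hypothetical helper)."""
    return round(word_count * rate_per_word, 2)


print(estimate_reading_minutes(1500))
print(estimate_cost(1500, 0.08))
```

Both estimates inherit whatever counting rule produced the word count, so the same text can yield different figures under different tokenization choices.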
Limitations: no universal standard defines worldcount. Results depend on tokenization, language, and which categories are included (e.g., numbers).
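How much the included categories matter can be seen by counting the same sentence with and without numeric tokens. This is a rough sketch under stated assumptions: the names count_words and NUMBER_PATTERN are hypothetical, and the definition of a "number" here (digits with optional comma or period groups) is one arbitrary choice among many.

```python
import re

# Assumption: a numeric token is digits, optionally grouped by "," or ".".
NUMBER_PATTERN = re.compile(r"\d+(?:[.,]\d+)*")


def count_words(text: str, include_numbers: bool = True) -> int:
    """Whitespace-based count with optional exclusion of numeric tokens."""
    tokens = text.split()
    if include_numbers:
        return len(tokens)
    kept = []
    for token in tokens:
        # Strip surrounding punctuation before testing, so "42," is seen as "42".
        core = token.strip(".,;:!?()\"'")
        if not NUMBER_PATTERN.fullmatch(core):
            kept.append(token)
    return len(kept)


sample = "The report has 3 chapters, 42 figures, and 1,250 footnotes."
print(count_words(sample, include_numbers=True))
print(count_words(sample, include_numbers=False))
```

The two calls disagree on the same input, which is why a reported count is only comparable to another when both state their counting rules.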
See also: word count, tokenization, corpus linguistics, text analytics, readability metrics.