textuellen
Textuellen is a term used in some circles of linguistics and digital humanities to denote the basic textual units used in analysis. It refers to the units that carry meaning within a text under a given analytic framework, ranging from individual words to larger segments such as phrases or clauses, depending on the granularity chosen by the researcher. The term is a neologism and not part of a universally adopted standard; its precise definition varies with methodology and domain.
In practice, defining textuellen involves setting a segmentation strategy and a criterion for semantic coherence. Tokenization,
Textuellen are used in corpus linguistics, authorship and stylistic analysis, information retrieval, and natural language processing.
Related concepts include tokens, lexemes, n-grams, discourse units, segmentation, corpus linguistics, and text analysis. The term