termdocument
Termdocument is a concept used in information retrieval and text analytics to describe the association between a term and a specific document. In practice, a termdocument is a record that captures the occurrence or presence of a term within a document. Such records form part of larger data structures such as a term-document matrix or an inverted index.
A typical termdocument entry includes fields such as the term itself, the document identifier, and a measure of the term's occurrence in that document, such as its term frequency.
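
As an illustration of such an entry, the following is a minimal sketch in Python. The names TermDocument and build_termdocuments are hypothetical, and two things are assumed rather than taken from any particular system: the occurrence measure is a raw count, and tokenization is a simple lowercase whitespace split.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class TermDocument:
    """One term-document record (hypothetical field names)."""
    term: str        # the term itself
    doc_id: str      # identifier of the document containing the term
    frequency: int   # occurrence measure; a raw count is assumed here


def build_termdocuments(doc_id: str, text: str) -> list[TermDocument]:
    """Create one record per distinct term in a single document."""
    # Naive tokenization (an assumption): lowercase, split on whitespace.
    counts = Counter(text.lower().split())
    # Sort by term so the output order is deterministic.
    return [TermDocument(term, doc_id, n) for term, n in sorted(counts.items())]


entries = build_termdocuments("d1", "the cat sat on the mat")
for entry in entries:
    print(entry)
```

A real system would likely replace the raw count with a weight such as TF-IDF and use a proper tokenizer; the record shape stays the same.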
Inverted indexes use a collection of termdocuments to map each term to the documents in which it appears.
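
The mapping from terms to documents can be sketched as a small in-memory inverted index. This is an illustrative example only: build_inverted_index is a hypothetical helper, the per-document value stored is assumed to be a raw term count, and tokenization is again a plain lowercase whitespace split.

```python
from collections import defaultdict


def build_inverted_index(docs: dict[str, str]) -> dict[str, dict[str, int]]:
    """Map each term to {doc_id: count} for the documents containing it."""
    index: dict[str, dict[str, int]] = defaultdict(dict)
    for doc_id, text in docs.items():
        # Naive tokenization (an assumption): lowercase, split on whitespace.
        for token in text.lower().split():
            postings = index[token]
            postings[doc_id] = postings.get(doc_id, 0) + 1
    return dict(index)


docs = {
    "d1": "the cat sat",
    "d2": "the dog sat down",
}
index = build_inverted_index(docs)

# Look up the documents in which a term appears.
print(sorted(index["sat"]))
print(index.get("cat", {}))
```

Each inner {doc_id: count} pair corresponds to one termdocument record; grouping the records by term is what turns the collection into an index that can answer term lookups without scanning every document.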
Applications include search engines, document clustering, topic modeling, and keyword extraction. A well-designed termdocument representation supports
See also: term frequency, inverse document frequency, TF-IDF, inverted index, document-term matrix.