vektorutrymmesmodell
A vektorutrymmesmodell, often translated as vector space model, is a fundamental concept in information retrieval and natural language processing. It represents documents and queries as vectors in a high-dimensional space. Each dimension in this space corresponds to a unique term (word) from a predefined vocabulary. The value in each dimension for a given document vector indicates the importance or presence of that term within the document.
The process of creating these vectors typically involves tokenization, where text is broken down into individual
Once documents and queries are represented as vectors, their similarity can be calculated using mathematical measures,