Sanastotietoja
Sanastotietoja is a Finnish compound term that can be translated as "vocabulary data" or "lexical information." It refers to structured collections of words, terms and related metadata used for language description, terminology management, lexicography and language technology. Sanastotietoja may appear as glossaries, termbases, lexical databases or entries in corpora and dictionaries.
Typical contents include headwords, part of speech, morphological forms, definitions, translations, usage examples and domain labels.
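As an illustration, the kinds of fields listed above could be held in a single record per headword. The following Python sketch is only one possible layout, not a standard schema: the class name `LexicalEntry`, the field names, the helper `entries_in_domain` and the sample entry are all assumptions made for this example.

```python
from dataclasses import dataclass, field


@dataclass
class LexicalEntry:
    """One record of vocabulary data (hypothetical schema for illustration)."""
    headword: str
    part_of_speech: str
    definition: str
    # Maps a form label (e.g. "genitive singular") to the inflected form.
    morphological_forms: dict[str, str] = field(default_factory=dict)
    # Maps a language code to a list of translation equivalents.
    translations: dict[str, list[str]] = field(default_factory=dict)
    usage_examples: list[str] = field(default_factory=list)
    domain_labels: list[str] = field(default_factory=list)


def entries_in_domain(entries, domain):
    """Return the headwords of all entries tagged with the given domain label."""
    return [e.headword for e in entries if domain in e.domain_labels]


# Sample entry; the domain label is an illustrative choice.
entry = LexicalEntry(
    headword="sanasto",
    part_of_speech="noun",
    definition="A vocabulary or glossary.",
    morphological_forms={"genitive singular": "sanaston"},
    translations={"en": ["vocabulary", "glossary"]},
    domain_labels=["lexicography"],
)

print(entries_in_domain([entry], "lexicography"))
```

Keeping translations and morphological forms as mappings rather than flat strings makes it straightforward to filter or export entries by language or form, which is the kind of operation termbases and lexical databases typically need.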
Sources and methods for creating sanastotietoja range from manual expert curation and crowdsourcing to automated extraction.
Applications include machine translation, information retrieval, terminology work in specialized domains, language learning tools and linguistic research.