skannatussa
Skannatussa is a term used in digital archiving to describe the state and outputs of mass digitization efforts. It encompasses the scanned images, the optical character recognition (OCR) text, and the metadata that accompany digital surrogates in a repository. The concept emphasizes image fidelity, text searchability, and long-term preservation readiness.
Typical workflows produce digital objects that combine high-resolution image files (often TIFF or JPEG 2000), OCR-derived
Skannatussa is widely applied in libraries, archives, and museums to improve access to historical and fragile
Etymology and usage: In Finnish-language documentation, skannatussa is a participial form derived from skannata (to scan)
Challenges include OCR accuracy for older or non-Latin scripts, multilingual content, and handwriting. Costs of high-quality
See also: Digital preservation, Digitization, Optical character recognition, Metadata standards, PDF/A, TIFF.