Rådata
Rådata, short for rådata in Swedish, refers to data in its original, unprocessed form as collected from a source. Also called raw data, it has not been cleaned, transformed, or aggregated, and it often contains errors, missing values, duplicates, or inconsistencies. Rådata serves as the baseline for subsequent analysis and processing.
Sources of rådata include measurement instruments and sensors, logs from information systems, transaction records, surveys, and
The role of rådata in the data lifecycle is to provide an immutable reference point for analyses.
Formats and storage for raw data vary widely and include CSV, JSON, Parquet, log formats, and multimedia
Common challenges in handling rådata include noise, sensor drift, time synchronization issues, and biases introduced during