ASRbaserade
ASRbaserade describes methods, systems and services that rely on automatic speech recognition (ASR) to convert spoken language into text or actionable output. The term is used across software, hardware and service domains to indicate that ASR is the primary processing step.
Core components of ASRbaserade systems typically include an acoustic model, a language model and a decoder.
Applications of ASRbaserade span real-time transcription, voice assistants, subtitling, call-centre analytics and assistive technologies. Benefits include
Evaluation is commonly based on word error rate (WER), but deployments may also consider latency, robustness
Prominent tools and platforms include open-source toolchains, such as Kaldi and Vosk, along with commercial offerings