Vosk
Vosk is an open-source offline speech recognition toolkit designed for real-time transcription on mobile devices, desktops, and embedded systems. It provides lightweight, streaming recognition across multiple languages and runs entirely offline on CPUs. The project supports Windows, Linux, macOS, Android, iOS, and embedded platforms such as Raspberry Pi, making it suitable for client-side and server-side deployments.
The recognition engine behind Vosk is built around Kaldi-based acoustic modeling with neural network front-ends.
Vosk offers bindings for several programming languages, including Python, Java, C#, and JavaScript (Node.js), enabling integration into applications written in those languages.
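As an illustration of the Python binding, the following is a minimal sketch of streaming recognition over a WAV file. It assumes the vosk package is installed, that a model has been downloaded and unpacked to a local directory, and that the audio is 16-bit mono PCM. The function names iter_pcm_chunks and transcribe, the chunk size, and all file paths are hypothetical choices made for this example and are not part of Vosk itself.

```python
import json
import wave


def iter_pcm_chunks(wav_path, frames_per_chunk=4000):
    """Yield raw PCM byte chunks from a WAV file.

    Reading a file in small pieces stands in for a live audio stream.
    The chunk size is an arbitrary choice for this sketch.
    """
    with wave.open(wav_path, "rb") as wf:
        while True:
            data = wf.readframes(frames_per_chunk)
            if not data:
                break
            yield data


def transcribe(wav_path, model_dir):
    """Return the recognized text for a 16-bit mono PCM WAV file.

    model_dir is assumed to point at an unpacked Vosk model directory.
    """
    # Imported here so the helper above can be used without vosk installed.
    from vosk import KaldiRecognizer, Model

    with wave.open(wav_path, "rb") as wf:
        sample_rate = wf.getframerate()

    recognizer = KaldiRecognizer(Model(model_dir), sample_rate)

    pieces = []
    for chunk in iter_pcm_chunks(wav_path):
        # AcceptWaveform returns a true value once an utterance is complete;
        # Result() then returns that utterance as a JSON string.
        if recognizer.AcceptWaveform(chunk):
            pieces.append(json.loads(recognizer.Result()).get("text", ""))

    # FinalResult() flushes whatever audio is still buffered.
    pieces.append(json.loads(recognizer.FinalResult()).get("text", ""))
    return " ".join(p for p in pieces if p)
```

A call such as transcribe("speech.wav", "model") would return the transcript as a single string, with both paths being placeholders. For live audio, the same loop could be fed from a microphone buffer instead of a file, and the recognizer's PartialResult() method could be polled between chunks to display interim hypotheses.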
Licensing and availability: Vosk is released under the Apache License 2.0. It is maintained by Alpha Cephei.
See also: Kaldi-based speech recognition, offline ASR, language models, speech recognition APIs.