Ivectors
i-vectors, short for identity vectors, are compact fixed-length representations of speech utterances designed to capture both speaker characteristics and channel or recording variability. They provide a single vector that summarizes an utterance, enabling standard statistical methods to be applied for speaker recognition tasks such as verification and identification.
The core idea is to model all variability in speech with a total variability space. A large
Typical i-vector dimensions are a few hundred, commonly in the 400–600 range. Training requires substantial data
Applications include speaker verification, speaker identification, and diarization. The approach played a dominant role in speaker