VCTK
The VCTK Corpus is a publicly available dataset designed for research in speech synthesis and voice conversion. It was created by the Centre for Speech Technology Research (CSTR) at the University of Edinburgh. The corpus consists of speech recordings from 109 native English speakers, each reading out a set of 400 sentences. These sentences are designed to cover a wide range of phonetic contexts, making the corpus particularly useful for training and evaluating speech synthesis models.
The VCTK Corpus is notable for its high-quality recordings, which were made in a professional studio environment.
The corpus has been widely used in the research community for tasks such as text-to-speech synthesis, voice