Centre for Speech Technology Research (CSTR) research projects
Browse by
A variety of speech technology data. Recordings include The Rainbow Passage. Image: Detail showing a rainbow from "Late Autumn Landscape, Cambuskenneth" by Thomas Fenwick © The University of Edinburgh, all rights reserved.
Items in this Collection
-
The Edinburgh International Accents of English Corpus
English is the most widely spoken language in the world, used daily by millions of people as a first or second language in many different contexts. As a result, there are many varieties of English. Although the great many ... -
SUPERSEDED - The Edinburgh International Accents of English Corpus
## This item has been replaced by the one which can be found at https://datashare.ed.ac.uk/handle/10283/4836 - https://doi.org/10.7488/ds/3832 ##. English is the most widely spoken language in the world, used daily by ... -
REYD Yiddish TTS Corpus
* The Reading Electronic Yiddish Documents (REYD) Dataset. The REYD TTS dataset is a speech dataset for Yiddish consisting of 4,892 short audio clips, with a total duration of 475.7 minutes. The recordings are of three ... -
CSTR VCTK Corpus: English Multi-speaker Corpus for CSTR Voice Cloning Toolkit (version 0.92)
This CSTR VCTK Corpus includes speech data uttered by 110 English speakers with various accents. Each speaker reads out about 400 sentences, which were selected from a newspaper, the rainbow passage and an elicitation ... -
Parallel Audiobook Corpus
The Parallel Audiobook Corpus (version 1.0) is a collection of parallel readings of audiobooks. The corpus consists of approximately 121 hours of speech at 22.05KHz across 4 books and 59 speakers. The data is provided in ... -
Noisy reverberant speech database for training speech enhancement algorithms and TTS models
Noisy reverberant speech database. The database was designed to train and test speech enhancement (noise suppression and dereverberation) methods that operate at 48kHz. Clean speech was made reverberant and noisy by ... -
Noisy speech database for training speech enhancement algorithms and TTS models
Clean and noisy parallel speech database. The database was designed to train and test speech enhancement methods that operate at 48kHz. A more detailed description can be found in the papers associated with the database. ... -
96kHz version of the CSTR VCTK Corpus
This dataset includes 96kHz version of the CSTR VCTK Corpus including speech data uttered by 109 native speakers of English with various accents. The main dataset can be found at https://doi.org/10.7488/ds/1994 (containing ... -
SUPERSEDED - CSTR VCTK Corpus: English Multi-speaker Corpus for CSTR Voice Cloning Toolkit
## This item has been replaced by the one which can be found at https://doi.org/10.7488/ds/2645 ##' This CSTR VCTK Corpus (Centre for Speech Technology Voice Cloning Toolkit) includes speech data uttered by 109 native ... -
The SIWIS French Speech Synthesis Database
The SIWIS French Speech Synthesis Database includes high quality French speech recordings and associated text files, aimed at building TTS systems, investigate multiple styles, and emphasis. A total of 9750 utterances from ... -
SUPERSEDED - CSTR VCTK Corpus: English Multi-speaker Corpus for CSTR Voice Cloning Toolkit
# SUPERSEDED - This item has been replaced by the one which can be found at https://doi.org/10.7488/ds/1994 . # This CSTR VCTK Corpus (Centre for Speech Technology Voice Cloning Toolkit) includes speech data uttered by 109 ... -
Reverberant speech database for training speech dereverberation algorithms and TTS models
Reverberant speech database. The database was designed to train and test speech dereverberation methods that operate at 48kHz. Clean speech was made reverberant by convolving it with a room impulse response. The room impulse ...