Show simple item record

Depositordc.contributorYamagishi, Junichi
Funderdc.contributor.otherEPSRC - Engineering and Physical Sciences Research Council
Funderdc.contributor.otherThe Royal Society of Edinburgh
Funderdc.contributor.otherJapan Science & Technology Agency (JST). Core Research for Evolutionary Science and Technology (CREST)
Data Creatordc.creatorVeaux, Christophe
Data Creatordc.creatorYamagishi, Junichi
Date Accessioneddc.date.accessioned2017-07-21T12:16:57Z
Date Availabledc.date.available2017-07-21T12:16:57Z
Citationdc.identifier.citationVeaux, Christophe; Yamagishi, Junichi. (2017). 96kHz version of the CSTR VCTK Corpus, [sound]. University of Edinburgh. The Centre for Speech Technology Research (CSTR). https://doi.org/10.7488/ds/2101.en
Persistent Identifierdc.identifier.urihttps://hdl.handle.net/10283/2774
Persistent Identifierdc.identifier.urihttps://doi.org/10.7488/ds/2101
Dataset Description (abstract)dc.description.abstractThis dataset includes 96kHz version of the CSTR VCTK Corpus including speech data uttered by 109 native speakers of English with various accents. The main dataset can be found at https://doi.org/10.7488/ds/1994 (containing a lower quality set of the same recordings) - for further details about this work please see the README.txt file and metadata at https://doi.org/10.7488/ds/1994.
Dataset Description (TOC)dc.description.tableofcontentsPlease see the README.txt file at https://doi.org/10.7488/ds/1994.
Publisherdc.publisherUniversity of Edinburgh. The Centre for Speech Technology Research (CSTR)
Relation (Is Version Of)dc.relation.isversionofhttps://doi.org/10.7488/ds/1994
Relation (Is Referenced By)dc.relation.isreferencedbyhttps://arxiv.org/pdf/1609.03499.pdf
Relation (Is Referenced By)dc.relation.isreferencedbyvan den Oord, A et. al "WaveNet: A Generative Model for Raw Audio" arXiv:1609.03499v2 [cs.SD] 19 Sep 2016
Relation (Is Referenced By)dc.relation.isreferencedby"WaveNet: A Generative Model for Raw Audio" https://deepmind.com/blog/wavenet-generative-model-raw-audio/
Rightsdc.rightsThis corpus is licensed under Open Data Commons Attribution License(ODC-By) v1.0. http://opendatacommons.org/licenses/by/1.0/ http://opendatacommons.org/licenses/by/summary/
Sourcedc.sourceThe Rainbow Passage which the speakers read out can be found in the International Dialects of English Archive: (http://web.ku.edu/~idea/readings/rainbow.htm).
Sourcedc.sourceThe elicitation paragraph which the speakers read out is identical to the one used for the speech accent archive (http://accent.gmu.edu).
Subjectdc.subjectspeech synthesis
Subjectdc.subjectHMM
Subject Classificationdc.subject.classificationMathematical and Computer Sciences::Speech and Natural Language Processing
Titledc.title96kHz version of the CSTR VCTK Corpus
Alternative Titledc.title.alternativeHigher quality version of the Centre for Speech Technology Research Voice Cloning Toolkit
Typedc.typesound

Download All
zip file MD5 Checksum: db9ecdb250062dcc32ff651fa655323b

Files in this item

Thumbnail
Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record