
Depositor (dc.contributor): Yamagishi, Junichi
Funder (dc.contributor.other): EPSRC - Engineering and Physical Sciences Research Council [en_UK]
Data Creator (dc.creator): Yamagishi, Junichi
Date Accessioned (dc.date.accessioned): 2015-07-08T08:24:19Z
Date Available (dc.date.available): 2015-07-08T08:24:19Z
Citation (dc.identifier.citation): Yamagishi, Junichi. (2015). Listening test materials for "Multiple Feed-forward Deep Neural Networks for Statistical Parametric Speech Synthesis", [dataset]. The Centre for Speech Technology Research (CSTR). https://doi.org/10.7488/ds/282. [en]
Persistent Identifier (dc.identifier.uri): https://hdl.handle.net/10283/824
Persistent Identifier (dc.identifier.uri): https://doi.org/10.7488/ds/282
Dataset Description (abstract) (dc.description.abstract): In the paper that this dataset accompanies, we investigate a combination of several feed-forward deep neural networks (DNNs) for a high-quality statistical parametric speech synthesis system. Recently, DNNs have significantly improved the performance of essential components of statistical parametric speech synthesis, e.g. spectral feature extraction, acoustic modelling and spectral post-filtering. In this paper, our proposed technique combines these feed-forward DNNs so that they perform all standard steps of statistical speech synthesis from end to end, including feature extraction from STRAIGHT spectral amplitudes, acoustic modelling, smooth trajectory generation and spectral post-filtering. The proposed DNN-based speech synthesis system is then compared with state-of-the-art speech synthesis systems, i.e. conventional HMM-based, DNN-based and unit-selection systems. [en]
Language (dc.language.iso): eng [en_UK]
Publisher (dc.publisher): The Centre for Speech Technology Research (CSTR) [en_UK]
Relation (Is Referenced By) (dc.relation.isreferencedby): Shinji Takaki, SangJin Kim, Junichi Yamagishi, JongJin Kim, "Multiple Feed-forward Deep Neural Networks for Statistical Parametric Speech Synthesis", Interspeech 2015 [en_UK]
Rights (dc.rights): Creative Commons Attribution 4.0 International Public License [en]
Title (dc.title): Listening test materials for "Multiple Feed-forward Deep Neural Networks for Statistical Parametric Speech Synthesis" [en_UK]
Type (dc.type): dataset [en_UK]

Download All
zip file MD5 Checksum: a55cea91a508502eb46a99f9e7a258ef
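
The published MD5 checksum can be used to confirm that the zip file downloaded intact. The following is a minimal sketch of one way to do that in Python. The local filename `download_all.zip` and the helper names `md5_of_file` and `matches_published_checksum` are assumptions made for illustration and do not come from the record. Only the checksum value is taken from this page.

```python
import hashlib
from pathlib import Path

# Checksum published on the record page for the "Download All" zip file.
EXPECTED_MD5 = "a55cea91a508502eb46a99f9e7a258ef"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of the file at `path`, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large archives are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_checksum(path, expected=EXPECTED_MD5):
    """Compare the file's MD5 digest with the expected value, ignoring letter case."""
    return md5_of_file(path).lower() == expected.lower()


if __name__ == "__main__":
    # Hypothetical local filename; use whatever name the download was saved under.
    archive = Path("download_all.zip")
    if archive.exists():
        print("checksum OK" if matches_published_checksum(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found")
```

A mismatch usually indicates an incomplete or corrupted download, in which case fetching the archive again is the simplest remedy. MD5 is adequate for detecting accidental corruption of this kind, but it is not a safeguard against deliberate tampering.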


