Low-dimensional representation of spectral envelope using deep auto-encoder for speech synthesis

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 167
  • Download : 0
This paper proposes a deep auto-encoder structure to extract robust spectral features for statistical parametric speech synthesis. The technique allows us to compress the low-dimensional features from high dimensional spectral envelope without degradation for full-band speech in a data-driven way. We carried out a subjective evaluation and found that the optimum auto-encoder architecture. Experimental results showed that an analysis-by-synthesis using the proposed auto-encoder has lower reconstruction error of spectral envelope than conventional mel-cepstral analysis in narrow-band as well as full-band. Our results confirm that the proposed method increases the quality of synthesized speech in text-to-speech experiments.
Publisher
Association for Computing Machinery
Issue Date
2018-02-22
Language
English
Citation

2nd International Conference on Mechatronics Systems and Control Engineering, ICMSCE 2018, pp.107 - 111

DOI
10.1145/3185066.3185088
URI
http://hdl.handle.net/10203/247542
Appears in Collection
EE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0