Enhanced Maximum Voiced Frequency Estimation Scheme for HTS Using Two-Band Excitation Model

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 775
  • Download : 866
In a hidden Markov model based speech synthesis system using a two-band excitation model, a maximum voiced frequency (MVF) is the most important feature as an excitation parameter because the synthetic speech quality depends on the MVF. This paper proposes an enhanced MVF estimation scheme based on a peak picking method. In the proposed scheme, both local peaks and peak lobes are picked from the spectrum of a linear predictive residual signal. The average of the normalized distances of local peaks and peak lobes is calculated and utilized as a feature to estimate an MVF. Experimental results of both objective and subjective tests show that the proposed scheme improves the synthetic speech quality compared with that of a conventional one in a mobile device as well as a PC environment.
Publisher
ELECTRONICS TELECOMMUNICATIONS RESEARCH INST
Issue Date
2015-12
Language
English
Article Type
Article
Keywords

SPEECH SYNTHESIS SYSTEM; PARAMETER GENERATION

Citation

ETRI JOURNAL, v.37, no.6, pp.1211 - 1219

ISSN
1225-6463
DOI
10.4218/etrij.15.0115.0124
URI
http://hdl.handle.net/10203/205504
Appears in Collection
EE-Journal Papers(저널논문)
Files in This Item
000366151900016.pdf(407.56 kB)Download

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0