Psychoacoustically Constrained and Distortion Minimized Speech Enhancement

This paper considers a psychoacoustically constrained, distortion-minimized speech enhancement algorithm. Noise reduction generally introduces speech distortion, so a balance between the two must be struck. A constrained optimization problem is formulated that minimizes speech distortion while keeping the sum of speech distortion and residual noise below the masking threshold of the clean speech. Because this problem may have no feasible solution under certain conditions, a slack variable is introduced to allow some deviation from the constraint. To estimate the power spectral density and the masking threshold of the clean speech, a speech model that assumes coexisting deterministic and stochastic components is used. Experimental results show that the algorithm outperforms some of the more popular algorithms in terms of improvement in segmental signal-to-noise ratio (SegSNR), spectral distance (SD), modified Bark spectral distortion (MBSD), and mean opinion score (MOS).
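The constrained optimization described in the abstract can be illustrated with a simplified, per-frequency-bin sketch. This is not the paper's formulation: it assumes a real gain g applied to the noisy spectrum, models speech distortion as (1 - g)^2 * ps and residual noise as g^2 * pn, and treats the clean-speech PSD ps, the noise PSD pn, and the masking threshold as already known. The function name constrained_gain and the way the slack is chosen (the smallest relaxation of the threshold that makes the problem feasible) are assumptions made for illustration only.

```python
import math


def constrained_gain(ps, pn, threshold):
    """Illustrative per-bin gain; NOT the algorithm from the paper.

    Assumed model (hypothetical simplification):
      speech distortion  = (1 - g)**2 * ps
      residual noise     = g**2 * pn
    Goal: minimize speech distortion subject to
      speech distortion + residual noise <= threshold + slack,
    where slack >= 0 is the smallest relaxation that makes the
    constraint satisfiable.

    Returns (gain, slack).
    """
    total = ps + pn
    if total <= 0.0:
        # No signal energy at all: pass the bin through unchanged.
        return 1.0, 0.0

    # The total error is smallest at the Wiener gain ps / (ps + pn),
    # where it equals ps * pn / (ps + pn).
    min_err = ps * pn / total

    # If even that minimum exceeds the threshold, the constraint is
    # infeasible and must be relaxed by this much.
    slack = max(0.0, min_err - threshold)

    # Distortion shrinks as g grows toward 1, so take the largest g that
    # still satisfies (ps + pn) g^2 - 2 ps g + ps - threshold <= 0.
    # The discriminant term is clamped to 0 in the relaxed case, which
    # yields the Wiener gain.
    disc = max(0.0, threshold * total - ps * pn)
    gain = min(1.0, (ps + math.sqrt(disc)) / total)
    return gain, slack


if __name__ == "__main__":
    # Feasible bin: the constraint is active and no slack is needed.
    print(constrained_gain(1.0, 3.0, 1.0))
    # Infeasible bin: the threshold is relaxed and the gain falls back
    # to the Wiener gain.
    print(constrained_gain(4.0, 4.0, 1.0))
```

Under these assumptions, a bin whose noise already lies below the masking threshold is left untouched (gain 1), and an infeasible bin falls back to the Wiener gain with a positive slack. The paper's own treatment of the slack variable and its estimation of the PSD and masking threshold may differ.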
Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Issue Date
2010-11
Language
ENG
Keywords

SPECTRAL AMPLITUDE ESTIMATOR; SIGNAL SUBSPACE APPROACH; HIDDEN MARKOV-MODELS; KALMAN FILTER; MASKING PROPERTIES; NOISE SUPPRESSION; AUDITORY-SYSTEM; COLORED NOISE

Citation

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, v.18, no.8, pp.2099 - 2110

ISSN
1558-7916
DOI
10.1109/TASL.2010.2041119
URI
http://hdl.handle.net/10203/97047
Appears in Collection
EE-Journal Papers