Speech Segregation based on Pitch Track Correction and Music-Speech Classification

Cited 2 time in webofscience Cited 0 time in scopus
  • Hit : 988
  • Download : 343
DC FieldValueLanguage
dc.contributor.authorKim, Han-Gyuko
dc.contributor.authorJang, Gil-Jinko
dc.contributor.authorPark, Jeong-Sikko
dc.contributor.authorKim, Ji-Hwanko
dc.contributor.authorOh, Yung-Hwanko
dc.date.accessioned2013-03-12T18:39:17Z-
dc.date.available2013-03-12T18:39:17Z-
dc.date.created2012-08-08-
dc.date.created2012-08-08-
dc.date.issued2012-
dc.identifier.citationADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, v.12, no.2, pp.15 - 20-
dc.identifier.issn1582-7445-
dc.identifier.urihttp://hdl.handle.net/10203/103164-
dc.description.abstractA novel approach for pitch track correction and music-speech classification is proposed in order to improve the performance of the speech segregation system. The proposed pitch track correction method adjusts unreliable pitch estimates from adjacent reliable pitch streaks, in contrast to the previous approach using a single pitch streak which is the longest among the reliable pitch streaks in a sentence. The proposed music and speech classification method finds continuous pitch streaks of the mixture, and labels each streak as music-dominant or speech-dominant based on the observation that music pitch seldom changes in a short-time period whereas speech pitch fluctuates a lot. The speech segregation results for mixtures of speech and various competing sound sources demonstrated that the proposed methods are superior to the conventional method, especially for mixtures of speech and music signals.-
dc.languageEnglish-
dc.publisherUNIV SUCEAVA-
dc.subjectSEPARATION-
dc.titleSpeech Segregation based on Pitch Track Correction and Music-Speech Classification-
dc.typeArticle-
dc.identifier.wosid000305608000003-
dc.identifier.scopusid2-s2.0-84865301789-
dc.type.rimsART-
dc.citation.volume12-
dc.citation.issue2-
dc.citation.beginningpage15-
dc.citation.endingpage20-
dc.citation.publicationnameADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING-
dc.identifier.doi10.4316/AECE.2012.02003-
dc.contributor.localauthorOh, Yung-Hwan-
dc.contributor.nonIdAuthorJang, Gil-Jin-
dc.contributor.nonIdAuthorPark, Jeong-Sik-
dc.contributor.nonIdAuthorKim, Ji-Hwan-
dc.description.isOpenAccessY-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorSource separation-
dc.subject.keywordAuthorSpeech processing-
dc.subject.keywordAuthorSpeech analysis-
dc.subject.keywordAuthorSignal denoising-
dc.subject.keywordAuthorNoise cancellation-
dc.subject.keywordPlusSEPARATION-
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 2 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0