Vocal Removal From Multiobject Audio Using Harmonic Information for Karaoke Service

Cited 1 time in Web of Science; cited 3 times in Scopus
DC Field | Value | Language
dc.contributor.author | Park, Ji-Hoon | ko
dc.contributor.author | Kim, Kwang-Ki | ko
dc.contributor.author | Hahn, Min-Soo | ko
dc.date.accessioned | 2013-03-04T10:41:01Z | -
dc.date.available | 2013-03-04T10:41:01Z | -
dc.date.created | 2013-02-28 | -
dc.date.issued | 2013-04 | -
dc.identifier.citation | IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, v.21, no.4 | -
dc.identifier.issn | 1558-7916 | -
dc.identifier.uri | http://hdl.handle.net/10203/82426 | -
dc.description.abstract | Interactive audio services (IASs) usually provide users with audio editing functionality and they can render their own sounds according to their preference. For IASs, the spatial audio object coding (SAOC) is an appropriate multichannel coding tool that satisfies most of the required functionalities with relatively low bit rate. Nevertheless, the SAOC usually fails to remove a specific object successfully, especially the vocal object in the case of the Karaoke service. In addition, to expand the service to mobile environments, lower bit rate and complexity are required. Thus, we propose a new SAOC vocal harmonic coding technique to improve the background music quality in the Karaoke service. Namely, utilizing the harmonic information of the vocal object, we removed the harmonics of the vocal object remaining in the background music. Our experimental results confirm that the background music quality is improved by the proposed algorithm even with the low bit rate and complexity. | -
dc.language | English | -
dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | -
dc.subject | PREDOMINANT-FO ESTIMATION | -
dc.subject | SPEECH SYNTHESIS SYSTEM | -
dc.subject | 2-BAND EXCITATION | -
dc.subject | ALGORITHM | -
dc.subject | SIGNALS | -
dc.subject | MUSIC | -
dc.title | Vocal Removal From Multiobject Audio Using Harmonic Information for Karaoke Service | -
dc.type | Article | -
dc.identifier.wosid | 000314019700005 | -
dc.identifier.scopusid | 2-s2.0-84872918082 | -
dc.type.rims | ART | -
dc.citation.volume | 21 | -
dc.citation.issue | 4 | -
dc.citation.publicationname | IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | -
dc.identifier.doi | 10.1109/TASL.2012.2234116 | -
dc.embargo.liftdate | 9999-12-31 | -
dc.embargo.terms | 9999-12-31 | -
dc.contributor.localauthor | Hahn, Min-Soo | -
dc.type.journalArticle | Article | -
dc.subject.keywordAuthor | Audio object | -
dc.subject.keywordAuthor | Karaoke service | -
dc.subject.keywordAuthor | spatial audio object coding | -
dc.subject.keywordAuthor | vocal harmonic information | -
dc.subject.keywordPlus | PREDOMINANT-FO ESTIMATION | -
dc.subject.keywordPlus | SPEECH SYNTHESIS SYSTEM | -
dc.subject.keywordPlus | 2-BAND EXCITATION | -
dc.subject.keywordPlus | ALGORITHM | -
dc.subject.keywordPlus | SIGNALS | -
dc.subject.keywordPlus | MUSIC | -
Appears in Collection
EE-Journal Papers