Suggesting sounds for images from video collections

Cited 6 times in Web of Science; cited 0 times in Scopus
DC Field | Value | Language
dc.contributor.author | Solèr, Matthias | ko
dc.contributor.author | Bazin, Jean-Charles | ko
dc.contributor.author | Wang, Oliver | ko
dc.contributor.author | Krause, Andreas | ko
dc.contributor.author | Sorkine-Hornung, Alexander | ko
dc.date.accessioned | 2017-09-08T05:34:10Z | -
dc.date.available | 2017-09-08T05:34:10Z | -
dc.date.created | 2017-09-04 | -
dc.date.issued | 2016-10-08 | -
dc.identifier.citation | 14th European Conference on Computer Vision, ECCV 2016, pp. 900-917 | -
dc.identifier.uri | http://hdl.handle.net/10203/225727 | -
dc.description.abstract | Given a still image, humans can easily think of a sound associated with this image. For instance, people might associate the picture of a car with the sound of a car engine. In this paper we aim to retrieve sounds corresponding to a query image. To solve this challenging task, our approach exploits the correlation between the audio and visual modalities in video collections. A major difficulty is the high amount of uncorrelated audio in the videos, i.e., audio that does not correspond to the main image content, such as voice-over, background music, added sound effects, or sounds originating off-screen. We present an unsupervised, clustering-based solution that is able to automatically separate correlated sounds from uncorrelated ones. The core algorithm is based on a joint audio-visual feature space, in which we perform iterated mutual kNN clustering in order to effectively filter out uncorrelated sounds. To this end we also introduce a new dataset of correlated audio-visual data, on which we evaluate our approach and compare it to alternative solutions. Experiments show that our approach can successfully deal with a high amount of uncorrelated audio. | -
dc.language | English | -
dc.publisher | European Conference on Computer Vision Committee | -
dc.title | Suggesting sounds for images from video collections | -
dc.type | Conference | -
dc.identifier.wosid | 000389501700059 | -
dc.identifier.scopusid | 2-s2.0-84996931564 | -
dc.type.rims | CONF | -
dc.citation.beginningpage | 900 | -
dc.citation.endingpage | 917 | -
dc.citation.publicationname | 14th European Conference on Computer Vision, ECCV 2016 | -
dc.identifier.conferencecountry | NE | -
dc.identifier.conferencelocation | Oudemanhuispoort, University of Amsterdam | -
dc.identifier.doi | 10.1007/978-3-319-48881-3_59 | -
dc.contributor.localauthor | Bazin, Jean-Charles | -
dc.contributor.nonIdAuthor | Solèr, Matthias | -
dc.contributor.nonIdAuthor | Wang, Oliver | -
dc.contributor.nonIdAuthor | Krause, Andreas | -
dc.contributor.nonIdAuthor | Sorkine-Hornung, Alexander | -
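
Illustrative sketch
The abstract describes filtering uncorrelated sounds by iterated mutual kNN clustering in a joint audio-visual feature space. The record does not give the details, so the Python below is only a minimal sketch of that general idea, not the authors' procedure. The assumptions are: the joint space is a plain concatenation of visual and audio feature vectors, distances are Euclidean, and a sample is dropped when it has fewer than min_mutual mutual neighbours. The names mutual_knn_mask, iterated_mutual_knn_filter, k, min_mutual, and max_iters are hypothetical.

```python
import numpy as np


def mutual_knn_mask(features, k):
    """Return a boolean matrix M where M[i, j] is True iff samples i and j
    are each among the other's k nearest neighbours (Euclidean distance)."""
    n = features.shape[0]
    diff = features[:, None, :] - features[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    np.fill_diagonal(dist, np.inf)  # a sample is never its own neighbour
    k = min(k, n - 1)
    knn = np.zeros((n, n), dtype=bool)
    if k > 0:
        idx = np.argsort(dist, axis=1)[:, :k]
        knn[np.arange(n)[:, None], idx] = True
    return knn & knn.T  # keep only neighbour relations that hold both ways


def iterated_mutual_knn_filter(visual, audio, k=3, min_mutual=2, max_iters=10):
    """Return indices of samples that survive repeated mutual-kNN filtering.

    Assumption: the joint feature is the concatenation of the visual and
    audio vectors; the source does not specify how the joint space is built.
    """
    joint = np.hstack([visual, audio])
    keep = np.arange(joint.shape[0])
    for _ in range(max_iters):
        if keep.size == 0:
            break
        mutual = mutual_knn_mask(joint[keep], k)
        strong = mutual.sum(axis=1) >= min_mutual
        if strong.all():
            break  # nothing left to discard
        keep = keep[strong]
    return keep
```

Under these assumptions, samples whose audio agrees with their visual content sit close together in the joint space and keep each other as mutual neighbours, while a sample with unrelated audio (for example background music) lands far from the group and is discarded. Requiring the neighbour relation in both directions is what stops an isolated sample from surviving just because every sample has some nearest neighbours.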
Appears in Collection
GCT-Conference Papers (Academic Conference Papers)
Files in This Item
There are no files associated with this item.