You Said That?: Synthesising Talking Faces from Audio

Cited 54 time in webofscience Cited 0 time in scopus
  • Hit : 158
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorJamaludin, Amirko
dc.contributor.authorChung, Joon Sonko
dc.contributor.authorZisserman, Andrewko
dc.date.accessioned2021-11-27T06:40:31Z-
dc.date.available2021-11-27T06:40:31Z-
dc.date.created2021-11-26-
dc.date.created2021-11-26-
dc.date.created2021-11-26-
dc.date.issued2019-12-
dc.identifier.citationINTERNATIONAL JOURNAL OF COMPUTER VISION, v.127, no.11-12, pp.1767 - 1779-
dc.identifier.issn0920-5691-
dc.identifier.urihttp://hdl.handle.net/10203/289580-
dc.description.abstractWe describe a method for generating a video of a talking face. The method takes still images of the target face and an audio speech segment as inputs, and generates a video of the target face lip synched with the audio. The method runs in real time and is applicable to faces and audio not seen at training time. To achieve this we develop an encoder-decoder convolutional neural network (CNN) model that uses a joint embedding of the face and audio to generate synthesised talking face video frames. The model is trained on unlabelled videos using cross-modal self-supervision. We also propose methods to re-dub videos by visually blending the generated face into the source video frame using a multi-stream CNN model.-
dc.languageEnglish-
dc.publisherSPRINGER-
dc.titleYou Said That?: Synthesising Talking Faces from Audio-
dc.typeArticle-
dc.identifier.wosid000492425300012-
dc.identifier.scopusid2-s2.0-85061626959-
dc.type.rimsART-
dc.citation.volume127-
dc.citation.issue11-12-
dc.citation.beginningpage1767-
dc.citation.endingpage1779-
dc.citation.publicationnameINTERNATIONAL JOURNAL OF COMPUTER VISION-
dc.identifier.doi10.1007/s11263-019-01150-y-
dc.contributor.localauthorChung, Joon Son-
dc.contributor.nonIdAuthorJamaludin, Amir-
dc.contributor.nonIdAuthorZisserman, Andrew-
dc.description.isOpenAccessN-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorComputer vision-
dc.subject.keywordAuthorMachine learning-
dc.subject.keywordAuthorVisual speech synthesis-
dc.subject.keywordAuthorVideo synthesis-
Appears in Collection
EE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 54 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0