Efficient adversarial audio synthesis via progressive upsampling

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 100
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorCho, Youngwooko
dc.contributor.authorChang, Minwookko
dc.contributor.authorLee, Sanghyeonko
dc.contributor.authorLee, Hyoungwooko
dc.contributor.authorKim, Gerard-jounghyunko
dc.contributor.authorChoo, Jaegulko
dc.date.accessioned2021-12-14T06:53:59Z-
dc.date.available2021-12-14T06:53:59Z-
dc.date.created2021-12-03-
dc.date.issued2021-06-06-
dc.identifier.citation2021 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2021, pp.3410 - 3414-
dc.identifier.issn1520-6149-
dc.identifier.urihttp://hdl.handle.net/10203/290641-
dc.description.abstractThis paper proposes a novel generative model called PUGAN, which progressively synthesizes high-quality audio in a raw waveform. Progressive upsampling GAN (PUGAN) leverages the progressive generation of higher-resolution output by stacking multiple encoder-decoder architectures. Compared to an existing state-of-the-art model called WaveGAN, which uses a single decoder architecture, our model generates audio signals and converts them to a higher resolution in a progressive manner, while using a significantly smaller number of parameters, e.g., 3.17x smaller for 16 kHz output, than WaveGAN. Our experiments show that the audio signals can be generated in real time with a comparable quality to that of WaveGAN in terms of the inception scores and human perception.-
dc.languageEnglish-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.titleEfficient adversarial audio synthesis via progressive upsampling-
dc.typeConference-
dc.identifier.scopusid2-s2.0-85115183852-
dc.type.rimsCONF-
dc.citation.beginningpage3410-
dc.citation.endingpage3414-
dc.citation.publicationname2021 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2021-
dc.identifier.conferencecountryCN-
dc.identifier.conferencelocationVirtual, Toronto-
dc.identifier.doi10.1109/ICASSP39728.2021.9413954-
dc.contributor.localauthorCho, Youngwoo-
dc.contributor.localauthorChoo, Jaegul-
dc.contributor.nonIdAuthorChang, Minwook-
dc.contributor.nonIdAuthorLee, Sanghyeon-
dc.contributor.nonIdAuthorLee, Hyoungwoo-
dc.contributor.nonIdAuthorKim, Gerard-jounghyun-
Appears in Collection
RIMS Conference Papers
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0