SEMANTIC ASSOCIATION NETWORK FOR VIDEO CORPUS MOMENT RETRIEVAL

Cited 1 time in webofscience Cited 0 time in scopus
  • Hit : 65
  • Download : 0
This paper considers Semantic Association Network (SAN) for Video Corpus Moment Retrieval (VCMR) which localizes temporal moment that best corresponds to the given text query in a corpus of videos. Collaborations among common semantics from multi-modal inputs are essential for effectively understanding video together with subtitle and text query. For this collaboration, SAN associates common semantics within the same modality (by Intra Semantic Association) and across different modalities (by Inter Semantic Association) with dedicated module referred to as Modality Semantic Association (MSA). SAN surpasses existing state-of-the-art performance on the TVR and DiDeMo benchmark datasets. Extensive ablation studies and qualitative analyses show the effectiveness of the proposed model.
Publisher
Institute of Electrical and Electronics Engineers Inc.
Issue Date
2022-05-24
Language
English
Citation

47th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022, pp.1720 - 1724

ISSN
1520-6149
DOI
10.1109/ICASSP43922.2022.9747523
URI
http://hdl.handle.net/10203/298775
Appears in Collection
EE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 1 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0