Leveraging hierarchical relationship between sounds for few-shot sound event classification오디오 라벨의 위계 정보를 활용한 퓨샷 음향 사건 분류

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 65
  • Download : 0
Machine learning research for sound event recognition has generally focused on audio classes with abundant data. In this work, we exploit hierarchical relationships between sound events in a few-shot learning setup to enable classification of a wider set of sound events, given just a few examples at inference. By iteratively calculating prototypes for each level according to a given hierarchy system, our network’s feature space is encouraged to mirror the predefined relationships between sound events. Compared to a non-hierarchical few-shot baseline, our method leads to a significant increase in classification accuracy and significant decrease in mistake severity on unseen classes. Our work also proposes a new audio label taxonomy with descriptive labels that reflect actual acoustic characteristics.
Advisors
Nam, Juhanresearcher남주한researcher
Description
한국과학기술원 :문화기술대학원,
Publisher
한국과학기술원
Issue Date
2022
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 문화기술대학원, 2022.8,[iv, 27 p. :]

Keywords

Sound Event Detection▼aSound Event Recognition▼aLabel Taxonomy▼aFew-Shot Learning▼aEnvironmental Sound; 음향 사건 탐지▼a음향 사건 인식▼a라벨 위계정보▼a퓨샷러닝▼a환경음

URI
http://hdl.handle.net/10203/308284
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1008248&flag=dissertation
Appears in Collection
GCT-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0