Effective labeling on unlabeled data for imbalanced training dataset불균등 분포 학습 데이터셋에서 비라벨 데이터의 효과적인 라벨링에 대한 연구

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 87
  • Download : 0
When a small amount of labeled data and a large amount of unlabeled data exist, the research on semi-supervised learning that trains a network by efficiently utilizing the unlabeled data is actively conducting. Existing studies about semi-supervised learning only deal with situations in which the class distribution of training dataset is balanced. If class distribution of training dataset is imbalanced, those method makes the network over-fit to a class with a lot of data (major class), so the network produces wrong pseudo-label with high probability for a class with less data (minor class). In this thesis, we propose a training framework that prevents the over-fitting problem when the class distribution is imbalanced by making reliable pseudo-label of unlabeled data. We separately design the auxiliary labeling network to focus only on making pseudo-label. In order to utilize the labeled data and unlabeled data together, the labeling network has an auto-encoder-based structure. The proposed method can be applied without changing the main classification network because it increases the labeled data by producing high-reliability pseudo-labeled data through a auxiliary labeling network. In addition, a more accurate pseudo-label is produced by proposing the distance loss function of the latent feature considering the class distribution. Part of the prepared pseudo-labeled data is progressively added to the training dataset to mitigate the imbalance of class distribution.
Advisors
Kim, Munchurlresearcher김문철researcher
Description
한국과학기술원 :전기및전자공학부,
Publisher
한국과학기술원
Issue Date
2021
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2021.2,[v, 44 p. :]

Keywords

Semi-supervised learning (SSL)▼aImbalanced learning▼aVariational auto-encoder (VAE); 준지도학습▼a불균등 완화▼a가변 오토인코더

URI
http://hdl.handle.net/10203/295934
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=948703&flag=dissertation
Appears in Collection
EE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0