Deep generative positive-unlabeled learning under selection bias선택 편향 상황에서 딥 생성 모델을 이용한 양성-미분류 문제 학습

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 105
  • Download : 0
Positive-unlabeled (PU) learning is the learning of a binary classifier, but it is different from an ordinary setting because we only have some positively labeled data instances, and others are unlabeled. Learning in the positive-unlabeled (PU) setting is prevalent in real world applications. Many previous works depend upon the ‘Selected Completely At Random’ (SCAR) assumption to utilize unlabeled data, but the SCAR assumption is not often applicable to the real world due to selection bias in label observations. This paper is the first generative PU learning model without the SCAR assumption. Specifically, we derive the PU risk function without the SCAR assumption, and we generate a set of virtual PU examples to train the classifier. Although our PU risk function is more generalizable, the function requires PU instances that do not exist in the observations. Therefore, we introduce the VAE-PU, which is a variant of variational autoencoders to separate two latent variables that generate either features or observation indicators. The separated latent information enables the model to generate virtual PU instances. We test the VAE-PU on benchmark datasets with and without the SCAR assumption. The results indicate that the VAE-PU is superior when selection bias exists, and the VAE-PU is also competent under the SCAR assumption. The results also emphasize that the VAE-PU is effective when there are few positive-labeled instances due to modeling on selection bias.
Advisors
Moon, Il-Chulresearcher문일철researcher
Description
한국과학기술원 :산업및시스템공학과,
Publisher
한국과학기술원
Issue Date
2021
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 산업및시스템공학과, 2021.2,[iv, 28 p. :]

Keywords

Positive-unlabeled learning▼aSelection bias▼aSelected completely at random assumption▼aRisk function▼aDeep generative model▼aVariational autoencoders; 양성-미분류 문제 학습▼a선택 편향▼a완전 무작위 선택 가정▼a위험 함수▼a딥 생성 모델▼a변분 오토인코더

URI
http://hdl.handle.net/10203/295309
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=948501&flag=dissertation
Appears in Collection
IE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0