Plausibility assessment of triples with distant supervision원거리학습을 활용한 트리플 타당성 평가

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 444
  • Download : 0
Building knowledge bases by extracting triples from text has attracted significant attention but introduced another fundamental problem: rampant triples with erroneous expressions which are rarely found in human expressions. Recent validation research has not paid much attention to the generators of expressions when validating triples. Machine-extracted expressions should be validated in a different way unlike human-generated expressions because they are extracted without intention or consciousness. Focusing on the plausibility assessment of triples, this research proposes a new plausible/nonsensical framework overlaid with a true/false framework. Then it conceptualizes the validation of machine-extracted triples as a two-step procedure: a domain-independent plausibility assessment and a domain-dependent truth validation only for plausible triples. Furthermore, this research introduces two learning methods. A distant supervision method consistently builds both positive and negative training data, eliminating the need for indefinable but obligatory negative training data. A lazy learning algorithm skips the generation of pre-defined models that have difficulty in dealing with triples various expressions. These algorithms also learn some form of semantic relationships that improve the performance of plausibility assessment. The experimental results support the proposed approach, which outperformed several unsupervised baselines. The proposed approach can be used to filter out newly extracted nonsensical triples and existing nonsensical triples in knowledge bases. It can be used on its own, or it can complement existing truth validation process. Extending background knowledge for better coverage and implementing converging algorithms remain for future investigation.
Advisors
Yi, Mun Yongresearcher이문용researcher
Description
한국과학기술원 :지식서비스공학대학원,
Publisher
한국과학기술원
Issue Date
2018
Identifier
325007
Language
eng
Description

학위논문(박사) - 한국과학기술원 : 지식서비스공학대학원, 2018.8,[vi, 73 p. :]

Keywords

Plausibility assessment▼atriple validation▼aparadigmatic similarity▼asemantic similarity▼apredicate inference▼ainstance-based learning▼adistant supervision▼abackground knowledge▼aknowledge base generation and population▼ainformation extraction; 타당성 평가▼a트리플 검증▼a계열적 유사도▼a의미적 유사도▼a술어 추론▼a인스턴스기반 학습▼a원거리지도 학습▼a백그라운드 지식▼a지식베이스 생성 및 활성화▼a정보 추출

URI
http://hdl.handle.net/10203/265386
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=828241&flag=dissertation
Appears in Collection
KSE-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0