Relationship-oriented qualification scheme for data objects in automated data modeling = 데이터 모델링 자동화를 위한 관계 중심 데이터 객체 판별 기법

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 191
  • Download : 0
Data has become a substantial source of corporate competitive advantage, as information technology dramatically has changed industry structure and market. The data model is the foundation stone for companies to strategically manage and utilize their data. The existing data model is written for technical purposes to develop and operate corporate database, which makes the data model isolated from field users. The intervention of data designer without user engagement induces misinterpretation of data requirements and consumes time and cost for data modeling. Automated data modeling research has been actively conducted to enable the users to take a proactive role in data modeling so that companies can leverage data more agile. The data modeling system needs to automate the process of data object extraction and qualification performed by experts. For decades, knowledge-based and rule-based research has been conducted to extract and identify data objects. However, these studies have been unable to incorporate agile business requirements into the data model due to relying heavily on previous results. Moreover, the existing systems have limitations in field applicability because the systems are semi - automated methods that qualify data objects interacting with users who do not have knowledge of data model. In this thesis, we propose a relationship-oriented data modeling automation (ROM) that fully automates data modeling from textual job descriptions freely created by field users without knowledge base construction that consumes a lot of time and money or any strict restrictions for job descriptions. ROM extracts object candidates from job descriptions, constructs a network including contextual information, and automatically qualify data objects by using relationship information between objects. ROM also exploits a domain corpus to eliminate the ambiguity of job descriptions. The domain corpus is constructed by transforming field vocabulary into context vectors using neural network language model. In the final data object qualification step, we use a discrete choice model including relational variables such as centrality and structural hole, which are computed in relation to each other in contextual network. In order to evaluate the applicability of the proposed ROM, we developed a pilot system as well. Experimental results have shown that ROM greatly improves the performance of data object qualification over conventional automation methods.
Advisors
Moon, Songchunresearcher문송천researcher
Description
한국과학기술원 :경영공학부,
Publisher
한국과학기술원
Issue Date
2018
Identifier
325007
Language
eng
Description

학위논문(박사) - 한국과학기술원 : 경영공학부, 2018.2,[iv, 86 p. :]

Keywords

data model▼aautomated data modeling▼acontextual network▼arecurrent neural network language model▼achoice model▼aobject qualification; 데이터 모델▼a데이터 모델링 자동화▼a컨텍스트 네트워크▼a순환 신경망 언어 모형▼a선택 모형▼a데이터 객체 판별

URI
http://hdl.handle.net/10203/264426
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=842550&flag=dissertation
Appears in Collection
MT-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0