EHR-SeqSQL : A sequential Text-to-SQL dataset for interactively exploring electronic health recordsEHR-SeqSQL : 전자건강기록의 상호 작용적 탐색을 위한 순차 Text-to-SQL 데이터셋

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 2
  • Download : 0
Text-to-SQL parsing is a task that translates natural language into SQL, enabling users who are not database experts to retrieve information from databases using only natural language. There are several important yet under-explored objectives in this field: interactivity, compositionality, and efficiency. In this paper, we present EHR-SeqSQL, a sequential Text-to-SQL dataset for interactively exploring Electronic Health Record (EHR) databases. We demonstrate the benefits of multi-turn setting over single-turn setting with respect to compositionality, and provide a new data split and an additional test set to evaluate compositional generalization. Furthermore, we introduce unique special tokens in SQL queries to enhance execution efficiency. This study represents the first attempt in the Text-to-SQL parsing field to simultaneously consider interactivity, compositionality, and efficiency, aiming to narrow the gap between industrial demands and academic research.
Advisors
최윤재researcher
Description
한국과학기술원 :김재철AI대학원,
Publisher
한국과학기술원
Issue Date
2024
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 김재철AI대학원, 2024.2,[iii, 28 p. :]

Keywords

전자건강기록▼a다중 턴 Text-to-SQL▼a문맥적 시맨틱 파싱▼a질의응답▼a구성성; Electronic Health Record(EHR)▼aMulti-turn Text-to-SQL▼aSemantic parsing in context▼aQuestion answering▼aCompositionality

URI
http://hdl.handle.net/10203/321373
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1096078&flag=dissertation
Appears in Collection
AI-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0