Improving performance of quantitative question answering by reducing distraction in argument recognition인수 인식에서의 산만 감소를 통한 정량적 질의응답 성능 향상

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 27
  • Download : 0
Quantitative question answering, which utilizes long-form documents containing tables and textual data, is being actively studied in finance. To deal with long-form documents, such as financial statements, an architecture combining a retriever and a generator is typically used. The retriever finds evidence sentences in a given document, while the generator recognizes the proper arguments and produces an answer program. However, argument recognition suffers from a distraction problem since evidence sentences retrieved from financial statements contain many numerical data that could be candidate arguments. To address this problem, it is necessary to supervise which of the candidate arguments are required to the answer program during the generator's training process. In this paper, we propose an approach for training a generator in argument recognition by focusing on the probabilities in a candidate generation so that the arguments comprising the ground-truth have higher weights. The proposed approach consists of an argument aggregator to model the probabilities in each candidate generation, and an argument set loss to compute the cross-entropy between that probabilities and the candidates' existence in the ground-truth in terms of the argument set. In our experiments, we show performance improvements of 3.62% and 3.98% in execution accuracy and program accuracy, respectively, over the existing FinQANet model based on a financial quantitative question answering dataset. Also, we observed that the similarity of argument sets between the generated program and the ground truth improved by about 2.9%, indicating a mitigation of the distraction problem.
Advisors
최호진researcher
Description
한국과학기술원 :전산학부,
Publisher
한국과학기술원
Issue Date
2024
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전산학부, 2024.2,[iv, 36 p. :]

Keywords

Quantitative question answering▼aDistraction problem▼aFinancial domain▼aMathematical reasoning▼aHybrid data; 정량적 질의 응답▼a산만 문제▼a금융 도메인▼a수학적 추론▼a혼합형 데이터

URI
http://hdl.handle.net/10203/321781
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1097306&flag=dissertation
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0