FinePrompt: Unveiling the Role of Finetuned Inductive Bias on Compositional Reasoning in GPT-4

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 116
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorKim, Jeonghwanko
dc.contributor.authorHong, Giwonko
dc.contributor.authorMyaeng, Sung-Hyonko
dc.contributor.authorWhang, Joyce Jiyoungko
dc.date.accessioned2023-12-27T00:00:59Z-
dc.date.available2023-12-27T00:00:59Z-
dc.date.created2023-12-24-
dc.date.created2023-12-24-
dc.date.issued2023-12-08-
dc.identifier.citationThe 2023 Conference on Empirical Methods in Natural Language Processing, pp.3763 - 3775-
dc.identifier.urihttp://hdl.handle.net/10203/316862-
dc.description.abstractCompositional reasoning across texts has been a long-standing challenge in natural language processing. With large language models like GPT-4 taking over the field, prompting techniques such as chain-of-thought (CoT) were proposed to unlock compositional, multi-step reasoning capabilities of LLMs. Despite their success, the prompts demand significant human effort to discover and validate them. Our work draws attention to the idea of transferring task-specific inductive biases from finetuned models to prompts, as a way of improving GPT-4’s compositional reasoning capabilities. To leverage these inductive biases, we formulate prompt templates to ease the transfer of inductive biases. The experimental results on multi-hop question answering and numerical reasoning over text show that our proposed prompt scheme shows competitive zero-shot and few-shot performances compared to existing prompts on complicated reasoning tasks, highlighting the importance of adopting the validated biases of the previous paradigm.-
dc.languageEnglish-
dc.publisherAssociation for Computational Linguistics-
dc.titleFinePrompt: Unveiling the Role of Finetuned Inductive Bias on Compositional Reasoning in GPT-4-
dc.typeConference-
dc.type.rimsCONF-
dc.citation.beginningpage3763-
dc.citation.endingpage3775-
dc.citation.publicationnameThe 2023 Conference on Empirical Methods in Natural Language Processing-
dc.identifier.conferencecountrySI-
dc.identifier.conferencelocationResorts World Convention Centre, Singapore-
dc.identifier.doi10.18653/v1/2023.findings-emnlp.245-
dc.contributor.localauthorMyaeng, Sung-Hyon-
dc.contributor.localauthorWhang, Joyce Jiyoung-
dc.contributor.nonIdAuthorKim, Jeonghwan-
dc.contributor.nonIdAuthorHong, Giwon-
Appears in Collection
CS-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0