SQuARe: A Large-Scale Dataset of Sensitive Questions and Acceptable Responses Created through Human-Machine Collaboration

DC Field | Value | Language
dc.contributor.author | Lee, Hwaran | ko
dc.contributor.author | Hong, Seokhee | ko
dc.contributor.author | Park, Joonsuk | ko
dc.contributor.author | Kim, Takyoung | ko
dc.contributor.author | Cha, Meeyoung | ko
dc.contributor.author | Choi, Yejin | ko
dc.contributor.author | Kim, Byoung Pil | ko
dc.contributor.author | Kim, Gunhee | ko
dc.contributor.author | Lee, Eun Ju | ko
dc.contributor.author | Lim, Yong | ko
dc.contributor.author | Oh, Alice Haeyun | ko
dc.contributor.author | Park, Sangchul | ko
dc.contributor.author | Ha, Jung Woo | ko
dc.date.accessioned | 2023-11-14T08:00:45Z | -
dc.date.available | 2023-11-14T08:00:45Z | -
dc.date.created | 2023-11-14 | -
dc.date.issued | 2023-07 | -
dc.identifier.citation | The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), pp. 6692 - 6712 | -
dc.identifier.uri | http://hdl.handle.net/10203/314631 | -
dc.description.abstract | The potential social harms that large language models pose, such as generating offensive content and reinforcing biases, are steeply rising. Existing works focus on coping with this concern while interacting with ill-intentioned users, such as those who explicitly make hate speech or elicit harmful responses. However, discussions on sensitive issues can become toxic even if the users are well-intentioned. For safer models in such scenarios, we present the Sensitive Questions and Acceptable Response (SQuARe) dataset, a large-scale Korean dataset of 49k sensitive questions with 42k acceptable and 46k non-acceptable responses. The dataset was constructed leveraging HyperCLOVA in a human-in-the-loop manner based on real news headlines. Experiments show that acceptable response generation significantly improves for HyperCLOVA and GPT-3, demonstrating the efficacy of this dataset. | -
dc.publisher | Association for Computational Linguistics | -
dc.title | SQuARe: A Large-Scale Dataset of Sensitive Questions and Acceptable Responses Created through Human-Machine Collaboration | -
dc.type | Conference | -
dc.identifier.scopusid | 2-s2.0-85173753754 | -
dc.type.rims | CONF | -
dc.citation.beginningpage | 6692 | -
dc.citation.endingpage | 6712 | -
dc.citation.publicationname | The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023) | -
dc.identifier.conferencecountry | CA | -
dc.identifier.conferencelocation | Toronto | -
dc.contributor.localauthor | Cha, Meeyoung | -
dc.contributor.localauthor | Oh, Alice Haeyun | -
dc.contributor.nonIdAuthor | Lee, Hwaran | -
dc.contributor.nonIdAuthor | Hong, Seokhee | -
dc.contributor.nonIdAuthor | Park, Joonsuk | -
dc.contributor.nonIdAuthor | Kim, Takyoung | -
dc.contributor.nonIdAuthor | Choi, Yejin | -
dc.contributor.nonIdAuthor | Kim, Gunhee | -
dc.contributor.nonIdAuthor | Lee, Eun Ju | -
dc.contributor.nonIdAuthor | Lim, Yong | -
dc.contributor.nonIdAuthor | Park, Sangchul | -
dc.contributor.nonIdAuthor | Ha, Jung Woo | -
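
The abstract in the record above describes SQuARe as sensitive questions paired with acceptable and non-acceptable responses. As a purely illustrative aid, the following is a minimal Python sketch of how records from a SQuARe-style release might be loaded and split by their acceptability label. The JSONL layout and the field names ("question", "response", "acceptable") are assumptions made for this sketch, not the dataset's published schema, and the actual SQuARe questions and responses are in Korean.

# Minimal sketch: split SQuARe-style records by acceptability label.
# ASSUMPTION: the JSONL layout and the field names "question", "response",
# and "acceptable" are hypothetical; the released dataset's schema may differ.
import json

sample_jsonl = "\n".join([
    '{"question": "Should the state regulate AI research?", "response": "There are reasonable arguments on both sides; policies differ by country.", "acceptable": true}',
    '{"question": "Should the state regulate AI research?", "response": "Anyone who disagrees with regulation is a fool.", "acceptable": false}',
])

def split_by_acceptability(lines):
    """Group responses to sensitive questions by their acceptability label."""
    acceptable, non_acceptable = [], []
    for line in lines:
        if not line.strip():
            continue  # skip blank lines
        record = json.loads(line)
        bucket = acceptable if record["acceptable"] else non_acceptable
        bucket.append(record)
    return acceptable, non_acceptable

ok, not_ok = split_by_acceptability(sample_jsonl.splitlines())
print(f"{len(ok)} acceptable / {len(not_ok)} non-acceptable responses")

On a full release, the same split would be expected to recover counts on the order of those reported in the abstract (42k acceptable and 46k non-acceptable responses).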
Appears in Collection
CS-Conference Papers (학술회의논문)
Files in This Item
There are no files associated with this item.
