DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | 차미영 | - |
dc.contributor.advisor | Cha, Meeyoung | - |
dc.contributor.advisor | 김란우 | - |
dc.contributor.author | Myung, Jaehyeon | - |
dc.contributor.author | 명재현 | - |
dc.date.accessioned | 2024-07-30T19:31:43Z | - |
dc.date.available | 2024-07-30T19:31:43Z | - |
dc.date.issued | 2024 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1097250&flag=dissertation | en_US |
dc.identifier.uri | http://hdl.handle.net/10203/321670 | - |
dc.description | Thesis (Master's) - Korea Advanced Institute of Science and Technology: School of Computing, 2024.2, [iv, 27 p.] | - |
dc.description.abstract | Large Language Models (LLMs) store knowledge learned from vast amounts of text data. As models grow in parameter count and are trained on ever-larger datasets, the likelihood that LLMs unintentionally memorize personal information has increased. In response, various studies have proposed methods to prevent LLMs from generating outputs that contain personal information. Despite these efforts, approaches that directly delete already-learned information from the model are increasingly needed, as attack techniques continue to advance alongside privacy defenses. Most prior work on information deletion relies on fine-tuning, in which specific facts are repeatedly trained as irrelevant information so that the model stops producing outputs containing them. However, this approach is difficult to adapt to user requests for personal data deletion and consumes substantial computing resources. This study presents an effective method for deleting personal information from large language models. First, we analyze how strongly personal information stored in an LLM activates the transformer network during output generation. We then examine how the number of training iterations affects the activation level of the transformer network, exploring the potential for more precise updates to the model's parameters. Finally, we confirm that low-frequency fine-tuning deletes information more effectively than traditional fine-tuning approaches. The proposed method can be applied in services that must respond quickly to many small-scale personal data deletion requests, even with limited computing resources. All code and data related to the methods and experiments in this thesis will be made publicly available. | - |
dc.language | eng | - |
dc.publisher | Korea Advanced Institute of Science and Technology (KAIST) | - |
dc.subject | 자연 언어 처리; 거대 언어 모델; 모델 학습 해제; 트랜스포머 모델 | - |
dc.subject | Natural language processing; Large language model; Model unlearning; Transformer model | - |
dc.title | Theoretical study on leveraging privacy of pretrained large language model by direct model editing | - |
dc.title.alternative | 직접 모델 수정을 통한 사전 학습된 거대 언어 모델 내부 개인정보의 삭제 방안 | - |
dc.type | Thesis(Master) | - |
dc.identifier.CNRN | 325007 | - |
dc.description.department | Korea Advanced Institute of Science and Technology: School of Computing | - |
dc.contributor.alternativeauthor | Kim, Lanu | - |
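The abstract describes deleting a memorized fact by fine-tuning against it at low intensity rather than through repeated heavy retraining. The minimal sketch below illustrates that idea only in spirit, not the thesis's actual method: it assumes a hypothetical toy "language model" (a bigram logit table) and implements unlearning as a few small gradient-ascent steps on the negative log-likelihood of one memorized token pair, so that pair becomes less likely while the rest of the table is barely disturbed.

```python
import numpy as np

# Toy stand-in for an LLM: a bigram model whose "parameters" are a
# table of logits W[prev, next]; P(next | prev) = softmax(W[prev]).
# (Assumed setup for illustration; the thesis operates on transformer
# parameters, not a lookup table.)

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def unlearn_bigram(W, prev, nxt, lr=0.5, steps=10):
    """Make the memorized pair (prev, nxt) less likely via a few
    small gradient-ascent steps on its negative log-likelihood."""
    W = W.copy()
    for _ in range(steps):
        p = softmax(W[prev])
        grad = -p              # d(log P[nxt]) / dW[prev] = onehot - p
        grad[nxt] += 1.0
        W[prev] -= lr * grad   # move *against* the log-likelihood
    return W

rng = np.random.default_rng(0)
W = rng.normal(size=(5, 5))            # random toy model
before = softmax(W[2])[3]              # P(3 | 2) before unlearning
W2 = unlearn_bigram(W, prev=2, nxt=3)  # "delete" the pair (2, 3)
after = softmax(W2[2])[3]              # P(3 | 2) after unlearning
print(after < before)                  # the pair is now less probable
```

Because only the row `W[prev]` is updated and the learning rate is small, the edit is cheap and localized, which is the property the abstract attributes to low-frequency fine-tuning for handling many small deletion requests.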