Detecting textual adversarial examples through text modification on text classification systems

Cited 2 time in webofscience Cited 0 time in scopus
  • Hit : 70
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorKwon, Hyunko
dc.contributor.authorLee, Sanghyunko
dc.date.accessioned2023-09-07T02:01:31Z-
dc.date.available2023-09-07T02:01:31Z-
dc.date.created2023-03-27-
dc.date.issued2023-08-
dc.identifier.citationAPPLIED INTELLIGENCE, v.53, no.16, pp.19161 - 19185-
dc.identifier.issn0924-669X-
dc.identifier.urihttp://hdl.handle.net/10203/312291-
dc.description.abstractIn this paper, we propose a method for detecting adversarial examples using a text modification module. The proposed method detects adversarial examples based on the change in classification result that occurs when a sample is modified by arbitrarily changing a specific word to a similar word. The method exploits the fact that the adversarial example's sensitivity to changes to specific words is greater than that of the original sample. Experiments were conducted with three datasets (AG's News, a movie review dataset, and the IMDB Large Movie Review Dataset), and TensorFlow was used as a machine learning library. In the experiment using these datasets, the proposed method detected an average of 71.7% of the adversarial sentences while minimizing the change in the results given by the model for the original sentences to an average of 2.9%.-
dc.languageEnglish-
dc.publisherSPRINGER-
dc.titleDetecting textual adversarial examples through text modification on text classification systems-
dc.typeArticle-
dc.identifier.wosid000939085700001-
dc.identifier.scopusid2-s2.0-85148632137-
dc.type.rimsART-
dc.citation.volume53-
dc.citation.issue16-
dc.citation.beginningpage19161-
dc.citation.endingpage19185-
dc.citation.publicationnameAPPLIED INTELLIGENCE-
dc.identifier.doi10.1007/s10489-022-03313-w-
dc.contributor.localauthorLee, Sanghyun-
dc.contributor.nonIdAuthorKwon, Hyun-
dc.description.isOpenAccessN-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorDeep neural network-
dc.subject.keywordAuthorText domain-
dc.subject.keywordAuthorMachine learning security-
dc.subject.keywordAuthorDefense method-
Appears in Collection
RIMS Journal Papers
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 2 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0