Homoglyphs restoration with deep learning - focus on optical character recognition -딥러닝을 활용한 호모글리프 복원 - 광학 문자 인식을 중심으로 -

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 129
  • Download : 0
Homoglyphs are a shape that is difficult to distinguish visually because it is similar or identical. Because of this characteristic, attackers use them for phishing, causing serious problems. In this paper, we deal with the countermeasures against a new type of attacks through homoglyphs. Existing homoglyph attacks consist of characters or words, and so do the corresponding countermeasures. However, since the new type contains multiple homoglyphs in the text of the scam email, a new countermeasure was needed. To this end, we present a natural language processing technique that utilizes sequence information and a method that utilizes both visual elements and sequence information compared to existing methods for restoring homoglyphs. We use accuracy and false-positive rate (FPR) as evaluation criteria to compare the existing methods with the newly proposed methods. Through comparison using multiple evaluation criteria, we show that the method using both visual judgment and sequence information converts the homoglyph most accurately.
Advisors
Shin, Seungwonresearcher신승원researcher
Description
한국과학기술원 :정보보호대학원,
Publisher
한국과학기술원
Issue Date
2022
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 정보보호대학원, 2022.8,[iv, 31 p. :]

Keywords

Homoglyphs▼aCyber threats▼aOptical character recognition▼aNatural language processing; 호모글리프▼a사이버 위협▼a광학 문자 인식▼a자연어처리

URI
http://hdl.handle.net/10203/309621
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1008423&flag=dissertation
Appears in Collection
IS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0