Scene text recognition using part-based character models (Korean title, translated: A study on text recognition in natural images using models of local character features)

Understanding scene images has attracted considerable attention, and much research has addressed the problem through subproblems such as object detection, object recognition, and scene segmentation. Text is among the most informative content in a scene image. Scene text recognition is the problem of recognizing text in scene images taken in an unconstrained manner. Many scene text recognition methods have been proposed, but most of them use character models only in the character recognition phase, the last stage of the process. Earlier phases such as text detection and text extraction use only abstracted features of text regions, which may cause loss of information. In this thesis, we propose a novel scene text recognition method that uses concrete models of the target characters from the beginning to the end of the recognition process. Each character in the target set is modeled with a part-based object model called the implicit shape model (ISM) to achieve robustness to partial degradation of characters. To this end, we train a Hough forest that localizes character parts and casts probabilistic votes for possible character positions. The votes are aggregated in voting spaces via the generalized Hough transform, and character candidates are detected at the local maxima of each voting space. The detected character candidates are verified by organizing them into the most plausible text lines in a semi-Markov conditional random field (semi-CRF) framework, where the optimal configuration can be found efficiently using dynamic programming. Because concrete character models are used throughout the process, even severely deformed text, which previous approaches can hardly detect, is detected and recognized.
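The voting step described in the abstract can be illustrated with a minimal sketch. This is not the thesis's implementation: it assumes part detections are already available (in the thesis a trained Hough forest produces them), and the names `accumulate_votes`, `local_maxima`, `bin_size`, and `threshold` are hypothetical, chosen only for this example.

```python
from collections import defaultdict


def accumulate_votes(part_detections, bin_size=4):
    """Aggregate weighted votes for character centers into a coarse grid.

    part_detections: list of (x, y, votes), where votes is a list of
    (dx, dy, weight) offsets from the part position to a possible
    character center. This input format is an assumption of the sketch.
    """
    space = defaultdict(float)
    for x, y, votes in part_detections:
        for dx, dy, weight in votes:
            cx, cy = x + dx, y + dy
            # Quantize the voted center into a bin of the voting space.
            space[(int(cx // bin_size), int(cy // bin_size))] += weight
    return space


def local_maxima(space, threshold):
    """Return bins whose score reaches threshold and is not exceeded by
    any of their 8 neighbors, sorted by descending score. Tied adjacent
    bins are all reported."""
    peaks = []
    for (bx, by), score in space.items():
        if score < threshold:
            continue
        neighbors = [
            space.get((bx + i, by + j), 0.0)
            for i in (-1, 0, 1)
            for j in (-1, 0, 1)
            if (i, j) != (0, 0)
        ]
        if all(score >= n for n in neighbors):
            peaks.append(((bx, by), score))
    return sorted(peaks, key=lambda p: -p[1])


# Three hypothetical parts agree on a center near (10, 10);
# one stray low-weight vote lands elsewhere.
parts = [
    (6, 10, [(4, 0, 0.5)]),
    (14, 10, [(-4, 0, 0.5)]),
    (10, 6, [(0, 4, 0.5), (20, 20, 0.1)]),
]
space = accumulate_votes(parts, bin_size=4)
candidates = local_maxima(space, threshold=1.0)
```

In this toy input the three consistent votes accumulate in one bin, while the stray vote stays below the threshold. A full system would keep one voting space per character class and scale, which this sketch omits.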
Advisors
Kim, Jin-Hyung (김진형)
Description
Korea Advanced Institute of Science and Technology (KAIST) : Department of Computer Science
Publisher
Korea Advanced Institute of Science and Technology (KAIST)
Issue Date
2014
Identifier
568607/325007  / 020085275
Language
eng
Description

Thesis (Ph.D.) - Korea Advanced Institute of Science and Technology (KAIST) : Department of Computer Science, 2014.2, [ vi, 70 p. ]

Keywords

Text recognition; character recognition; part-based character model; Implicit shape model; Hough forest

URI
http://hdl.handle.net/10203/197819
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=568607&flag=dissertation
Appears in Collection
CS-Theses_Ph.D. (doctoral theses)
Files in This Item
There are no files associated with this item.
