DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Kim, Jin-Hyung | - |
dc.contributor.advisor | 김진형 | - |
dc.contributor.author | Seok, Jae-Hyun | - |
dc.contributor.author | 석재현 | - |
dc.date.accessioned | 2015-04-23T08:30:35Z | - |
dc.date.available | 2015-04-23T08:30:35Z | - |
dc.date.issued | 2014 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=568607&flag=dissertation | - |
dc.identifier.uri | http://hdl.handle.net/10203/197819 | - |
dc.description | 학위논문(박사) - 한국과학기술원 : 전산학과, 2014.2, [ vi, 70 p. ] | - |
dc.description.abstract | Understanding scene images has attracted considerable attentions, and there have been many researches to solve the problem in the form of subproblems such as object detection, object recognition, and scene segmentation. Text in scene images is one of the most informative contents to understand the images. Scene text recognition is the problem of recognizing text in scene images taken in unconstrained manner. Many researches on scene text recognition have been proposed, but most of them utilize character models only in character recognition phase, the last stage of the process. In former phases such as text detection and text extraction, only abstracted features of text regions are used, which might cause loss of information. In this thesis, we propose a novel scene text recognition method which fully utilizes concrete models of target characters from the beginning to the end of the recognition process. Each of the target character set is modeled with a part-based object model called implicit shape model (ISM) to achieve robustness for partial degradation of characters. Towards this end, we trained a Hough forest which localizes character parts and casts probabilistic votes on possible positions of characters. The votes are aggregated in voting spaces via generalized Hough transform, and then character candidates are detected at the local maxima of the voting space. The detected character candidates are verified by organizing the most plausible text lines in a semi-Markov conditional random field (semi-CRF) framework where the optimal configuration can be efficiently found using dynamic programming. As concrete character models are utilized throughout the process, even extremely deformed text are detected and recognized, which are hardly detected with previous approaches. | eng |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | Text recognition | - |
dc.subject | 부분 기반 문자 모델 | - |
dc.subject | 허프 포레스트 | - |
dc.subject | 내포 형태 모델 | - |
dc.subject | 문자인식 | - |
dc.subject | part-based character | - |
dc.subject | Implicit shape model | - |
dc.subject | Hough forest | - |
dc.title | Scene text recognition using part-based character models | - |
dc.title.alternative | 문자의 지역적 특성 모델을 이용한 자연영상 내 문자인식 연구 | - |
dc.type | Thesis(Ph.D) | - |
dc.identifier.CNRN | 568607/325007 | - |
dc.description.department | 한국과학기술원 : 전산학과, | - |
dc.identifier.uid | 020085275 | - |
dc.contributor.localauthor | Kim, Jin-Hyung | - |
dc.contributor.localauthor | 김진형 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.