(A) hierarchical probabilistic framework for scene text separation영상 내 텍스트 분할을 위한 확률적 계층 프레임워크에 관한 연구

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 577
  • Download : 0
Automated image understanding could be useful so that extracted information could be used in further ways. Since text has been the most significant information medium for the mankind, an automatic system that reads text has been one of the most popular research targets. However, when text reading is applied to $\textit{scene images}$ taken by digital cameras, we meet a different class of text from traditional ones, aka $\textit{scene text}$, which shows highly various characteristics. A common process to recognize text from an image is first to separate text pixels from background, and then recognize the text-only image. Separating text pixels from scene images remains an open problem, since scene images have much variation and properties of text such as position, size, and color are not constrained. In this research, we cast the scene text separation problem into probabilistic labeling in which we yield the labels which maximize its conditional probability given the image. Based on the hierarchical model of scene text with four layers as image - text line - stroke - label, we provide pixel-oriented representation of objects in each layer to build a random field for describing possibilities of objects in the layer. According to the hierarchy, the proposed hierarchical framework decomposes the labeling problem into three sub-problems as one about text lines, one about strokes, and one about labels, where two formers are described using the Kernel Ridge Regression (KRR), and the latter using the Conditional Random Field (CRF). The optimal labels are identified through a stochastic gradient method. Our framework is built learnable so that the parameters in it can be trained to improve the performance. The experimental results showed the promising performance of our framework.
Advisors
Kim, Jin-Hyungresearcher김진형researcher
Description
한국과학기술원 : 전산학전공,
Publisher
한국과학기술원
Issue Date
2009
Identifier
309342/325007  / 020025803
Language
eng
Description

학위논문(박사) - 한국과학기술원 : 전산학전공, 2009.2, [ vi, 53 p. ]

Keywords

conditional random field; scene text; scene text separation; image segmentation; hierarchical model of scene text; 조건부 랜덤 필드; 영상내 텍스트; 텍스트 분할; 영상 분할; 영상내 텍스트의 계층적 모델; conditional random field; scene text; scene text separation; image segmentation; hierarchical model of scene text; 조건부 랜덤 필드; 영상내 텍스트; 텍스트 분할; 영상 분할; 영상내 텍스트의 계층적 모델

URI
http://hdl.handle.net/10203/33266
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=309342&flag=dissertation
Appears in Collection
CS-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0