(A) hierarchical probabilistic framework for scene text separation영상 내 텍스트 분할을 위한 확률적 계층 프레임워크에 관한 연구

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 578
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorKim, Jin-Hyung-
dc.contributor.advisor김진형-
dc.contributor.authorKwon, Young-Hee-
dc.contributor.author권영희-
dc.date.accessioned2011-12-13T05:26:56Z-
dc.date.available2011-12-13T05:26:56Z-
dc.date.issued2009-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=309342&flag=dissertation-
dc.identifier.urihttp://hdl.handle.net/10203/33266-
dc.description학위논문(박사) - 한국과학기술원 : 전산학전공, 2009.2, [ vi, 53 p. ]-
dc.description.abstractAutomated image understanding could be useful so that extracted information could be used in further ways. Since text has been the most significant information medium for the mankind, an automatic system that reads text has been one of the most popular research targets. However, when text reading is applied to $\textit{scene images}$ taken by digital cameras, we meet a different class of text from traditional ones, aka $\textit{scene text}$, which shows highly various characteristics. A common process to recognize text from an image is first to separate text pixels from background, and then recognize the text-only image. Separating text pixels from scene images remains an open problem, since scene images have much variation and properties of text such as position, size, and color are not constrained. In this research, we cast the scene text separation problem into probabilistic labeling in which we yield the labels which maximize its conditional probability given the image. Based on the hierarchical model of scene text with four layers as image - text line - stroke - label, we provide pixel-oriented representation of objects in each layer to build a random field for describing possibilities of objects in the layer. According to the hierarchy, the proposed hierarchical framework decomposes the labeling problem into three sub-problems as one about text lines, one about strokes, and one about labels, where two formers are described using the Kernel Ridge Regression (KRR), and the latter using the Conditional Random Field (CRF). The optimal labels are identified through a stochastic gradient method. Our framework is built learnable so that the parameters in it can be trained to improve the performance. The experimental results showed the promising performance of our framework.eng
dc.languageeng-
dc.publisher한국과학기술원-
dc.subjectconditional random field-
dc.subjectscene text-
dc.subjectscene text separation-
dc.subjectimage segmentation-
dc.subjecthierarchical model of scene text-
dc.subject조건부 랜덤 필드-
dc.subject영상내 텍스트-
dc.subject텍스트 분할-
dc.subject영상 분할-
dc.subject영상내 텍스트의 계층적 모델-
dc.subjectconditional random field-
dc.subjectscene text-
dc.subjectscene text separation-
dc.subjectimage segmentation-
dc.subjecthierarchical model of scene text-
dc.subject조건부 랜덤 필드-
dc.subject영상내 텍스트-
dc.subject텍스트 분할-
dc.subject영상 분할-
dc.subject영상내 텍스트의 계층적 모델-
dc.title(A) hierarchical probabilistic framework for scene text separation-
dc.title.alternative영상 내 텍스트 분할을 위한 확률적 계층 프레임워크에 관한 연구-
dc.typeThesis(Ph.D)-
dc.identifier.CNRN309342/325007 -
dc.description.department한국과학기술원 : 전산학전공, -
dc.identifier.uid020025803-
dc.contributor.localauthorKim, Jin-Hyung-
dc.contributor.localauthor김진형-
Appears in Collection
CS-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0