DSpace at KOASAS: A Robust Ensemble of ResNets for Character Level End-to-end Text Detection in Natural Scene Images

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Conference Papers(학술회의논문)

A Robust Ensemble of ResNets for Character Level End-to-end Text Detection in Natural Scene Images

Cited 0 time in webofscience

Cited 0 time in

Hit : 355
Download : 0

Export

DC Field	Value	Language
dc.contributor.author	Kim, Jinsu	ko
dc.contributor.author	Kim, Yoonhyung	ko
dc.contributor.author	Kim, Chang-Ick	ko
dc.date.accessioned	2017-06-20T01:58:33Z	-
dc.date.available	2017-06-20T01:58:33Z	-
dc.date.created	2017-06-16	-
dc.date.created	2017-06-16	-
dc.date.issued	2017-06-21	-
dc.identifier.citation	15th International Workshop on Content-Based Multimedia Indexing, CBMI 2017, pp.1 - 6	-
dc.identifier.uri	http://hdl.handle.net/10203/224123	-
dc.description.abstract	Detecting text in natural scene images is a challenging task. In this paper, we propose a character-level end-to-end text detection algorithm in natural scene images. In general, text detection tasks are categorized into three parts: text localization, text segmentation, and text recognition. The proposed method aims not only to localize but also to recognize text. To do these tasks successfully, the proposed method consists of four steps: character candidate patch extraction, patch classification using ensemble of ResNets, non character region elimination, and character region grouping via self-tuning spectral clustering. In the character candidate patch extraction step, character candidate patches are extracted from the image by using both edge information from multi-scale images and Maximally Stable Extremal Regions (MSERs). Then each patch is classified into either character patch or non-character patch by using the deep network that is composed of three ResNets with different hyper-parameters. Text regions are determined by filtering out non-character patches. In order to make further reduction of classification errors, character characteristics are employed to compensate classification results of the ensemble of ResNets. To evaluate the text detection performance, character regions are grouped via self-tuning spectral clustering. The proposed method shows competitive performance on the ICDAR 2013 dataset.	-
dc.language	English	-
dc.publisher	Association for Computing Machinery	-
dc.title	A Robust Ensemble of ResNets for Character Level End-to-end Text Detection in Natural Scene Images	-
dc.type	Conference	-
dc.identifier.wosid	000426964400010	-
dc.identifier.scopusid	2-s2.0-85030773861	-
dc.type.rims	CONF	-
dc.citation.beginningpage	1	-
dc.citation.endingpage	6	-
dc.citation.publicationname	15th International Workshop on Content-Based Multimedia Indexing, CBMI 2017	-
dc.identifier.conferencecountry	IT	-
dc.identifier.conferencelocation	Ospedale degli Innocenti Firenze	-
dc.identifier.doi	10.1145/3095713.3095724	-
dc.contributor.localauthor	Kim, Chang-Ick	-

Appears in Collection: EE-Conference Papers(학술회의논문)

Files in This Item: There are no files associated with this item.

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

A Robust Ensemble of ResNets for Character Level End-to-end Text Detection in Natural Scene Images

KOASAS

Communities & Collections