DSpace at KOASAS: (A) study on saliency-weighted LDA model for scene analysis

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Theses_Ph.D.(박사논문)

(A) study on saliency-weighted LDA model for scene analysis장면 분석을 위한 중요도 가중치 LDA 모델에 관한 연구

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 349
Download : 0

Export

Jeon, Jin

The bag-of-visual words (BoW) models have widely been studied for image classification in a computer vision area. However, since the BoW models are mostly based on histograms, they have a limitation in discovering the distributions of visual words within images for semantic scene analysis. Therefore, there has been an attempt to use the Latent Dirichlet Allocation (LDA) model for image scene classification by revealing the latent topic distributions as feature vectors for visual words. Based on the LDA model, each image is represented by word distributions with their latent topics, which can capture semantic regularities in the image. Many previous LDA models, however, are not capable of dealing with spatial information of visual words in images, especially visual saliency which is important in scene classification and understanding. In this dissertation, the LDA model is extended, which is called saliency-weighted LDA (swLDA), by accommodating the visual saliency into the topic distribution inference for visual words in order to capture a human’s perception characteristic that image classification is often performed with focus of attention on salient regions in images. For this, all training images are first divided into image patches which are then grouped into salient and non-salient regions based on saliency maps. Then, the topic distributions of visual words are learned with saliency weights of visual words in the salient and non-salient regions separately. During the training phase, these saliency weights are learned by the swLDA model for image scene classification, which are to be used in the testing phase. While the previous LDA models parameterize the topic distributions of visual words by a single topic distribution, our proposed model incorporates saliency maps to separate the input images into salient and non-salient regions for which their respective topic distributions are computed independently. In order to show the effectiveness of the swLDA model for image scene classification, we present experiment results which reveal that the swLDA model effectively incorporates visual saliency as focus of attention to mimic the human perception behavior and outperforms the previous LDA models in terms of image classification precision.

Advisors: Kim, Munchurl researcher; 김문철 researcher

Description: 한국과학기술원 :전기및전자공학부,

Publisher: 한국과학기술원

Issue Date: 2019

Identifier: 325007

Language: eng

Description: 학위논문(박사) - 한국과학기술원 : 전기및전자공학부, 2019.2,[v, 77 p. :]

Keywords: Latent dirichlet allocation▼ascene analysis▼aimage classification▼atopic distribution▼alatent topic; 잠재 디리클레 할당▼a장면 분석▼a이미지 분류▼a토픽 분포▼a잠재 토픽

URI: http://hdl.handle.net/10203/265138

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=842216&flag=dissertation

Appears in Collection: EE-Theses_Ph.D.(박사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

(A) study on saliency-weighted LDA model for scene analysis장면 분석을 위한 중요도 가중치 LDA 모델에 관한 연구

KOASAS

Communities & Collections