(A) study on saliency-weighted LDA model for scene analysis장면 분석을 위한 중요도 가중치 LDA 모델에 관한 연구

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 349
  • Download : 0
The bag-of-visual words (BoW) models have widely been studied for image classification in a computer vision area. However, since the BoW models are mostly based on histograms, they have a limitation in discovering the distributions of visual words within images for semantic scene analysis. Therefore, there has been an attempt to use the Latent Dirichlet Allocation (LDA) model for image scene classification by revealing the latent topic distributions as feature vectors for visual words. Based on the LDA model, each image is represented by word distributions with their latent topics, which can capture semantic regularities in the image. Many previous LDA models, however, are not capable of dealing with spatial information of visual words in images, especially visual saliency which is important in scene classification and understanding. In this dissertation, the LDA model is extended, which is called saliency-weighted LDA (swLDA), by accommodating the visual saliency into the topic distribution inference for visual words in order to capture a human’s perception characteristic that image classification is often performed with focus of attention on salient regions in images. For this, all training images are first divided into image patches which are then grouped into salient and non-salient regions based on saliency maps. Then, the topic distributions of visual words are learned with saliency weights of visual words in the salient and non-salient regions separately. During the training phase, these saliency weights are learned by the swLDA model for image scene classification, which are to be used in the testing phase. While the previous LDA models parameterize the topic distributions of visual words by a single topic distribution, our proposed model incorporates saliency maps to separate the input images into salient and non-salient regions for which their respective topic distributions are computed independently. In order to show the effectiveness of the swLDA model for image scene classification, we present experiment results which reveal that the swLDA model effectively incorporates visual saliency as focus of attention to mimic the human perception behavior and outperforms the previous LDA models in terms of image classification precision.
Advisors
Kim, Munchurlresearcher김문철researcher
Description
한국과학기술원 :전기및전자공학부,
Publisher
한국과학기술원
Issue Date
2019
Identifier
325007
Language
eng
Description

학위논문(박사) - 한국과학기술원 : 전기및전자공학부, 2019.2,[v, 77 p. :]

Keywords

Latent dirichlet allocation▼ascene analysis▼aimage classification▼atopic distribution▼alatent topic; 잠재 디리클레 할당▼a장면 분석▼a이미지 분류▼a토픽 분포▼a잠재 토픽

URI
http://hdl.handle.net/10203/265138
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=842216&flag=dissertation
Appears in Collection
EE-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0