Multimodal representation: neural language model based on Kneser-Ney smoothing / skip-gram

This thesis considers a multimodal representation that associates image features with text, such that the conditional probability of the next word given the past n words and the image features is defined by a neural language model, and applies it to image retrieval and text generation. In contrast to previous representations, ours is learned to address data sparsity, which has degraded the evaluation of neural language models. Specifically, Kneser-Ney smoothing and skip-gram techniques are each integrated into a multimodal neural language model, e.g., the modality-biased log-bilinear model. As a result, next-word prediction based on the conditional probability yields better contextual consistency within one unit of each modality, i.e., one sentence or one image, while the correspondence between image and text is also enhanced. The representation is validated on the IAPR TC-12 and Attribute Discovery datasets for image retrieval and text generation, showing improved perplexity and BLEU-n scores and effective shared representation learning.
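For context, the following is a minimal sketch of the two ingredients the abstract refers to, assuming the standard modality-biased log-bilinear (MLBL-B) formulation and interpolated Kneser-Ney smoothing; the exact way the thesis combines them (and how the skip-gram objective enters training) is not reproduced here.

% Sketch under assumptions: MLBL-B next-word distribution with an image-feature bias,
% where r_w are word embeddings, x is the image feature, and C_i, C_m are learned matrices.
\[
\hat{r} \;=\; \sum_{i=1}^{n} C_i\, r_{w_{t-i}} \;+\; C_m\, x,
\qquad
P_{\mathrm{NLM}}(w_t = w \mid w_{t-n}, \ldots, w_{t-1}, x)
\;=\;
\frac{\exp\!\bigl(\hat{r}^{\top} r_w + b_w\bigr)}
     {\sum_{w'} \exp\!\bigl(\hat{r}^{\top} r_{w'} + b_{w'}\bigr)}
\]
% Sketch under assumptions: interpolated Kneser-Ney smoothing over the same context,
% with counts c(.), absolute discount d, normalizer \lambda, and continuation probability P_cont.
\[
P_{\mathrm{KN}}(w \mid w_{t-n}, \ldots, w_{t-1})
\;=\;
\frac{\max\bigl(c(w_{t-n}, \ldots, w_{t-1}, w) - d,\, 0\bigr)}{c(w_{t-n}, \ldots, w_{t-1})}
\;+\;
\lambda(w_{t-n}, \ldots, w_{t-1})\; P_{\mathrm{cont}}(w)
\]

In this reading, the neural term supplies the shared image-text representation while the Kneser-Ney term counters n-gram data sparsity; the interpolation weights and the role of the skip-gram technique are specified in the thesis itself.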
Advisors
Yoo, Chang Dong (유창동)
Description
Korea Advanced Institute of Science and Technology: School of Electrical Engineering
Publisher
Korea Advanced Institute of Science and Technology (KAIST)
Issue Date
2016
Identifier
325007
Language
eng
Description

Master's thesis - Korea Advanced Institute of Science and Technology: School of Electrical Engineering, 2016.2, [iv, 24 p.]

Keywords

Multimodal Representation; Neural Language Model; Kneser-Ney; Skip-gram; Image Retrieval; Image Query; Text Generation

URI
http://hdl.handle.net/10203/221766
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=649621&flag=dissertation
Appears in Collection
EE-Theses_Master (Master's theses)
Files in This Item
There are no files associated with this item.
