DSpace at KOASAS: Emergence of multimodal action representations from neural network self-organization

Emergence of multimodal action representations from neural network self-organization

Parisi, German I. / Tani, Jun / Weber, Cornelius / Wermter, Stefan

The integration of multisensory information is crucial in autonomous robotics for forming robust and meaningful representations of the environment. In this work, we investigate how robust multimodal representations can naturally develop in a self-organizing manner from co-occurring multisensory inputs. We propose a hierarchical architecture with growing self-organizing neural networks for learning human actions from audiovisual inputs. The hierarchical processing of visual inputs yields progressively specialized neurons that encode latent spatiotemporal dynamics of the input, consistent with neurophysiological evidence for increasingly large temporal receptive windows in the human cortex. Associative links that bind unimodal representations are learned incrementally by a semi-supervised algorithm with bidirectional connectivity. Multimodal representations of actions are obtained from the co-activation of action features from video sequences and labels from automatic speech recognition. Experimental results on a dataset of 10 full-body actions show that our system achieves state-of-the-art classification performance without requiring the manual segmentation of training samples, and that congruent visual representations can be retrieved from recognized speech in the absence of visual stimuli. Together, these results show that our hierarchical neural architecture accounts for the development of robust multimodal representations from dynamic audiovisual inputs. © 2016 The Authors. Published by Elsevier B.V.
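The idea of a growing self-organizing network with associative label links can be illustrated with a toy sketch. This is not the authors' architecture or algorithm: the class name `GrowingLabelledMap`, the distance-based insertion rule, and the `insert_threshold` and `learning_rate` parameters are all assumptions made for illustration. The sketch only shows one plausible way a network might add nodes for novel inputs, count label co-occurrences per node when a label is available, and then use those counts in both directions (features to label, label to prototype).

```python
import numpy as np
from collections import Counter


class GrowingLabelledMap:
    """Toy growing self-organizing network (hypothetical, for illustration only)."""

    def __init__(self, insert_threshold=1.0, learning_rate=0.2):
        self.insert_threshold = insert_threshold  # assumed novelty threshold
        self.learning_rate = learning_rate        # assumed winner update rate
        self.weights = []                         # one prototype vector per node
        self.label_counts = []                    # one Counter of labels per node

    def _best_match(self, x):
        # Index of the closest prototype and its Euclidean distance to x.
        distances = [np.linalg.norm(x - w) for w in self.weights]
        index = int(np.argmin(distances))
        return index, distances[index]

    def _add_node(self, x):
        self.weights.append(x.copy())
        self.label_counts.append(Counter())
        return len(self.weights) - 1

    def train_step(self, x, label=None):
        """Present one feature vector; the label is optional (semi-supervised)."""
        x = np.asarray(x, dtype=float)
        if not self.weights:
            index = self._add_node(x)
        else:
            index, distance = self._best_match(x)
            if distance > self.insert_threshold:
                # Input is poorly represented: grow the network.
                index = self._add_node(x)
            else:
                # Input is familiar: move the winner toward it.
                self.weights[index] += self.learning_rate * (x - self.weights[index])
        if label is not None:
            # Associative link: count co-activation of this node and the label.
            self.label_counts[index][label] += 1
        return index

    def predict_label(self, x):
        """Features to label: most frequent label of the best-matching node."""
        if not self.weights:
            return None
        index, _ = self._best_match(np.asarray(x, dtype=float))
        counts = self.label_counts[index]
        return counts.most_common(1)[0][0] if counts else None

    def retrieve_prototype(self, label):
        """Label to features: prototype of the node most associated with label."""
        if not self.weights:
            return None
        best = max(range(len(self.weights)), key=lambda i: self.label_counts[i][label])
        if self.label_counts[best][label] == 0:
            return None
        return self.weights[best]
```

In this sketch, calling `train_step` with `label=None` still adapts the prototypes, which loosely mirrors learning from unlabelled samples, while `retrieve_prototype` mimics retrieving a visual representation from a recognized word alone.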

Publisher: ELSEVIER SCIENCE BV

Issue Date: 2017-06

Language: English

Article Type: Article

Keywords: SUPERIOR TEMPORAL SULCUS; MULTISENSORY INTEGRATION; AUDIOVISUAL INTEGRATION; INVERSE EFFECTIVENESS; ADULT NEUROGENESIS; RECEPTIVE WINDOWS; BIOLOGICAL MOTION; HUMAN BRAIN; PERCEPTION; CORTEX

Citation: COGNITIVE SYSTEMS RESEARCH, v.43, pp.208 - 221

ISSN: 1389-0417

DOI: 10.1016/j.cogsys.2016.08.002

URI: http://hdl.handle.net/10203/224711

Appears in Collection: EE-Journal Papers (Journal Papers)

Files in This Item: 000402461200018.pdf (1.24 MB)
