DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Kweon, In So | - |
dc.contributor.advisor | 권인소 | - |
dc.contributor.author | Kim, Dong-Jin | - |
dc.date.accessioned | 2018-06-20T06:21:16Z | - |
dc.date.available | 2018-06-20T06:21:16Z | - |
dc.date.issued | 2017 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=675355&flag=dissertation | en_US |
dc.identifier.uri | http://hdl.handle.net/10203/243250 | - |
dc.description | Thesis (Master's) - Korea Advanced Institute of Science and Technology (KAIST): School of Electrical Engineering, 2017.2, [v, 48 p.] | - |
dc.description.abstract | The success of deep neural networks on various image tasks is widely attributed to the availability of large amounts of annotated data. For video-related tasks, although various datasets exist, the number of annotated videos in any single dataset is still far smaller than in image datasets. In this paper, we leverage existing video datasets that have heterogeneous videos and annotations, so that a model can be trained while compensating for the limited size of a single dataset. Since the video data in each dataset carries heterogeneous annotations, traditional multi-task learning is not applicable in this scenario. To this end, we propose a simple alternating directional optimization method to learn efficiently from the heterogeneous data. We demonstrate the effectiveness of our model on both action recognition and caption embedding tasks. With our method, we show performance improvements on the action recognition task, and performance on the sentence retrieval task comparable to that of a model trained on single-task data. | - |
dc.language | eng | - |
dc.publisher | Korea Advanced Institute of Science and Technology (KAIST) | - |
dc.subject | Deep learning | - |
dc.subject | Action Recognition | - |
dc.subject | Visual semantic embedding | - |
dc.subject | Multi-task learning | - |
dc.subject | Machine learning | - |
dc.subject | 딥러닝 (Deep learning) | - |
dc.subject | 행동인식 (Action recognition) | - |
dc.subject | 세맨틱 임베딩 (Semantic embedding) | - |
dc.subject | 멀티태스크 학습 (Multi-task learning) | - |
dc.subject | 기계학습 (Machine learning) | - |
dc.title | Disjoint multi-task learning between heterogeneous action and caption data | - |
dc.title.alternative | 이형의 행동인식과 캡션 데이터 간의 멀티태스크 학습 기법 (A multi-task learning method between heterogeneous action recognition and caption data) | - |
dc.type | Thesis(Master) | - |
dc.identifier.CNRN | 325007 | - |
dc.description.department | Korea Advanced Institute of Science and Technology (KAIST): School of Electrical Engineering | - |
dc.contributor.alternativeauthor | 김동진 | - |