Cocktail Glass Network: Fast Depth Estimation Using Channel to Space Unrolling

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 210
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorYu, Jung-Jaeko
dc.contributor.authorKo, Jong-Gookko
dc.contributor.authorKim, Junmoko
dc.date.accessioned2021-09-08T07:50:44Z-
dc.date.available2021-09-08T07:50:44Z-
dc.date.created2021-09-08-
dc.date.created2021-09-08-
dc.date.created2021-09-08-
dc.date.created2021-09-08-
dc.date.issued2021-
dc.identifier.citationIEEE ACCESS, v.9, pp.114680 - 114689-
dc.identifier.issn2169-3536-
dc.identifier.urihttp://hdl.handle.net/10203/287679-
dc.description.abstractDepth-estimation from a single input image can be used in applications such as robotics and autonomous driving. Recently, depth-estimation networks with UNet encoder/decoder structures have been widely used. In these decoders, operations are repeated to gradually increase the image resolution, while decreasing the channel size. If the upsampling operation at a high magnification can be processed at once, the amount of computation in the decoder can be dramatically reduced. To achieve this, we propose a new network structure, i.e., a cocktail glass network. In this network, convolution layers in the decoder are reduced, and a novel fast upsampling method is used that is known as channel-to-space unrolling, which converts thick channel data into high-resolution data. The proposed method can be easily implemented using simple reshaping operations; therefore, it is suitable for reducing the depth-estimation network. Considering the experimental results based on the NYU V2 and KITTI datasets, we demonstrate that the proposed method reduces the amount of computation in the decoder by half, while maintaining the same level of accuracy; it can be used in both lightweight and large-model-capacity networks.-
dc.languageEnglish-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.titleCocktail Glass Network: Fast Depth Estimation Using Channel to Space Unrolling-
dc.typeArticle-
dc.identifier.wosid000688218400001-
dc.identifier.scopusid2-s2.0-85113262629-
dc.type.rimsART-
dc.citation.volume9-
dc.citation.beginningpage114680-
dc.citation.endingpage114689-
dc.citation.publicationnameIEEE ACCESS-
dc.identifier.doi10.1109/ACCESS.2021.3105136-
dc.contributor.localauthorKim, Junmo-
dc.contributor.nonIdAuthorKo, Jong-Gook-
dc.description.isOpenAccessY-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorDecoding-
dc.subject.keywordAuthorConvolution-
dc.subject.keywordAuthorImage resolution-
dc.subject.keywordAuthorThree-dimensional displays-
dc.subject.keywordAuthorTask analysis-
dc.subject.keywordAuthorLicenses-
dc.subject.keywordAuthorGlass-
dc.subject.keywordAuthorArtificial neural networks-
dc.subject.keywordAuthorimage processing-
dc.subject.keywordAuthordepth estimation-
dc.subject.keywordAuthormonocular image-
Appears in Collection
EE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0