Software-hardware co-designed similarity search engine for online image-text retrieval system온라인 이미지-텍스트 검색 시스템을 위한 소프트웨어-하드웨어 공동 설계를 통한 유사성 검색 엔진

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 61
  • Download : 0
Thanks to enhancing image-text retrieval(ITR) application based on cross-modal retrieval, the application's latency is reduced by extracting feature embeddings of image and text offline. However, due to the similarity search that is the application's bottleneck, it is still not feasible to service online ITR according to our analysis of ITR workloads on GPU. In this paper, we propose a novel software-hardware design to accelerate the similarity search and implement it on a Xilinx Alveo U280 card. We reduce the dataset by 92.4% through quantizing embedding dataset from 32-bit floating point to 8-bit fixed point and reconstructing sparse text embedding matrices to be dense. Our reconstructed dataset searching algorithm is implemented as a 4-stage pipeline and leverages our custom dataflow, which minimizes off-chip data transfer. We achieve up to 214.5x and 8.3x faster and up to 264.2x and 41.7x more energy-efficient than the baseline and optimized GPU design, respectively, on the MS-COCO 5K dataset.
Advisors
Kim, Joo-Youngresearcher김주영researcher
Description
한국과학기술원 :전기및전자공학부,
Publisher
한국과학기술원
Issue Date
2022
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2022.2,[iv, 25 p. :]

URI
http://hdl.handle.net/10203/309855
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=997199&flag=dissertation
Appears in Collection
EE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0