A Framework for Area-efficient Multi-task BERT Execution on ReRAM-based Accelerators

Cited 1 time in webofscience Cited 0 time in scopus
  • Hit : 86
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorKang, Myeongguko
dc.contributor.authorShin, Hyeinko
dc.contributor.authorShin, Jaekangko
dc.contributor.authorKim, Lee-Supko
dc.date.accessioned2021-12-09T06:52:42Z-
dc.date.available2021-12-09T06:52:42Z-
dc.date.created2021-11-25-
dc.date.created2021-11-25-
dc.date.created2021-11-25-
dc.date.issued2021-11-
dc.identifier.citation40th IEEE/ACM International Conference on Computer Aided Design (ICCAD)-
dc.identifier.issn1933-7760-
dc.identifier.urihttp://hdl.handle.net/10203/290337-
dc.description.abstractWith the superior algorithmic performances, BERT has become the de-facto standard model for various NLP tasks. Accordingly, multiple BERT models have been adopted on a single system, which is also called multi-task BERT. Although the ReRAM-based accelerator shows the sufficient potential to execute a single BERT model by adopting in-memory computation, processing multi-task BERT on the ReRAM-based accelerator extremely increases the overall area due to multiple fine-tuned models. In this paper, we propose a framework for area-efficient multi-task BERT execution on the ReRAM-based accelerator. Firstly, we decompose the fine-tuned model of each task by utilizing the base-model. After that, we propose a two-stage weight compressor, which shrinks the decomposed models by analyzing the properties of the ReRAM-based accelerator. We also present a profiler to generate hyper-parameters for the proposed compressor. By sharing the base-model and compressing the decomposed models, the proposed framework successfully reduces the total area of the ReRAM-based accelerator without an additional training procedure. It achieves a 0.26× area than baseline while maintaining the algorithmic performances.-
dc.languageEnglish-
dc.publisherIEEE/ACM-
dc.titleA Framework for Area-efficient Multi-task BERT Execution on ReRAM-based Accelerators-
dc.typeConference-
dc.identifier.wosid000747493600037-
dc.identifier.scopusid2-s2.0-85124141467-
dc.type.rimsCONF-
dc.citation.publicationname40th IEEE/ACM International Conference on Computer Aided Design (ICCAD)-
dc.identifier.conferencecountryUS-
dc.identifier.conferencelocationVirtual-
dc.identifier.doi10.1109/ICCAD51958.2021.9643471-
dc.contributor.localauthorKim, Lee-Sup-
Appears in Collection
EE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 1 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0