DFX: A Low-latency Multi-FPGA Appliance for Accelerating Transformer-based Text Generation

DFX is a multi-FPGA appliance that accelerates transformer-based text generation. DFX adopts model parallelism to efficiently process large-scale language models. The Xilinx Alveo U280 data center accelerator card provides high performance at low cost. FPGA-to-FPGA communication is enabled by QSFP cables at 100 Gb/s.
Publisher
Institute of Electrical and Electronics Engineers Inc.
Issue Date
2022-08-22
Language
English
Citation

2022 IEEE Hot Chips 34 Symposium, HCS 2022

DOI
10.1109/HCS55958.2022.9895626
URI
http://hdl.handle.net/10203/301196
Appears in Collection
EE-Conference Papers
