Flagger: Cooperative Acceleration for Large-Scale Cross-Silo Federated Learning Aggregation

DC Field | Value | Language
dc.contributor.author | Pan, Xiurui | ko
dc.contributor.author | An, Yuda | ko
dc.contributor.author | Liang, Shengwen | ko
dc.contributor.author | Mao, Bo | ko
dc.contributor.author | Mingzhe Zhang | ko
dc.contributor.author | Li, Qiao | ko
dc.contributor.author | Jung, Myoungsoo | ko
dc.contributor.author | Zhang, Jie | ko
dc.date.accessioned | 2024-09-11T19:00:14Z | -
dc.date.available | 2024-09-11T19:00:14Z | -
dc.date.created | 2024-04-25 | -
dc.date.issued | 2024-06-29 | -
dc.identifier.citation | 51st IEEE/ACM International Symposium on Computer Architecture, ISCA 2024 | -
dc.identifier.uri | http://hdl.handle.net/10203/322921 | -
dc.description.abstract | Cross-silo federated learning (FL) leverages homomorphic encryption (HE) to obscure the model updates from the clients. However, HE poses the challenges of complex cryptographic computations and inflated ciphertext sizes. As cross-silo FL scales to accommodate larger models and more clients, the overheads of HE can overwhelm a CPU-centric aggregator architecture, including excessive network traffic, enormous data volume, intricate computations, and redundant data movements. Tackling these issues, we propose Flagger, an efficient and high-performance FL aggregator. Flagger meticulously integrates the data processing unit (DPU) with computational storage drives (CSD), employing these two distinct near-data processing (NDP) accelerators as a holistic architecture to collaboratively enhance FL aggregation. With the delicate delegation of complex FL aggregation tasks, we build Flagger-DPU and Flagger-CSD to exploit both in-network and in-storage HE acceleration to streamline FL aggregation. We also implement Flagger-Runtime, a dedicated software layer, to coordinate NDP accelerators and enable direct peer-to-peer data exchanges, markedly reducing data migration burdens. Our evaluation results reveal that Flagger expedites the aggregation in FL training iterations by 436% on average, compared with traditional CPU-centric aggregators. | -
dc.publisher | IEEE/ACM | -
dc.title | Flagger: Cooperative Acceleration for Large-Scale Cross-Silo Federated Learning Aggregation | -
dc.type | Conference | -
dc.type.rims | CONF | -
dc.citation.publicationname | 51st IEEE/ACM International Symposium on Computer Architecture, ISCA 2024 | -
dc.identifier.conferencecountry | AG | -
dc.identifier.conferencelocation | Buenos Aires | -
dc.contributor.localauthor | Jung, Myoungsoo | -
dc.contributor.nonIdAuthor | Pan, Xiurui | -
dc.contributor.nonIdAuthor | An, Yuda | -
dc.contributor.nonIdAuthor | Liang, Shengwen | -
dc.contributor.nonIdAuthor | Mao, Bo | -
dc.contributor.nonIdAuthor | Mingzhe Zhang | -
dc.contributor.nonIdAuthor | Li, Qiao | -
dc.contributor.nonIdAuthor | Zhang, Jie | -
Appears in Collection
EE-Conference Papers (Academic Conference Papers)
Files in This Item
There are no files associated with this item.
