Worker Assignment for Multiple Masters to Speed Up Coded Distributed Computing in Heterogeneous Clusters

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 92
  • Download : 0
In distributed computing systems, coding has played an important role to robustify the system against the effect of noise, e.g., stragglers, system failures and communication bottlenecks. Most of the existing work has focused on a simple master-worker model with one master and homogeneous workers. However, real-world systems are typically configured with heterogeneous workers distributed to computing nodes and serve multiple tasks in parallel. In this study, we consider the scenario in which multiple masters perform matrix multiplications using the workers having group heterogeneity. The group heterogeneity models that homogeneous workers are located in the same location and regarded as a group; the workers deployed in the different locations are potentially heterogeneous. We propose an asymptotically optimal worker assignment to multiple masters for coded distributed computing in the presence of heterogeneous groups of workers. Specifically, we present a lower bound for the expected latency in terms of the numbers of workers assigned to the masters and the amount of tasks allocated to workers. Adding the concentration constraints on the number of workers allocated to masters, we can obtain the minimum of the lower bound by taking the optimal worker assignment. We find the optimal worker assignment by converting the problem at hand into a linear programming problem. From both numerical simulations and experiments on Amazon EC2 clusters, we confirm that the effect of the proposed worker assignment is significant in various scenarios.
Publisher
IEEE COMPUTER SOC
Issue Date
2023-05
Language
English
Article Type
Article
Citation

IEEE TRANSACTIONS ON SERVICES COMPUTING, v.16, no.3, pp.2283 - 2298

ISSN
1939-1374
DOI
10.1109/TSC.2022.3201550
URI
http://hdl.handle.net/10203/310984
Appears in Collection
EE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0