TY - GEN
T1 - Distributed Load Balancing in the Face of Reappearance Dependencies
AU - Agrawal, Kunal
AU - Kuszmaul, William
AU - Wang, Zhe
AU - Zhao, Jinhao
N1 - Publisher Copyright:
© 2024 Owner/Author.
PY - 2024/6/17
Y1 - 2024/6/17
N2 - We consider the problem of load-balancing on distributed databases. We assume that data is divided into chunks and each chunk can be replicated on a constant number d of servers. When a request arrives, it is routed to one of the servers that contains the relevant chunk. Each server may store outstanding requests in a bounded queue and requests may be rejected if the queue is full. The goal is to design strategies for data distribution and request routing that minimize both the rejection rate and the average request latency. What makes this problem technically difficult is reappearance dependencies: if a chunk x is accessed at multiple different time steps, then the set of d servers that it can be routed to is the same each time it is accessed. This is a substantial departure from classical balls-and-bins settings where each ball arrival introduces fresh randomness into the system. We show that, with new algorithmic and analytical approaches, it is possible to overcome reappearance dependencies and construct algorithms with optimal rejection rate, latency, and queue size.
AB - We consider the problem of load-balancing on distributed databases. We assume that data is divided into chunks and each chunk can be replicated on a constant number d of servers. When a request arrives, it is routed to one of the servers that contains the relevant chunk. Each server may store outstanding requests in a bounded queue and requests may be rejected if the queue is full. The goal is to design strategies for data distribution and request routing that minimize both the rejection rate and the average request latency. What makes this problem technically difficult is reappearance dependencies: if a chunk x is accessed at multiple different time steps, then the set of d servers that it can be routed to is the same each time it is accessed. This is a substantial departure from classical balls-and-bins settings where each ball arrival introduces fresh randomness into the system. We show that, with new algorithmic and analytical approaches, it is possible to overcome reappearance dependencies and construct algorithms with optimal rejection rate, latency, and queue size.
KW - balls into bins
KW - distributed key-value stores
KW - load balancing
KW - power of $d$ choices
UR - http://www.scopus.com/inward/record.url?scp=85197389934&partnerID=8YFLogxK
U2 - 10.1145/3626183.3659968
DO - 10.1145/3626183.3659968
M3 - Conference contribution
AN - SCOPUS:85197389934
T3 - Annual ACM Symposium on Parallelism in Algorithms and Architectures
SP - 321
EP - 330
BT - SPAA 2024 - Proceedings of the 36th ACM Symposium on Parallelism in Algorithms and Architectures
PB - Association for Computing Machinery
T2 - 36th ACM Symposium on Parallelism in Algorithms and Architectures, SPAA 2024
Y2 - 17 June 2024 through 21 June 2024
ER -