Content area

Abstract

Fault-tolerant consensus is one of the foundational problems in distributed computing. A number of autonomous processes aim to agree on a common value among the initial ones, despite failures in processes or the communication medium. The rise of decentralized systems, with their continuously growing user base and applications, poses an unprecedented challenge to the scalability of fault-tolerant consensus.

Current consensus mechanisms often struggle to balance scalability with fault tolerance and communication efficiency, particularly in dynamic environments where failures can be adaptive or even malicious. These challenges extend beyond communication complexity as decentralized systems must also achieve low network latency—a critical measure in the era of fast access to information.

Surprisingly, these issues are not confined to practical implementations; they also extend to foundational theoretical models. Despite the introduction of the consensus problem more than 40 years ago by Lamport, Pease, and Shostak, and a substantial body of literature aimed at understanding its complexity across various theoretical domains, optimal—or even close-to-optimal—solutions for consensus in many of these domains remain elusive. The focus of this thesis is to bridge some of the existing gaps and to progress the understanding of the Consensus problem in synchronous peer-to-peer networks where parties are subject to failures.

As a first result [Hajiaghayi et al., 2022b], we propose a new randomized algorithm against an adaptive adversary that reduces the communication complexity of the problem by a factor linear in the size of the system (denoted n), resulting in only a Θ˜ ( √n) multiplicative gap from the known lower bounds. Under the same failure regime, the second result derandomizes the previous algorithm, providing a time- and communication-optimal deterministic solution for Consensus when the number of failed processes is at most n/logn [Chlebus et al., 2023]. 

Interestingly, the next proposed result precisely captures the tradeoff between time complexity and randomness in any solution to Consensus against adaptive adversaries capable of crashing processes. We prove that solutions with lower latency require more queries to random sources [Hajiaghayi et al., 2024a].

For systems exhibiting more severe omission failures, we provide the first algorithm that is optimal, up to polylogarithmic factors, in two key measures: time complexity (latency) and communication complexity (scalability) [Hajiaghayi et al., 2024a].

Finally, we present new solutions for distributed systems enhanced by quantum communication channels [Hajiaghayi et al., 2024b].

A common feature among the proposed solutions is their reliance on combinatorial properties of sparse but well-communicated graphs—probabilistic or deterministic expanders. The thesis unveils and provides a thorough treatment of numerous new properties of these graphs related to fast and efficient communication patterns in distributed systems. Last but not least, beyond the main results, it introduces several novel paradigms and algorithms for crash-resilient distributed computing (both randomized and deterministic), including, among others, counting, weak local coin, gossip, and checkpointing.

Details

1010268
Title
Towards Scalability and Efficiency in Distributed Systems
Number of pages
257
Publication year
2025
Degree date
2025
School code
0117
Source
DAI-A 86/12(E), Dissertation Abstracts International
ISBN
9798286424689
Committee member
Kowalski, Dariusz; Mount, David; Shi, Elaine; Srinivasan, Aravind
University/institution
University of Maryland, College Park
Department
Computer Science
University location
United States -- Maryland
Degree
Ph.D.
Source type
Dissertation or Thesis
Language
English
Document type
Dissertation/Thesis
Dissertation/thesis number
31770247
ProQuest document ID
3223840834
Document URL
https://www.proquest.com/dissertations-theses/towards-scalability-efficiency-distributed/docview/3223840834/se-2?accountid=208611
Copyright
Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.
Database
ProQuest One Academic