Detecting termination by weight-throwing in a faulty distributed system
TC Tseng - Journal of Parallel and Distributed Computing, 1995 - Elsevier
This paper presents a fault-tolerant termination detection algorithm for a distributed system
in which processes tend to fail. Allowing an arbitrary number of processes to have fail-stop
behavior, the algorithm can detect termination efficiently with O (M+ kn+ n) control messages
and O (k+ 1) detection delays, where M is the number of basic messages issued, n is the
number of processes, and k is the actual number of processes that fail. This algorithm has
fewer detection delays than existing algorithms in the literature and comparable …
in which processes tend to fail. Allowing an arbitrary number of processes to have fail-stop
behavior, the algorithm can detect termination efficiently with O (M+ kn+ n) control messages
and O (k+ 1) detection delays, where M is the number of basic messages issued, n is the
number of processes, and k is the actual number of processes that fail. This algorithm has
fewer detection delays than existing algorithms in the literature and comparable …
Showing the best result for this search. See all results