Back to Search Start Over

Causal Distributed Breakpoints

Authors :
RICE UNIV HOUSTON TX DEPT OF COMPUTER SCIENCE
Fowler, Jerry
Zwaenepoel, Willy
RICE UNIV HOUSTON TX DEPT OF COMPUTER SCIENCE
Fowler, Jerry
Zwaenepoel, Willy
Source :
DTIC AND NTIS
Publication Year :
1990

Abstract

A causal distributed breakpoint is initiated by a sequential breakpoint in one process of a distributed computation, and restores each process in the computation to its earliest state that reflects all events that happened before the breakpoint. A causal distributed breakpoint is the natural extension for distributed programs of the conventional notion of a breakpoint in a sequential program. We present an algorithm for finding the causal distributed breakpoint given a sequential breakpoint in one of the processes. Approximately consistent checkpoint sets are used for efficiently restoring each process to its state in a causal distributed breakpoint. Causal distributed breakpoints assume deterministic processes that communicate solely by messages. The dependencies that arise from communication between processes are logged. Dependency logging and approximately consistent checkpoint sets have been implemented on a network of SUN workstations running the V-System. Overhead on the message passing primitive varies between 1 and 14 percent for dependency logging. Execution time overhead for a 200 x 200 Gaussian elimination is less than 4 percent, and generates a dependency log of 288 kilobytes.<br />Sponsored in part by Grant NSF-CDA86-19893.

Details

Database :
OAIster
Journal :
DTIC AND NTIS
Notes :
text/html, English
Publication Type :
Electronic Resource
Accession number :
edsoai.ocn832082422
Document Type :
Electronic Resource