Robert H. B. Netzer's research works | Brown University, Rhode Island and other places

What is this page?

This page lists the scientific contributions of an author, who either does not have a ResearchGate profile, or has not yet added these contributions to their profile.

It was automatically created by ResearchGate to create a record of this author's body of work. We create such pages to advance our goal of creating and maintaining the most comprehensive scientific repository possible. In doing so, we process publicly available (personal) data relating to the author as a member of the scientific community.

If you're a ResearchGate member, you can follow this page to keep up with this author's work.

If you are this author, and you don't want us to display this page anymore, please let us know.

Experience with techniques for refining data race detection

Conference Paper

Jan 2006

Dynamic data race detection is a critical part of debugging shared-memory parallel programs. The races that can be detected must be refined to filter out false alarms and pinpoint only those that are direct manifestations of bugs. Most race detection methods can report false alarms because of imprecise run-time information and because some races ar...

Detecting Race Conditions in Parallel Programs that Use Semaphores

Article

Jan 2003

We address the problem of detecting race conditions in programs that use semaphores for synchronization. Netzer and Miller showed that it is NP-complete to detect race conditions in programs that use many semaphores. We show in this paper that it remains NP-complete even if only two semaphores are used in the parallel programs. For the tractable ca...

Optimal Run-Time Tracing of Message-Passing Programs

Article

Jul 2002

The widespread adoption of distributed computing has accentuated the need for an effective set of support tools to facilitate debugging and monitoring of distributed programs. Unfortunately for distributed programs, this is not a trivial task. Many distributed programs are inherently non-deterministic in nature. Two runs of the same programs with t...

Deadlock-Free Incremental Replay of Message-Passing Programs

Article

May 2001

To support incremental replay of message-passing applications, processes must periodically checkpoint and the content of some messages must be logged, to break dependencies of the current state of the execution on past events. This paper shows that known adaptive logging algorithms are likely to introduce deadlocks in replay, and we introduce a new...

Communication-Based Prevention of Useless Checkpoints in Distributed Computations.

Article

Full-text available

Jan 2000

A useless checkpoint is a local checkpoint that cannot be part of a consistent global checkpoint. This paper addresses the following problem. Given a set of processes that take (basic) local checkpoints in an independent and unknown way, the problem is to design communication-induced checkpointing protocols that direct processes to take additional...

Race-Condition Detection in Parallel Computation with Semaphores

Article

Jul 1999

We address a problem arising in debugging parallel programs, detecting race conditions in programs using semaphores for synchronization. It is NPcomplete to detect race conditions in programs that use many semaphores [10]. We show in this paper that it remains NP-complete even if the programs are allowed to use only two semaphores. For the case of...

An efficient logging algorithm for incremental replay of message-passing applications

Conference Paper

May 1999

To support incremental replay of message-passing applications, processes must periodically checkpoint and the content of some messages must be logged, to break dependencies of the current state of the execution on past events. The paper presents a new adaptive logging algorithm that dynamically decides whether to log a message based on dependencies...

Consistency Issues in Distributed Checkpoints.

Article

Apr 1999

A global checkpoint is a set of local checkpoints, one per process. The traditional consistency criterion for global checkpoints states that a global checkpoint is consistent if it does not include messages received and not sent. The paper investigates other consistency criteria, transitlessness, and strong consistency. A global checkpoint is trans...

An Efficient Logging Algorithm for Incremental Replay of Message-Passing Applications

Article

Feb 1999

To support incremental replay of message-passing applications, processes must periodically checkpoint and the content of some messages must be logged, to break dependencies of the current state of the execution on past events. The paper presents a new adaptive logging algorithm that dynamically decides whether to log a message based on dependencies...

Preventing useless checkpoints in distributed computations

Conference Paper

Full-text available

Nov 1997

A useless checkpoint is a local checkpoint that cannot be part of a consistent global checkpoint. The paper addresses the following important problem. Given a set of processes that take (basic) local checkpoints in an independent and unknown way, the problem is to design a communication induced checkpointing protocol that directs processes to take...

Replaying distributed programs without message logging

Conference Paper

Sep 1997

Debugging long program runs can be difficult because of the delays required to repeatedly re-run the execution. Even a moderately long run of five minutes can incur aggravating delays. To address this problem, techniques exist that allow re-executing a distributed program from intermediate points by using combinations of checkpointing and message l...

Communication-Based Prevention of Useless Checkpoints In Distributed Computations

Article

Full-text available

Jul 1997

Finding consistent global checkpoints in a distributed computation

Article

Full-text available

Jul 1997

Consistent global checkpoints have many uses in distributed computations. A central question in applications that use consistent global checkpoints is to determine whether a consistent global checkpoint that includes a given set of local checkpoints can exist. Netzer and Xu (1995) presented the necessary and sufficient conditions under which such a...

Consistency Issues in . . .

Article

Jun 1997

: A global checkpoint is a set of local checkpoints, one per process. The traditional consistency criterion for global checkpoints states that a global checkpoint is consistent iff it does not include messages received and not sent. This paper investigates other consistency criteria, transitlessness and strong consistency. A global checkpoint is tr...

Jong-Deok Choi

Article

May 1997

Flowback analysis is a powerful technique for debugging programs. It allows the programmer to examine dynamic dependences in a program's execution history without having to re-execute the program. The goal is to present to the programmer a graphical view of the dynamic program dependences. We are building a system, called PPD, that performs flowbac...

Communication-Based Prevention of Useless Checkpoints In Distributed Computations

Technical Report

Full-text available

May 1997

A useless checkpoint is a local checkpoint that cannot be part of a consistent global checkpoint. This paper addresses the following important problem. Given a set of processes that take (basic) local checkpoints in an independent and unknown way, the problem is to design a communicationinduced checkpointing protocol that directs processes to take...

Detecting Data Races on Weak Memory Systems Sarita V. Adve, Mark D. Hill, Barton P. Miller, Robert H.B. Netzer

Article

Full-text available

Apr 1997

For shared-memory systems, the most commonly assumed programmer's model of memory is sequential consistency. The weaker models of weak ordering, release consistency with sequentially consistent synchronization operations, data-race-free-0, and data-race-free-1 provide higher performance by guaranteeing sequential consistency to only a restricted cl...

Race-Condition Detection in Parallel Computation with Semaphores (Extended Abstract).

Conference Paper

Sep 1996

We address a problem arising in debugging parallel programs, detecting race conditions in programs using semaphores for synchronization. It is NP-complete to detect race conditions in programs that use polynomial number of semaphores [10]. We show in this paper that it remains NP-complete even if the programs are allowed to use only two semaphores,...

Detecting Race Conditions in Parallel Programs that Use One Semaphore

Conference Paper

Mar 1996

We address a problem arising in debugging parallel programs, detecting race conditions in programs using a single semaphore for synchronization. It is NP-complete to detect races in programs that use many semaphores. For the case of a single semaphore, we give an algorithm that takes O(n 1.5p) time, where p is the number of processors and n is the...

Debugging race conditions in message-passing programs

Article

Jan 1996

In this paper we address the problem of dynamically locating unwanted nondeterminism (race conditions) in executions of explicitly parallel message-passing programs. We formally define what it means for a race to exist and show conceptually how to dynamically locate races. We also show the importance of accurate race detection as a starting point f...

Sender-based message logging for reducing rollback propagation

Conference Paper

Nov 1995

We present a sender-based message logging protocol for supporting fault tolerance with checkpointing and rollback recovery in distributed systems. Our scheme achieves the benefits of both optimistic and pessimistic message logging. Experimental results show that the maximum rollback induced by our protocol, and the number of messages logged, can be...

Dynamic and I/O-Efficient Algorithms for Computational Geometry and Graph Problems: Theoretical and Experimental Results

Article

Oct 1995

Robert H. B. Netzer

As most important applications today are large-scale in nature, high-performance methods are becoming indispensable. Two promising computational paradigms for large-scale applications are dynamic and I/O-efficient computations. We give efficient dynamic data structures for several fundamental problems in computational geometry, including point loca...

Compressed Differences: An Algorithm for Fast Incremental Checkpointing

Article

Sep 1995

The overhead of saving checkpoints to stable storage is the dominant performance cost in checkpointing systems. In this paper, we present a complete study of compressed differences, a new algorithm for fast incremental checkpointing. Compressed differences reduce the overhead of checkpointing by saving only the words that have changed in the curren...

Necessary and sufficient conditions for consistent global snapshots

Article

Mar 1995

Consistent global snapshots are important in many distributed applications. We prove the exact conditions for an arbitrary checkpoint, or a set of checkpoints, to belong to a consistent global snapshot, a previously open problem. To describe the conditions, we introduce a generalization of Lamport's (1978) happened-before relation called a zigzag p...

Optimal tracing and replay for debugging message-passing programs

Article

Full-text available

Jan 1995

Acommon debugging strategy involves re-executing a program (on a given input) over and over,e ach time gaining more information about bugs. Such techniques can fail on message-passing parallel programs. Because of nondeterminacy,different runs on the given input may produce different results. This non-repeatability is a serious debugging problem, s...

Critical-path-based message logging for incremental replay of message-passing programs

Conference Paper

Jul 1994

Debugging long-running, nondeterministic message-passing parallel programs requires incremental replay, the ability to exactly replay selected parts of an execution. To support incremental replay, we must log enough messages and checkpoint processes often enough to allow any requested replay to complete quickly. We present an adaptive tracing strat...

Optimal Tracing and Incremental Reexecution for Debugging Long-Running Programs.

Conference Paper

Jun 1994

Adaptive message logging for incremental replay of message-passing programs

Conference Paper

Dec 1993

This paper presents an adaptive message logging algorithm that keeps time and space costs low by logging only a fraction of the messages. The algorithm dynamically tracks dependences among messages to determine which cause domino effects and must be traced. The domino effect can force a replay to start arbitrarily far back in the execution, and dom...

Adaptive Message Logging for Incremental Program Replay

Article

Dec 1993

Adaptive message logging, which traces dependences between messages and checkpoints and selectively logs messages, letting users accurately and efficiently replay specific portions of parallel programs, is presented. Traces are reduced by logging only messages that cannot be quickly recomputed during replay. By restarting the execution at the right...

Trace Size vs Parallelism in Trace-and-Replay Debugging of Shared-Memory Programs

Conference Paper

Sep 1993

Robert H. B. Netzer

Execution replay is a debugging strategy where a program is run over and over on an input that manifests bugs. For explicitly parallel shared-memory programs, execution replay requires support of special tools --- because these programs can be nondeterministic, their executions can differ from run to run on the same input. For such programs, execut...

Optimal Tracing and Replay For Debugging Shared-Memory Parallel Programs

Article

Sep 1993

Robert H. B. Netzer

Execution replay is a crucial part of debugging. Because explicitly parallel shared-memory programs can be nondeterministic, a tool is required that traces executions so they can be replayed for debugging. We present an adaptive tracing strategy that is optimal and records the minimal number of shared-memory references required to exactly replay ex...

A Bibliography of Parallel Debuggers, 1993 Eddition.

Conference Paper

Jan 1993

Adaptive message logging for incremental replay of message-passing programs

Article

Jan 1993

An abstract is not available.

A bibliography of parallel debuggers

Article

Jan 1993

What are Race Conditions? - Some Issues and Formalizations

Article

Full-text available

Sep 1992

In shared-memory parallel programs that use explicit synchronization, race conditions result when accesses to shared memory are not properly synchronized. Race conditions are often considered to be manifestations of bugs since their presence can cause the program to behave unexpectedly. Unfortunately, there has been little agreement in the literatu...

Experience with Techniques for Refining Data Race Detection.

Conference Paper

Full-text available

Jan 1992

Improving the Accuracy of Data Race Detection

Article

Full-text available

May 1991

For shared-memory parallel programs that use explicit synchronization, data race detection is an important part of debugging. A data race exists when concurrently executing sections of code access common shared variables. In programs intended to be data race free, they are sources of nondeterminism usually considered bugs. Previous methods for dete...

Techniques for Debugging Parallel Programs with Flowback Analysis

Article

Full-text available

May 1991

Detecting Data Races on Weak Memory Systems.

Conference Paper

Full-text available

May 1991

Not Available

A Bibliography of Parallel Debuggers, 1993 Edition

Article

Jan 1991

An abstract is not available.

Detecting Data Races in Parallel Program Executions

Article

Full-text available

Feb 1970

Several methods currently exist for detecting data races in an execution of a shared-memory parallel program. Although these methods address an important aspect of parallel program debugging, they do not precisely define the notion of a data race. As a result, is it not possible to precisely state which data races are detected, nor is the meaning o...

On the Complexity of Event Ordering for Shared-Memory Parallel Program Executions

Article

Full-text available

Feb 1970

This paper presents results on the complexity of computing event orderings for sharedmemory parallel program executions. Given a program execution, we formally define the problem of computing orderings that the execution must have exhibited or could have exhibited, and prove that computing such orderings is an intractable problem. We present a form...

Optimal Tracing and Replay for Debugging Message-Passing Parallel Programs

Article

Full-text available

Feb 1970

A common debugging strategy involves reexecuting a program (on a given input) over and over, each time gaining more information about bugs. Such techniques can fail on message-passing parallel programs. Because of variations in message latencies and process scheduling, different runs on the given input may produce different results. This non-repeat...