ArticlePDF Available

Analyzing Traces with Anonymous Synchronization

August 1992

August 1992

Authors:

David P. Helmbold

University of California, Santa Cruz

Charles E. Mcdowell

University of California, Santa Cruz

this paper. A trace specifies a total ordering of the events performed by the program. For our purposes, the trace reflects only one of the orders in which the events could have occurred. A more restrictive definition that is difficult to achieve in practice would be for a trace to specify the exact order in which the events did occur. Since traces are only approximations of executions, there are usually several executions that are consistent with a given trace. What we want to compute is the orderings between pairs of events that must occur in all executions which are consistent with the trace. In general this will be a partial order. If the partial order contains all orderings that must occur, then a pair of events not ordered by this "must occur" partial ordering can potentially execute in either order. Much research has been directed towards determining the partial ordering of events in parallel and distributed systems. Previous models have assumed point-to-point communication which makes it very easy to determine which events were caused by which other events (e.g. "message received by B from A" is clearly caused by "message sent by A to B"). Unfortunately the synchronization models supported by several parallel programming languages allow for anonymous communication, where the partner is unknown. Examples of anonymous communication include locks, semaphores, and monitors. Emrath, Ghosh, and Padua [EGP89] present a method for detecting non-determinacy in parallel programs that utilize fork/join and event style synchronization instructions with the Post, Wait, and Clear primitives. They construct a Task Graph from the given synchronization instructions and the sequential components of the program that is intended to show the guaranteed orderings between events. For ...

Content uploaded by David P. Helmbold

Content may be subject to copyright.

6. Conclusion

References

[AP87] T. R. Allen and D. A. Padua. Debugging fortran on a shared memory machine.

Proc. International Conf. on Parallel Processing

, pages 721{727, 1987.

[Dij65] E. W. Dijkstra. Solution of a problem in concurrent programming control.

Communications of the ACM

, 8(9), September 1965.

[EGP89] P. A. Emrath, S. Ghosh, and D. A. Padua. Event synchronization analysis for

debugging parallel programs. In

Supercomputing '89

, November 1989. Reno,

NV.

[EP88] P. A. Emrath and D. A. Padua. Automatic detection of nondeterminacy in

parallel programs. In

Proc. Workshop on Parallel and Distributed Debugging

pages 89{99, May 1988.

[Fid88] C. J. Fidge. Partial orders for parallel debugging. In

Proc. Workshop on

Parallel and Distributed Debugging

, pages 183{194, May 1988.

[GPH*88] M. D. Guzzi, D. A. Padua, J. P. Hoeinger, , and D. H. Lawrie. Cedar fortran

and other vector and parallel fortran dialects. In

Proceedings Supercomputing

'88

, pages 114{121, 1988.

[IBM88]

Parallel FORTRAN language and library reference

. IBM, 1988.

[Lam78] L. Lamport. Time, clocks, and the ordering of events in a distributed system.

CACM

, 21(7):558{565, July 1978.

[Lam86] Leslie Lamport. The mutual exclusion problem: part i{a theory of interprocess

communication.

JACM

, 33(2):290{312, April 1986.

[Mat88] F. Mattern. Virtual time and global states of distributed systems. In M.

Cosnard, editor,

Proceedings of Parallel and Distributed Algorithms

, 1988.

[McD89] C. E. McDowell. A practical algorithm for static analysis of parallel programs.

Journal of Parallel and Distributed Computing

, June, 1989.

[NM89] R. Netzer and B. P. Miller.

Detecting Data Races in Parallel Program Exe-

cutions

. Technical Report 894, University of Wisconsin-Madison, November

1989.

[Tay84] R. N. Taylor.

Debugging Real-Time Software in a Host-Target Environment

Technical Report, U.C. Irvine Tech. Rep. 212, 1984.

6. Conclusion

of events. We feel that this is misleading { an execution is more properly viewed as a

partial ordering on the events. Fidge and Mattern have pioneered the use of time vectors

to represent these partial orders. We have extended this approach by using time vectors

to analyze sets of executions rather than just capturing a single execution.

6. Conclusion

After adding the virtual edge from BW1

to CW1, CW1 becomes the second wait

on S1. Using Algorithm 5:

(BW1,CW1)

(BW1,CW1), (BW1,CS1), (BW1,CS2),

(BS1,CW1), (BS1,CS1), (BS1,CS2)

After adding the virtual edge from CS1 to

BW1, BW1 b ecomes the second wait on

S1. Again using Algorithm 5:

(CW1,BW1)

(BW1,CW1), (BS1,CW1), (BS2,CW1),

(BW1,CS1), (BS1,CS1), (BS2,CS1)

(BW1,CW1)

(CW1,BW1) =

(BW1,CW1),(BW1,CS1),(BS1,CS1),(BS1,CW1)

Figure 5.1: Detect Critical Regions

The problem is made even more dicult when there is no clear correspondence between

the blo cking and enabling events in the trace.

This paper contains a series of algorithms for extracting useful information from

sequential traces with anonymous synchronization. The rst algorithm is very similar to

the vector timestamp methods of Fidge and Mattern [Fid88, Mat88]. The other algorithms

systematically manipulate these vectors of timestamps in order to discover pairs of events

that must be ordered in every execution which is consistent with the trace. In addition to

presenting our algorithms, we have also proved their correctness.

Although our algorithms nd many of these \must-be-ordered" relationships, we have

been unable to prove that they nd all of them. We are investigating additional procedures

which can increase the number of \must-be-ordered" relationships found. We would also

like to distinguish all pairs of events that are concurrent in some consistent execution from

pairs of events which can happen in either order, but not concurrently.

Some parallel programming environments view a parallel execution as a linear sequence

6. Conclusion





2 =

)

, i.e., if there are enough signals for both waits to precede,

then the two waits can happen concurrently.



= 1 =

) :

(

), i.e., there is only one signal for a wait to precede, then we

can conclude that they cannot happen concurrently. The starting points of critical

regions have b een found. The following procedure is used to determine unordered

sequential event pairs in critical region.

1. First, assume that event

happened before

. Thus

is the

+ 2nd wait for

. Using Algorithm 4 with

+ 2 to calculate time vectors for event

and

other events.

Let

(

e; e

) =

(

; e

):(

; e

)

Conc

; e

;

and ^



(

)[

]





(

)[

]

or ^



(

)[

]





(

)[

]

Undo the timestamp up dating.

2. Similarly, assume that event

happened b efore

. Thus

is the

+ 2nd wait

for

. Using Algorithm 4 with

+ 2 to calculate time vectors for event

and other events.

Let

(

; e

) =

(

; e

):(

; e

)

Conc

; e

;

and ^



(

)[

]





(

)[

]

or ^



(

)[

]





(

)[

]

Undo the timestamp up dating.

Let Seq

(

e; e

)

(

; e

). Notice that Seq

maintains the set of unordered

event pairs in the critical region. They are not concurrent in any executions,

whenever

happened before

occurred before

3. Let Seq = Seq

[

Seq

Let Conc = Conc

Seq





0 means neither of them can precede. In this case, there is a deadlo ck.

End Algorithm 5.

Algorithm 5 generates two sets of event pairs. Conc contains those concurrent event

pairs. Seq contains those unordered sequential event pairs. The remaining event pairs are

ordered. Figure 5.1 shows the application of this algorithm to the trace from Figure 1.1.

6 Conclusion

One of the most dicult tasks in debugging parallel programs is determining the timing

relationships between the events performed by the parallel program. Although several

parallel systems include facilities for creating a trace of the signicant events, the sequential

nature of the trace makes it dicult to determine which events could have happened in

parallel.

5. Adjusting the Timestamps to Determine Concurrency

From equation 4.6 we know that at most

non-shadowed signals (excluding

) in

(

) do

not follow

(i.e. the

+ 1st smallest and later always follow

Therefore, in every execution, at least one of the

+ 1 non-shadowed signals preceding

follows (or is equal to)

By transitivity

happens b efore

in every execution consistent with, so,



5 Adjusting the Timestamps to Determine Concurrency

Up to now, we have computed a partial order that reects a safe order relation between

events from the trace

. Given any two events

and

, if ^



(

)[

]





(

)[

] or



(

)[

]





(

)[

] then the two events are ordered. Otherwise,

and

are two unordered

events. The unordered events are not necessarily concurrent events. They may have to

occur sequentially. In this case, we call them

unordered sequential

events. For example,

if the program has a properly implemented lock around a critical region, then dierent

executions may have tasks entering the critical region in dierent orders. In no execution,

however, do two tasks concurrently enter the critical region.

When debugging parallel programs, we would like to distinguish those pairs of events

that are concurrent in some consistent execution from pairs of events which can happen

in either order, but not concurrently. Unfortunately, the concurrent relation cannot be

determined immediately from the timestamps. We cannot necessarily say

can happen

concurrently with event

even if we know ^



(

)



(

). As an example, in Figure 4.2,

even though ^



(



(

C W

1), the two W1 events cannot occur at the same time. It is,

in general, a hard problem to determine whether two unordered events can really happen

concurrently.

Let

e; e

be a pair of events. Event

may happen concurrently with

only

if ^



(

)



(

). The following procedure can be used to detect critical regions, and

to determine unordered sequential event pairs in critical regions. The algorithm will

calculate two sets. The set Conc contains concurrent event pairs, while the set Seq

contains unordered sequential event pairs. Initially, we assume that all unordered events are

potential concurrent events. Once some critical regions have been detected, the algorithm

will move those unordered sequential event pairs from Conc to Seq.

Algorithm 5:

Initially let Conc =

e; e

, and ^



(

)



(

)

. Let Seq =



Repeat the following pro cedure until no more changes are p ossible.

Pick any two unordered wait events

and

for semaphore

where (

e; e

)

Conc.

Let

(

e; e

) be the set of wait events for semaphore

which precede either event

(based on current timestamps ^



Let

(

e; e

) =

is a signal event using

and

precedes

g [ f

is not

shadowed with respect to either

, and

does not follow either

Let

(

e; e

)

and

(

e; e

)

4. Expanding the Safe Order Relation

Figure 4.2: Expanding the Safe Order Relation

Case1: Assume ^



(

)[

] = ^



(

)[

] then



(

)[

]





(

)[

]

)



by the induction hypothesis (4.4)

)



by transitivity (4.5)

Case2: ^



(

)[

]

= ^



(

)[

Event

is a wait on semaphore

. Let

be computed as sp ecied in the algorithm, then



(

)[

]





(

)[

] = min



(

)[

] :

(

)

(4.6)

where non-shadowed signal set

(

) is computed according to the algorithm, and

min

selects the

+ 1st smallest value from the set.

In every execution, at least

+ 1 signal events precede

since there are at least

waits

on the same semaphore must happen b efore

In any arbitrary execution P, let

be the number of shadowed signals (with respect to

)

that precede

By transitivity the corresponding

shadowing waits precede

, and at least

waits

precede wait event

Therefore, at least

+ 1 signal events precede

in the execution and

+ 1 of them

are non-shadowed signals.

4. Expanding the Safe Order Relation

Therefore, the signal event

is shadowed by some wait event

where



with

respect to

This forms a contradiction with the assumption that

is shadowed by

The Algorithm 4 is based on the following observation. If

is a wait event on semaphore

and

other wait events on

must happen before

, then at least

+ 1 non-shadowed

signal events happen before

in every execution consistent with the trace.

Algorithm 4:

Initially ^



(

) =



(

) for all events

Repeat the following pro cedure until no more changes are p ossible.

Pick an event

. If

is a wait event using semaphore

, let



(

) be the set of wait events on semaphore



be the number of wait events

(

) such that

and if

then



(

)[

]





(

)[

], and



(

) =

: ^

is a signal event on

6

as indicated by the ^



timestamps, and ^

is not shadowed with respect to

and

= the

+ 1st component-wise minimum of ^



) for ^

(

is not a wait event, let

be the 0 vector.



(

) = max(^



(

)

; 

(

)

; v

)

End Algorithm 4.

Figure 4.2 shows the new ^



timestamps generated when Algorithm 4 is executed starting

with Figure 3.1.

Theorem 5:

Algorithm 4 generates only safe order relations, i.e., for any two events

and



(

)[

]





(

)[

]

)



Proof

: The proof is by induction on the number of updates. As a base case the theorem

holds for the initial values of ^



from Theorem 4.

Assume the theorem holds before some update. Consider two events

and

where



(

)[

]



(

)[

] before the update, and



(

)[

]





(

)[

] after the update.

Because ^



(

)[

] never changes, ^



(

)[

] was updated.

We consider two cases.

4. Expanding the Safe Order Relation

Figure 4.1: Shadowed Signal Event

Since for each shadowed signal there is only one corresponding shadowing wait (by

Denition 15), we have

(

)

jj

(

)

We only need to show that

(

)

(

)

Assume to the contrary that

(

)

(

)

, which means that there are at least two

signals

and

(

) shadowed by some

(

Assume



. Let

and

be the number of waits and signals on

performed by

between

and

be the number of waits and signals on

performed by

between

and

. This is shown in the following which represents the local subsequence

of events performed by some task, where time moves from left to right.

j ?

?!j j ?

?!j

...

j ?

?!j j ?

?!j

Therefore,

since

is shadowed by

(4.1)

+ 1 since

is shadowed by

(4.2)

Combining equations 4.1 and 4.2 gives us

+ 1 (4.3)

However, equation 4.3 means that the subsequence between

and

contains more waits

than signals.

4. Expanding the Safe Order Relation

add additional safe orderings into the partial order using the fact that only some wait

events in the trace can actually proceed immediately after each signal event. The partial

order resulting from this nal step will be represented by the time vectors ^



(

). Initially,



(

) =



(

Denition 14:

Let

be a wait event and

be a signal event on the same

semaphore

where



(

)



(

)

. Let

(

e; e

)

be the subsequence of

containing every

event

where



and



(

)



(

)

. If any sux of

(

e; e

)

contains more wait events

than signal events on

, then the signal event

shadowed

with respect to

Denition 15:

Let

(

e; e

)

be the shortest sux of

(

e; e

)

which contains more wait

events than signal events on

, and let

be the rst event of

(

e; e

)

. We say

shadowed by event

with respect to

Lemma 2:

Given a wait event

and a signal event

on the same semaphore

, if

shadowed by some event

with respect to

then



Event

is a wait event on semaphore



The event

, which shadows

with respect to

, is unique. We dene

to be the

shadowing wait event corresponding to

, and



The subsequence between

and

(in the same task) contains as many signal events

as wait events on semaphore

Proof

: The pro of is straightforward from the denitions.

Denition 16:

For any wait event

, let

(

) =

is shadowed with respect to

, and

(

) =

(

)

s.t.

is shadowed by

with respect to

In the example shown in Figure 4.1, the signal event CS1 is shadowed by CW1 with

respect to two wait events performed by task B.

Lemma 3:

For any wait event

, the correspondence between shadowed signal and

shadowing wait is one to one, i.e.,

(

)

(

)

Proof

: Let

be a wait event on semaphore

. From Denitions 14 and 15, we know

that any pair of corresponding shadowed signal and shadowing wait belongs to the same

task.

Therefore, it is enough to show that the corresp ondence between shadowed signal and

shadowing wait is one to one within each task

where 1



Let

(

) and

(

) be the sets of shadowed signal events and shadowing wait events

performed by task

with resp ect to

4. Expanding the Safe Order Relation

Equation 3.7 implies for some

max(



(

)

; 

(

)

; 

(

))[

]

max(



(

)

; 

(

)

;

min(



(

)

;

...



(

)))[

]

Equation 3.5 and 3.8 imply



(

)[

]

min(



(

)

;

...



(

))[

] (3.9)



(

)[

]

< 

(

)[

] (3.10)



(

)

6



(

) (3.11)

Again 3.11 contradicts the assumption that

is the rst event in the top ological order of

the partial order P such that



(

)

6



(

Therefore, there is no event

in any execution P such that



(

)

6



(

Theorem 4:

After rewinding, we have a partial order that is a safe order relation, i.e.



(

)

< 

(

)



Proof

: Let

be the task performing

, so



(

)[

] =



(

)[

] =



(

)[



(

)[

]





(

)[

] from the hypothesis (3.12)



(

)[

]





(

)[

] by Lemma 1 (3.13)



(

)[

]





(

)[

] since

(3.14)

for all

and (3.15)



from the denition of



(3.16)

The rewinding process is based on the fact that any signal event might enable any

wait event on the same semaphore. We may have lost some safe order relations during

rewinding. As an example, in Figure 3.1, time vector



says that two W2 events and

the W1 event in task A may happen concurrently with all of the events in task B and C.

However, it is obvious that the W1 in task A must happ en after the two S1 events in task

B and C, and the second W2 in task A has to wait until all of the events in B and C have

occurred. The nal step in the algorithm will nd some of the order relations lost during

the rewinding procedure.

4 Expanding the Safe Order Relation

The result of the rewind step is a partial order that is a safe order relation. It is an

overly conservative safe order relation because it assumed that any wait could happen

immediately after any signal for the same semaphore. We now undertake a process to

3. Rewinding the Time Vectors

From the inductive hypothesis



(

)



min(



(

)

;

...

; 

(

))

;

and from the algorithm

min(



(

)

;

...

; 

(

))

< 

)

Therefore



(

)

< 

After rewinding, we have a partial order that is a safe order relation. If event

has

an earlier time vector than

, we can say

will happen before

in all executions that are

consistent with the given trace. Before we prove this in theorem 4 we rst present one

lemma used in the proof.

Lemma 1:

For any execution

consistent with a trace

and for al l events



(

)





(

)

Proof

: Assume to the contrary that there is an execution

and some event

such that



(

)

6



(

In any topological ordering (with respect to partial order P) of events in E, let

be the

rst event in the topological ordering such that



(

)

6



(

We consider two cases.

Case1: If

is not a wait event then from Algorithms 1 and 3:



(

) = max(



(

)

; 

(

)) (3.3)



(

) = max(



(

)

; 

(

)) (3.4)

Note that



(

)

6



(

) by our choice of

. This contradicts the assumption that

is the

rst event in the topological order of the partial order P such that



(

)

6



(

Case2: Event

is a wait event. From the choice of

we get



(

)





(

) (3.5)



(

)

6



(

) (3.6)

Substituting the denitions of



and



into 3.6 gives:

max(



(

)

; 

(

)

; 

(

))

6

max(



(

)

; 

(

)

;

min(



(

)

;

...



(

))) (3

where

is the corresponding signal event of

and thus app ears before

in the topological

order, and each

for 1



is one of the

signal events for the semaphore waited

on by

3. Rewinding the Time Vectors

Figure 3.1: Rewinding the Time Vectors

Proof

: It is enough to show that



(

)[

]





)[

]

)



(

)

< 

). The opposite direction

follows directly from the denition of vector comparison.

The proof is by induction on the number of updates made by Algorithm 3. As the base

case, from theorem 2, the theorem holds for the initial



values.

Assume the theorem holds before some update. Consider two arbitrary events,

and

, after updating a single time vector.

Since Algorithm 3 does not change



(

)[

] and never increases time vectors, updating



(

)

can not make



(

)[

]





)[

]

)



(

)

< 

) false. Therefore, we consider three cases

when



) was up dated.

Case1:

= ^

then from the algorithm



(

)

< 

Otherwise,



(

)

< 

) which implies



(

)

< 

Case2:

and



)[

] =



)[

This implies



(

)[

]





)[

]. Since neither



) nor



(

) changed, by the induction

hypothesis



(

)





), and the algorithm ensures that



)

< 

). Therefore



(

)

< 

Case3:

and



)[

]



)[

]. This implies that ^

is a wait event for some semaphore

Let

...

be the signal events for the semaphore S. From the algorithm denition and

the assumption we know



(

)[

]





)[

] = min(



(

)

;

...

; 

(

))[

]

3. Rewinding the Time Vectors

3 Rewinding the Time Vectors

The result of the initialize step in the previous section is an unsafe order relation. It is

unsafe b ecause we assumed that the

th signal event for a particular semaphore was the

one allowing the

th wait event to precede. The next step is to rewind the time vectors to

account for the fact that any signal event might be the one that allowed any wait event on

the same semaphore to complete. We use



(

) to represent the new time vector assigned

to event

during and after the rewinding process. Initially



is the same as



Suppose

is a wait event, and

and

are two signal events, either of which could

have caused

to complete. In this case, we only know that either

must have

happened b efore

. The trace might be in any of the forms:

...

; e

;

...

; e;

...

; e

;

. . .;

...

; e

;

...

; e;

...

; e

;

. . .;

...

; e

;

...

; e

;

. . .

; e;

. . .; or

...

; e

;

...

; e

;

. . .

; e;

. . ..

However, we can conclude that the common ancestors of

and

must o ccur before

Therefore if



and



then



. The rewind step dened below uses this fact

to obtain a safe order relation.

Algorithm 3:

Initially,

E; 

(

) =



(

Repeat the following procedure until no further changes are possible.

For all event

, let



(

) = max(



(

)

; 

(

)

; v

)

where if

is wait event on semaphore S:

= min(



(

)

;

...

; 

(

)) (3.1)

where

...

are all the signal events for the semaphore S. (3.2)

otherwise

is the 0 vector.

End Algorithm 3.

Observe that the only dierence between Algorithm 3 and Algorithm 2 (used to

compute



) is that for wait events in Algorithm 3,

is the minimum of a set of time

vectors, which includes the time vector used for

in computing



. Therefore the values



will only get smaller as Algorithm 3 executes.

Theorem 3:

For any two distinct events

;



(

)[

]





)[

]

()



(

)

< 

)

2. Initializing the Vectors

2. If

is the corresponding signal event and

then



(

)[

]





(

)[

] and

)



(

)[

]





(

)[

] and the result follows.

3. If

is the corresponding signal event and

then

Property 8 (2.8)



(

)[

]





(

)[

] from the inductive hypothesis (2.9)



(

)[

]





(

)[

] from the denition of



(2.10)

and the result follows.

Theorem 2:

For any two distinct events

; e



(

)[

]





(

)[

]

()



(

)

< 

(

)

Proof

: The

(

direction is trivial. For the

)

direction, assume to the contrary that there

are two events,

and

where



(

)[

]





(

)[

] but



(

)

< 

(

). Thus there is

some vector comp onent

such that



(

)[

]

> 

(

)[

]

(2.11)

Let

be the event occurring in

with sequence number



(

)[

] then



(

)[

] =



(

)[

]

(2.12)

from Theorem 1 and 2.12 (2.13)

from the hypothesis and theorem 1 (2.14)

from transitivity of

(2.15)



(

)[

]





(

)[

] from 2.15 and Theorem 1 (2.16)

Combining 2.12 and 2.16 forms a contradiction with 2.11.

Corollary 1:

For any two distinct events



(

)[

]

> 

)[

]

and



)[

]

> 

(

)[

] =

)

The initialization process creates a partial ordering of the events in the trace. This

partial ordering corresponds to an execution which is strongly consistent with the trace.

It describ es the

happened before

relation for the

canonical execution

Unfortunately, this partial order only gives the happ ened before relationships between

events for the canonical execution, i.e. it is an unsafe order relation. The

th signal event in

one execution might not necessarily be the

th signal in some other execution. Therefore,

event

may not happen before ^

in some other execution even if it did in this execution.

Even when



(

)

< 

) we cannot say

must happen before

2. Initializing the Vectors

Proof

: From Properties 3 and 7 we know that if either side holds then

appears before

in the trace. Therefore, it suces to prove that whenever the algorithm assigns a time

vector to some event

, and

is any event appearing earlier in the trace (and thus already

assigned a time vector by the algorithm) the two conditions are equivalent. We prove this

by induction on the position of

in the trace.

After the rst event is assigned a time vector, the theorem trivially holds as no distinct

pairs of events have been assigned time vectors. We now show that the time vector assigned

to the next event,

, satises the theorem assuming that the time vectors assigned to all

events appearing before

in the trace satisfy the theorem.

We rst show that assuming



(

)[

]





(

)[

] then

. If

, so that the

two events are in the same task,

, the implication follows because the selected vector

component is the event count for task

. Otherwise the events occur in dierent tasks

and



(

)[

] =



)[

]

where

is either

(

or possibly

is a wait event and

is the corresponding signal event.

In either case ^

has previously been assigned a time vector and



(

)[

] =



)[

] by the denition of



(2.1)



(

)[

]





)[

] from the assumption (2.2)

from the denition of ^

(2.3)

Either

= ^

and the theorem is proven or by the induction hypothesis

, and by

transitivity

To prove that

)



(

)[

]





(

)[

] we consider three cases.

Case1: If

, so that the two events are in the same task, the result follows from

Properties 3 and 4.

Case2: If

is not a wait event then



(

)[

] =



(

)[

] (2.4)

Property 8 (2.5)



(

)[

]





(

)[

] from the hypothesis (2.6)



(

)[

]





(

)[

]

(2.7)

Case3: If

is a wait event then we have three subcases:

1. If

is the corresponding signal event then the result trivially holds.

2. Initializing the Vectors

Algorithm 2:

To compute initial time vectors,



(

), from a trace

use algorithm 1 with

the following mo dications.



The

th wait event on semaphore

(in trace order) corresp onds to the

th signal

event on



The events are assigned time vectors in the order they appear in the trace.

End Algorithm 2.

For the given trace, Figure 1.1(a) shows the result of the initialization procedure.

The time vectors computed for the canonical execution have the following properties:

Property 4:

and

are two events in the same task

and

occurred before

in the

trace, then

and



(

)

< 

)

Property 5:

and

are the corresponding signal/wait pair (the

th signal and the

wait on the same semaphore

in the trace), then

and



(

)

< 

)

Property 6:

At any point in the trace, the maximum value of any time vector component

is the number of events performed by the corresponding task up to that point.

Property 7:

and



(

)[

]





)[

]

then either

= ^

appears before

in the

trace.

Because an event is only constrained to follow its predecessor in the same task, and in

the case of wait events, the corresponding signal, the following property holds.

Property 8:

then one of the following is true:

= ^

where

is a wait event and

is the corresponding signal event,

where

is a wait event and

is the corresponding signal event.

Given the correspondence between signal and wait events for execution P, events can

be assigned time vectors by using Algorithm 1. Mattern [Mat88] has shown that the time

vector



correctly represents the partial order relation

, i.e., for any pair of distinct

events

and



(

)[

]





(

)[

]

()

For completeness, we now prove that the initial time vectors,



, correctly represent the

happened before relation for the canonical execution.

Theorem 1:

For any pair of distinct events

and



(

)[

]





(

)[

]

()

2. Initializing the Vectors

1.3 An Overview of the New Algorithms

In the following sections we will introduce a series of algorithms to calculate dierent

time vectors for trace events. By comparing their nal time vectors, we can distinguish

many

ordered

events from the

unordered

, p otentially concurrent, events. Our goal is a set

of time vectors where if event

has an earlier time vector than

, then

will happen

before

all

executions that are consistent with the given trace

The three phases of the algorithm are \initialize", \rewind", and \expand". The

initialization uses Algorithm 1. The resulting partial order is similar to that computed

by the algorithm of [Fid88]. This partial order is shown to be equivalent to the \happened

before",

, relation for a canonical execution

. Note that the canonical execution is in

general not the same execution which generated the trace. The result of the rewinding

phase is a partial order that is a subrelation of the



relation. Unfortunately this safe

order relation is overly conservative, in that there may be many \must happen before"

relations that it does not include. The third and nal phase results in a safe partial order

that is closer to the \must happen before" relation.

2 Initializing the Vectors

Before giving the algorithm for computing the initial time vectors, we dene a canonical

execution that will be used to verify the \correctness" of the time vectors.

Denition 13:

Given a trace

with the total ordering of events,

, the partial order

corresponding to the

canonical

execution

is constructed by selecting and taking the

transitive closure of the fol lowing subrelation of



and

are two events from the same task and

then



and

are the

th signal and wait events respectively on the same semaphore,

then

In the remainder of the paper we will use

to mean

where

is the canonical

execution dened above.

Property 3:

then

appears before

in the trace.

Given a specic input and a trace, there are in general executions which are not consistent with that

trace, however, any such execution will contain a race if and only if a race occurred in the execution that

generated the trace [AP87].

1. Introduction

length

, where

is the total number of tasks.

Each task

has its own vector component

[

] which guarantees a strict temporal ordering of events occurring in that task. A local

event counter which is incremented each time an event occurs in the task can b e used as

the lo cal clock.

Before presenting the algorithms for computing time vectors from a trace, we need to

dene some notation.

Denition 9:

For an event

is the previous event performed by the same task

if such an event exists.

Denition 10:

For an event



(

)

is the time vector containing the local event

count for

in the

th position and zeros elsewhere.

Denition 11:

For any two time vectors

u; v



() 8

(

[

]



[

])

u<v

()



and

() :

(

u < v

)

and

(

v < u

)

Denition 12:

For any k time vectors

;

...

; v



min(

;

...

; v

)

is a vector of

whose

th component is

min(

[

]

;

...

; v

[

])

, and



max(

;

...

; v

)

is a vector of

whose

th component is

max(

[

]

;

...

; v

[

])

The following algorithm (derived from [Mat88, Fid88]) computes time vectors for the

events in an execution. This algorithm requires the correspondence between signal and

wait events. The time vectors produced reect the execution's partial order.

Algorithm 1:

Given the correspondence between signal and wait events for execution

events are assigned time vectors,



(

), in topological order.



(

) = max(

; v

; 

(

))

where

(



(

) if

has a predecessor

the 0 vector otherwise



) if

is a wait event and

is the corresponding signal event

the 0 vector if

is not a wait event

End Algorithm 1.

We use an integer valued clock in our discussion although a real number valued clock can also be used.

1. Introduction

Denition 4:

An execution is

strongly consistent

with a trace if it is consistent and

the total order specied by the trace is an extension of the partial order specied by the

execution.

For example, consider the trace

AS1, CW1, CS1, CS2, BW1, BS1, BS2, AW2, AW2,

AW1

. Event AS1 means task A performs a signal(

), AW1 means task A performs

wait(

) etc. Figure 1.1 shows the four executions which are consistent with this trace. In

addition, the executions (a) and (b) are strongly consistent with the trace.

Denition 5:

Consider the correspondence between signal and wait events in execution P

and two distinct events

e; e

. If

and

then events

and

are concurrent, and

thus can happen at the same time, in the execution.

Denition 6:

The symbol \

" is used to represent the concurrent relationship between

events. Two events

and

are concurrent, i.e.

, if they can happen at the same

time in some execution which is consistent with the trace.

Denition 7:

The symbol \



" is used to represent the

must happen b efore

relationship

between events. Given two events

and

, if



, then event

will happen before

all executions that are consistent with the given trace. Events

and

are

ordered



, otherwise, they are

unordered

Concurrent events are always unordered, but unordered events need not be concurrent.

For example, see events BW1 and CW1 in Figure 1.1.

Notice that, in general,



is dierent from the relation

for any choice of

The former relation tells us that

must happen before

in all executions consistent with

the trace being analyzed, while the later says that

happened before

in the execution

represented by the partial order

. If



then

for all consistent executions

But the converse condition do es not hold.

In Figure 1.1, CS1

BW1 if

is the execution (a). However, if

is the execution

(c), BS1

CW1, and BW1

CS1 by transitivity. Event AS1 happens before BW1 and

CW1 in all executions consistent with the trace, therefore AS1



BW1 and AS1



CW1.

There is no order relation between event CS2 and BW1 in execution (a). Therefore, they

can happen concurrently, i.e., CS2

BW1.

Denition 8:

A partial ordering R on the events is a

safe

order relation if

)



. If R is not safe, then R is

unsafe

1.2 Virtual Time

The concept of virtual time for distributed systems was introduced by Lamport in 1978

[Lam78]. The time vectors we compute in this paper are an extension of the time vectors of

Fidge [Fid88] and Mattern [Mat88]. There, each task

has a clock

which is a vector of

1. Introduction

Trace =

AS1, CW1, CS1, CS2, BW1, BS1, BS2, AW2, AW2, AW1

Figure 1.1: Trace, Executions, and Time Vectors

1. Introduction



and a (positive integer) sequence number equal to one plus the number of previous

operations performed by the task.

In order to perform the nal race analysis, it must be p ossible to determine from a trace

what shared objects are referenced b etween any two synchronization events. This can b e

done by additionally associating with each event the source line number of the statement

generating the event. From this the path between two adjacent events can be determined

and the variables referenced along the path can be computed [McD89].

Many other kinds of synchronization op erations can b e simulated by using counting

semaphores. Consider, for example, the event \

init task t

" which creates a new task

and the event \

await task t

" which blocks the running task until task

has terminated.

Given a trace containing these events, we can create an equivalent a trace containing only

semaphore events.

In each execution every wait event has a corresponding signal event. We use this

correspondence to dene a partial order representing that execution.

Denition 1:

execution

of a paral lel program is a partial ordering of the events

performed. This partial order is the transitive closure of edges from each event to the

next event performed by the same task and edges to each wait event from the corresponding

signal event.

The relation dened by the partial order

representing an execution is called the

happened

before

relation and is denoted with the symbol

. Our denition of \happened before" is

consistent with that of Lamport[Lam78].

Denition 2:

A trace of an execution is an interleaving of the local sequences of events

for



where for every prex of the trace and every semaphore S, the prex

contains at least as many signal(S) events as wait(S) events.

Every trace must satisfy the following properties:

Property 1:

No two events in the trace have both the same task id and the same sequence

number.

Property 2:

If there is an event with task id

and sequence number

, then for every



i < k

, there is an event with task id

and sequence number

appearing earlier in the

trace.

A single execution usually has many possible traces. Similarly, a single trace could have

been generated by any one of a number of executions. (Figures 1.1(a) and 1.1(b) show two

dierent executions for the same trace).

Denition 3:

An execution is

consistent

with a trace if the local sequences of trace events

for each task



is the same as in the execution.

1. Introduction

occur" execution order. Our algorithms appear to be more ecient and may nd more

guaranteed order relations.

Netzer and Miller [NM89] present a formal model of a program execution based

on Lamport's model of concurrent systems [Lam86]. Their model includes fork/join

parallelism and synchronization using semaphores. They distinguish b etween an

actual

data race

, which is a data race exhibited by the particular program execution generating

the trace, and a

feasible data race

, which is a data race that could have been exhibited

due to timing variations. They show how to characterize each detected data race as either

being feasible, or as belonging to a set of data races such that at least one data race in

the set is feasible. They rely on the trace for their ordering information. As an example,

when two tasks try to enter some critical regions surrounded by some binary semaphore

S, their algorithm will say that these two tasks are ordered when accessing these regions.

Under their denitions there is neither an actual nor feasible data race even if two tasks

write to some shared variable in this case. We view the ordering relationships in the trace

with suspicion, and wish to generate race reports in this situation.

We believe that it is more helpful to analyze sets of executions rather than just one

specic execution based on some trace information. We feel that, in terms of detecting data

races by trace analysis, it is critical to distinguish the

ordered

events from the

unordered

potentially

concurrent

, events. In this paper we present a collection of algorithms that

extend previous work in computing partial orders. The algorithms presented compute a

partial order containing only \

must occur

" type orderings from a linearly ordered trace

containing anonymous synchronization. The algorithms presented in this paper make

few assumptions about specic trace features and can be adjusted to work with traces

generated by many parallel systems, including IBM Parallel Fortran [IBM88], and Cedar

Fortran [GPH*88].

1.1 Description of the Mo del

We view a parallel program as a nite set of

tasks

;

...

; T

where

is the number

of tasks in the system. These tasks p erform synchronization and computation operations,

including computation on shared data

. In an execution, each task

is a sequential entity

characterized by a local sequence

of events. Dierent tasks may perform operations

concurrently. We assume, for convenience, that each task has a unique identier.

In our model, programs synchronize using only counting semaphores which are assumed

to be initialized to zero. Therefore, each event is a tuple containing:



the operation completed (wait or signal),



the semaphore aected,



the id of the task that performed the operation,

Although operations on shared data can b e used for synchronization [Dij65], we only consider explicit

synchronization operations as capable of generating synchronization events.

1. Introduction

1 Introduction

One of the fundamental problems encountered when debugging a parallel program is

determining the race conditions in the program. A race condition may exist when two

or more parallel tasks access shared data in an unspecied order and at least one of the

accesses is a write access. Notice that races include both accesses that may occur \at the

same time" and accesses that must occur sequentially but the order is unspecied (e.g.

accesses protected by a lock). One approach to determining potential races is based on

computing all of the reachable concurrent states of the program [McD89, Tay84]. The

major disadvantage of this approach is that the number of concurrent states may become

prohibitively large. Another approach to determining p otential races is based on analyzing

a trace from an execution of the program [EP88, EGP89, NM89]. This approach has the

disadvantage that a trace must be recorded, and is limited to determining races that

can occur given the input data used. Even for the given data, it may not b e possible

to determine all races [AP87]. Nevertheless, this later approach can provide important

information to help in debugging parallel programs and is the sub ject of this paper.

trace

species a total ordering of the events performed by the program. For our

purposes, the trace reects only one of the orders in which the events could have occurred.

A more restrictive denition that is dicult to achieve in practice would be for a trace to

specify the exact order in which the events did occur. Since traces are only approximations

of executions, there are usually several executions that are consistent with a given trace.

What we want to compute is the orderings b etween pairs of events that

must occur

in all

executions which are consistent with the trace. In general this will be a partial order. If

the partial order contains all orderings that must occur, then a pair of events not ordered

by this \

must occur

" partial ordering can potentially execute in either order.

Much research has been directed towards determining the partial ordering of events in

parallel and distributed systems. Previous models have assumed point-to-point commu-

nication which makes it very easy to determine which events were caused by which other

events (e.g. \message received by B from A" is clearly caused by \message sent by A to

B"). Unfortunately the synchronization models supported by several parallel programming

languages allow for anonymous communication, where the partner is unknown. Examples

of anonymous communication include lo cks, semaphores, and monitors.

Emrath, Ghosh, and Padua [EGP89] present a method for detecting non-determinacy

in parallel programs that utilize fork/join and event style synchronization instructions

with the

Post, Wait

, and

Clear

primitives. They construct a

Task Graph

from the given

synchronization instructions and the sequential components of the program that is intended

to show the guaranteed orderings b etween events. For each

Wait

event node, all

Post

nodes

that might have triggered that

Wait

are identied. An edge is then added from the closest

common ancestor of these

Post

events to the

Wait

event node. The idea of the algorithm

is very simple, but it may be computationally complex. Also some of the guaranteed

order relations may be missed by their algorithm. Rather than repeatedly computing

the common ancestor information, we use time vectors to calculate the guaranteed \must

Analyzing Traces with

Anonymous Synchronization

David P. Helmbold

Charles E. McDowell

Jian-Zhong Wang

UCSC-CRL-89-42

December, 1989

Board of Studies in Computer and Information Sciences

University of California at Santa Cruz

Santa Cruz, CA 95064

abstract

In a parallel system, events can o ccur concurrently. However, programmers are often

forced to rely on misleading sequential traces for information about their program's behav-

ior. We present a series of algorithms which extract ordering information from a sequential

trace with anonymous semaphore-style synchronization.

We view a program execution as a partial ordering of events, and dene which executions

are consistent with a given trace. Although it is generally not possible to determine which

of the consistent executions occurred, we dene the notion of \safe orderings" which are

guaranteed to occur in every execution which is consistent with the trace.

The main results of the paper are algorithms which determine many of the \safe or-

derings". The rst algorithm starts from a sequential trace and creates a partially ordered

canonical execution. The second algorithm strips away the ordering relationships particular

to the canonical execution, so that the resulting partial order is safe. The third algorithm

increases the amount of ordering information while maintaining a safe partial order. All

three algorithms are accompanied by proofs of correctness.

keywords: virtual time, program tracing, parallel processing, debugging

This work was supported by IBM under agreement SL 88096.

A Unified Model for Concurrent Debugging.

Conference Paper

Full-text available

Jan 1993

Maotai 2.0: Data Race Prevention in View-Oriented Parallel Programming

Conference Paper

Full-text available

Jan 2010

This paper proposes a data race prevention scheme, which can prevent data races in the View-Oriented Parallel Programming (VOPP) model. VOPP is a novel shared-memory data-centric parallel programming model, which uses views to bundle mutual exclusion with data access. We have implemented the data race prevention scheme with a memory protection mechanism. Experimental results show that the extra overhead of memory protection is trivial in our applications. We also present a new VOPP implementation-Maotai 2.0, which has advanced features such as deadlock avoidance, producer/consumer view and system queues, in addition to the data race prevention scheme. The performance of Maotai 2.0 is evaluated and compared with modern programming models such as OpenMP and Cilk.

Detecting Race Conditions in Parallel Programs that Use Semaphores

Article

Apr 2003

We address the problem of detecting race conditions in programs that use semaphores for synchronization. Netzer and Miller showed that it is NP-complete to detect race conditions in programs that use many semaphores. We show in this paper that it remains NP-complete even if only two semaphores are used in the parallel programs. For the tractable case, i.e., using only one semaphore, we give two algorithms for detecting race conditions from the trace of executing a parallel program on p processors, where n semaphore operations are executed. The first algorithm determines in O(n) time whether a race condition exists between any two given operations. The second algorithm runs in O( np log n) time and outputs a compact representation from which one can determine in O(1) time whether a race condition exists between any two given operations. The second algorithm is near-optimal in that the running time is only O( log n) times the time required simply to write down the output.

2002 Report for the SAAPP Project

Article

Jakob Engblom

Experience with techniques for refining data race detection

Conference Paper

Jan 2006

Dynamic data race detection is a critical part of debugging shared-memory parallel programs. The races that can be detected must be refined to filter out false alarms and pinpoint only those that are direct manifestations of bugs. Most race detection methods can report false alarms because of imprecise run-time information and because some races are caused by others. To overcome this problem, race refinement uses whatever run-time information is available to speculate on which of the detected races should be reported. In this paper we report on experimental tests of two refinement techniques previously developed by us. Our goal was to determine whether good refinement is possible, and how much run-time information is required. We analyzed two sets of programs, one set written by others (which they had tested and believed to be race-free but which in fact had subtle races) and another set written by us (in which we introduced more complex races). We performed race detection and refinement on executions of these programs, and recorded both the global event ordering and an approximate ordering recorded without a global clock. We found that in all the programs written by others, accurate refinement was possible even without the global ordering. In the other programs, accurate refinement was also possible but required the global ordering. These results suggest that our techniques refine races accurately, and lead a programmer directly to race-causing bugs. They also suggest that race detection methods should record only enough information necessary for good refinement (either global or approximate event orderings), and this information depends on the severity of the races being debugged.

Data race: Tame the beast

Article

Full-text available

Mar 2010

Data races hamper parallel programming and threaten the reliability of future software. This paper proposes the data race prevention scheme View-Oriented Data race Prevention (VODAP), which can prevent data races in the View-Oriented Parallel Programming (VOPP) model. VOPP is a novel shared-memory data-centric parallel programming model, which uses views to bundle mutual exclusion with data access. We have implemented the data race prevention scheme with a memory protection mechanism. Experimental results show that the extra overhead of memory protection is trivial in our applications. The performance is evaluated and compared with modern programming models such as OpenMP and Cilk. VOPP-View oriented parallel programming-Concurrent programming-SPMD-Data race free-Data-centric programming-Cilk-Multicore-Shared-memory

State Based Visualization of PVM Applications.

Conference Paper

Jan 1996

Roland Wismüller

Experience with Techniques for Refining Data Race Detection.

Conference Paper

Full-text available

Jan 1992

Race Detectors for Cilk and Cilk++ Programs

Chapter

Jan 2011

The Cilk++ concurrency platform

Conference Paper

Jan 2009

Charles E. Leiserson

The availability of multicore processors across a wide range of computing platforms has created a strong demand for software frameworks that can harness these resources. This paper overviews the Cilk++ programming environment, which incorporates a compiler, a runtime system, and a race-detection tool. The Cilk++ runtime system guarantees to load-balance computations effectively. To cope with legacy codes containing global variables, Cilk++ provides a ldquohyperobjectrdquo library which allows races on nonlocal variables to be mitigated without lock contention or substantial code restructuring.

ResearchGate has not been able to resolve any references for this publication.

Analyzing Traces with Anonymous Synchronization

Abstract

Recommended publications

On the loss of parallelism by imposing synchronization structure

Detecting Data Races by Analyzing Sequential Traces

Detecting data races from sequential traces

Computing Reachable States of Parallel Programs

Debugging concurrent programs