ArticlePDF Available

Point processes for unsupervised line network extraction in remote sensing

November 2005
IEEE Transactions on Pattern Analysis and Machine Intelligence 27(10):1568-79

November 2005
27(10):1568-79

DOI:10.1109/TPAMI.2005.206

Source
PubMed

Authors:

Xavier Descombes

National Institute for Research in Computer Science and Control

Josiane Zerubia

National Institute for Research in Computer Science and Control

This paper addresses the problem of unsupervised extraction of line networks (for example, road or hydrographic networks) from remotely sensed images. We model the target line network by an object process, where the objects correspond to interacting line segments. The prior model, called "Quality Candy," is designed to exploit as fully as possible the topological properties of the network under consideration, while the radiometric properties of the network are modeled using a data term based on statistical tests. Two techniques are used to compute this term: one is more accurate, the other more efficient. A calibration technique is used to choose the model parameters. Optimization is done via simulated annealing using a Reversible Jump Markov Chain Monte Carlo (RJMCMC) algorithm. We accelerate convergence of the algorithm by using appropriate proposal kernels. The results obtained on satellite and aerial images are quantitatively evaluated with respect to manual extractions. A comparison with the results obtained using a previous model, called the "Candy" model, shows the interest of adding quality coefficients with respect to interactions in the prior density. The relevance of using an offline computation of the data potential is shown, in particular, when a proposal kernel based on this computation is added in the RJMCMC algorithm.

Segment types of the “Candy” model defined with respect to R c .

…

Internal bad orientation R io . f s 1 ; s 2 g and f s 1 ; s 3 g hold ð C 1 Þ as p 2

…

External bad orientation R eo . s 2 , s 3 , and s 3 have an extremity in

…

A drawback of the “Candy” model.

…

Different types of connection. f s 1 ; s 2 g : attractive connection.

…

Figures - uploaded by Xavier Descombes

Content may be subject to copyright.

Content uploaded by Xavier Descombes

Content may be subject to copyright.

Point Processes for Unsupervised Line Network

Extraction in Remote Sensing

Caroline Lacoste, Xavier Descombes, and Josiane Zerubia, Fellow, IEEE

Abstract—This paper addresses the problem of unsupervised extraction of line networks (for example, road or hydrographic

networks) from remotely sensed images. We model the target line network by an object process, where the objects correspond to

interacting line segments. The prior model, called “Quality Candy,” is designed to exploit as fully as possible the topological properties

of the network under consideration, while the radiometric properties of the network are modeled using a data term based on statistical

tests. Two techniques are used to compute this term: one is more accurate, the other more efficient. A calibration technique is used to

choose the model parameters. Optimization is done via simulated annealing using a Reversible Jump Markov Chain Monte Carlo

(RJMCMC) algorithm. We accelerate convergence of the algorithm by using appropriate proposal kernels. The results obtained on

satellite and aerial images are quantitatively evaluated with respect to manual extractions. A comparison with the results obtained

using a previous model, called the “Candy” model, shows the interest of adding quality coefficients with respect to interactions in the

prior density. The relevance of using an offline computation of the data potential is shown, in particular, when a proposal kernel based

on this computation is added in the RJMCMC algorithm.

Index Terms—Stochastic processes, Monte Carlo, simulated annealing, edge and feature detection, remote sensing.

1INTRODUCTION

HIS paper tackles the problem of line network extraction

from satellite or aerial images, the final application

being either the production or the updating of maps. Many

methods have been—and will be—developed to answer this

difficult problem, in particular, for road network extraction.

One possibility is to consider a semiautomatic approach:

an operator gives a starting point and a direction that

initialize a tracking algorithm [1], [2], some endpoints that

may be linked by an algorithm based on dynamic program-

ming [3], [4], or some checking points that initialize an edge

extraction algorithm based on deformable contour models

[5], on dynamic programming [6], or on profile analysis [7].

These approaches usually allow a fast and accurate extrac-

tion. Nevertheless, the productivity gain of such approaches

is weak with respect to the extraction done by an expert.

A second pos sibility is to consider a fully-automatic

approach. The extraction is then an ill-posed problem in

which it is difficult to find a good compromise between

exhaustivity and specificity. Most automatic methods pub-

lished in the literature rely on a local optimization process

based, for example, on morphological operators [8], on

operators dedicated to road extraction [9], on operators based

on texture [10], or on neural networks [11]. The major

drawback of these low-level techniques is their sensitivity

to noise, particularly for high resolution images in which a

noise inherent to the observed scene is added (for example,

trees shadows on the roads). To reduce this sensitivity to

noise, some authors propose to combine different operators

[3], [12]. Although they often provide a coarse detection, such

techniques are widely used to initialize a network reconstruc-

tion procedure. Indeed, they allow the finding of road seeds

for the initialization of semiautomatic algorithms [13], [14], to

construct a graph on which a Markov random field can be

defined [15], or to initialize a self-organizing map algorithm

[16]. These two-step approaches are, however, strongly

sensitive to the predetection. For the extraction of thick

networks from high resolution images, multiscale ap-

proaches are proposed to reduce the effect of noise while

providing an accurate extraction. In [17], the extraction of the

central axis of the roads from an image of reduced resolution

is used to initialize a snake-based algorithm for the extraction

of road edges in the original high resolution image. In [18],

hypotheses of road segment are generated using the detection

of edges at high resolution and the detection of lines at low

resolution. Then, a grouping process is performed and gaps

are bridged using a contextual information.

We aim to develop an unsupervised technique, which is

not based on a combination of processings of the image. The

framework of our study is stochastic, which allows us to

benefit from the robustness with respect to noise and from

the tools associated with this type of method: estimators,

algorithms for exploration of the state space, which are

useful if the space is large, etc. Moreover, these models

avoid the sensitivity to noise of the usual pixel oriented

approaches. We model the line network by a marked point

process, that is, a random set of objects—each described by

a point and some marks—whose number of objects is a

random variable. One such model, called the “Candy”

model, was proposed in [19] for road network extraction.

The objects of this process are segments which interact to

allow the manipulation of strong geometrical constraints.

In this paper, we develop an extension of this first model,

called the “Quality Candy” model , which uses quality

1568 IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 27, NO. 10, OCTOBER 2005

. C. Lacoste is with CREATIS, INSA, 7 rue Jean Capelle, bat. Blaise Pascal,

F-69621 Villeurbanne Cedex, France.

E-mail: caroline.lacoste@creatis.insa-lyon.fr.

. X. Descombes and J. Zerubia are with INRIA, 2004 route des Lucioles, BP

93, 06902, Sophia Antipolis Cedex, France.

E-mail: {Xavier.Descombes, Josiane.Zerubia}@inria.fr.

Manuscript received 12 Aug. 2003; revised 1 Sept. 2004; accepted 31 Dec.

2004; published online 11 Aug. 2005.

Recommended for acceptance by C. Schmid.

For information on obtaining reprints of this article, please send e-mail to:

tpami@computer.org, and reference IEEECS Log Number TPAMI-0226-0803.

0162-8828/05/$20.00 ß 2005 IEEE Published by the IEEE Computer Society

coefficients with respect to interactions between segments to

better model the curvature, the junctions, and the intersec-

tions of the network. Moreover, we construct a new data term

that incorporates data properties through statistical measures

of the local homogeneity and the contrast with the back-

ground near the line network. Two methods implementing

this data term are developed: one is more accurate, the other is

more efficient. A calibration technique is proposed for

choosing the model parameters so that the extracted line

network possesses good properties (no free segments, no

redundancy, no gaps, etc.). The optimization is done via

simulated annealing using a Reversible Jump Markov Chain

Monte Carlo (RJMCMC) algorithm [20], [21]. This algorithm

allows line network extraction without any specific initializa-

tion: the initial state can be the empty state (no segment). The

performance of such an algorithm strongly depends on the

construction of the perturbation proposal kernel. To accel-

erate its convergence, we use relevant perturbations, such as a

segment birth in the neighborhood of a segment and a

segment birth based on data. The extraction algorithm is

tested on satellite and aerial images with each prior model

and with each data term implementation. The results are

evaluated quantitatively with respect to a manual extraction.

This paper is organized as follows: First, Section 2 recalls

the key-points about spatial processes for the understanding

of marked point processes. The “Quality Candy” model is

presented in Section 3 after a brief overview of the “Candy”

model. Section 4 is devoted to the construction of the data

term. Section 5 describes the optimization algorithm.

Section 6 addresses the problem of parameter choice. Finally,

Section 7 presents the results of line network extraction.

2GENERAL FRAMEWORK

Point processes provide a rigorous framework based on

measure theory [22] to describe a scene by an unordered set

of points in a compact F  IR

. For n 2 IN , let 

be the set

of configurations fx

; ...;x

g that consist of n unordered

points of F .Apoint process on F is a mapping X from a

probability space to the set configurations  ¼

n¼0



such that, for all bounded Borel sets A  F , the number of

points N

ðAÞ falling in A is a finite random variable.

The canonical “completely random” point process is the

uniform Poisson point process. X is a uniform Poisson

process on IR

if:

1. for all bounded Borel sets A  IR

, N

ðAÞ has a

Poisson distribution with mean jAj, where >0 is

the intensity;

2. if A

; ...;A

are disjoint bounded sets, then

ðA

Þ; ...;N

ðA

Þ are independent.

This definition induces the following conditional property:

Given N

ðAÞ¼n, the n points falling in A are independently

and uniformly distributed in A. The law of a Poisson process

of intensity  on an observed frame F  IR

is thus defined by

the following probability measure on ð; BÞ:

ðBÞ¼

n¼0



jFj

ðfx

; ...;x

gÞ dx

...dx

: ð1Þ

Although, in most applications, it is not realistic to assume

points are scattered randomly, Poisson processes are useful

to build more complex models. Indeed, interactions can be

introduced by specifying a density with respect to the

reference measure . Let h be a nonnegative function on .

Then, the measure  having a density h with respect to  is

defined by:

ðBÞ¼

hðXÞðdXÞ: ð2Þ

If 0 <ðBÞ < 1, then  can be normalized to make a

probability measure  defined by: ðBÞ=ðÞ. Interaction

models are usually specified by an unnormalized Gibbs

density given by:

hðXÞ¼exp UðXÞðÞ; ð3Þ

where UðXÞ is the energy of the system. For instance, let us

assume that:

UðXÞ¼vn

ðXÞ; ð4Þ

where n

ðXÞ is the number of pairs of points in interaction.

Two points interact if the distance between these two points

is lower than a threshold D. Different types of models

derive from (4):

. If v>0, the process, called Strauss process, induces

a repulsion between points.

. If v ¼1, the process, called “hard core” process,

forbids any interaction.

. If v<0, the process induces an attraction between

points, but is not well-defined since

hðXÞ ðdXÞ¼1:

To model the observed scene by a set of objects, we can

augment a point process by adding extra information (i.e.,

object parameters) to each point. Such a process is called a

marked point process or an object process. A marked point

process on F, with marks in a space M, is a point process on

F  M such that NðA  MÞ < 1 almost surely for any

compact A  F . In this context, the uniform Poisson process

is a marked point process where points are distributed

according to a uniform Poisson point process, and marks

associated to each point are uniformly distributed in M.

3PRIOR MODELS FOR LINE NETWORK

EXTRACTION

A line network is generally characterized by several strong

constraints, among them: continuity of the network and

small curvature, especially for road network extraction. If a

line network is considered as a set of segments, these

constraints can be thought of as interactions between

segments which can either penalize or favor some particular

configurations through potentials in the process density.

For the “Candy” model, briefly presented in Section 3.2, all

these potentials are constant. To improve this model, we

propose here to introduce potential functions measuring the

quality of each interaction. Our prior model is presented in

Section 3.3.

LACOSTE ET AL.: POINT PROCESSES FOR UNSUPERVISED LINE NETWORK EXTRACTION IN REMOTE SENSING 1569

1. B is the smallest -algebra such that, for all Borel sets A  F ,the

mapping fx

; ...;x

g7!N

ðAÞ is measurable.

3.1 Reference Process

The two prior models presented in the following sections are

specified by a density with respect to a uniform Poisson

marked point process. Each object (i.e., marked point) of this

reference process is a segment s described by its midpoint

p 2 F ¼ 0;X

max

½0;Y

max

½, its length L 2 L

min

max

½, and

its orientation  2 0;½.

3.2 “Candy” Model

The “Candy” model, proposed in [23], is based on

three possible relations between segments: the connection

and two relations of bad orientation, R

(internal bad

orientation) and R

(external bad orientation).

Two segments are said to be connected if two of their

extremities are closer than a constant . Indeed, under the

Poisson process, no exact connection between pairs of

segments occurs almost surely. This relation (connection)

defines several types of segments as shown in Fig. 1. Free

segments are those which are not connected, single ones are

those with only one of their endpoints connected to other

segments, and double segments have their two endpoints

connected. In the density, free and single segments are

penalized by positive and constant potentials !

and !

order to avoid breaks in the network and false alarms. The

internal bad orientation, R

, is defined to avoid super-

position of line segments or pairs of segments crossing at

too sharp an angle. Two segments s

¼ðp

;

Þ and s

ðp

;

Þ are concerned by R

if:

. ðC

Þ: kp

 p

kmaxfL

g=2.

. ðC

Þ: 

¼ 

 =2 >

min

,where

¼ minfj





j;j

 

jg denotes the absolute difference of

orientation between s

and s

(modulo =2).

Fig. 2 illustrates this definition. A positive and constant

potential !

is assigned to pairs that verify R

The last relation R

has been introduced to control the

curvature of the line network. An influence zone ZðsÞ is

defined for each segment s ¼ðp; L; Þ as the two discs of

radius L=4 whose centers are the two endpoints of p.

Two segments s

and s

are concerned by R

if:

. ðC

Þ: either exactly one extremity of s

is in Zðs

Þ or

exactly one extremity of s

is in Zðs

Þ.

. ðC

Þ: kp

 p

k > maxfL

g=2 (i.e., s

and s

do not

hold ðC

Þ).

. ðC

Þ: 

¼ minfj

 

j;j

 

jg >

max

Fig. 3 illustrates this definition. A positive and constant

potential !

is assigned to pairs that verify R

The “Candy” model is then specified by the prior density

given by:

ðSÞ/exp  !

n þ !

þ !



; ð5Þ

where !

is a constant weight, the !

i; i¼1;...;4

are positive and

constant weights, n denotes the total number of segments in

the configuration S, n

the number of free segments, n

the

number of single segments, n

the number of pairs of

segments with respect to R

, and n

the number of pairs of

segments with respect to R

. This density specifies a well-

defined marked point process, as the Ruelle’s stability

condition [24] is verified. This condition and the stronger

condition of local stability are proven in [25].

Neverthel ess, the density takes the same value for

configurations which do not have the same quality, as we

can see in Fig. 4. Thus, we can obtain a line network with

little breaks between connected extremities and not as

smooth as possible. That is the reason why we have

introduced quality coefficients to smooth the solution in the

following model, called “Quality Candy.”

3.3 “Quality Candy” Model

Considering the “Candy” model as a good starting point,

we have chosen to keep its general structure, replacing

constant potentials of interactions by variable functions g

for different relations between segments. The prior density

is now given by the following formula:

ðSÞ/expðU

ðSÞÞ ð6Þ

with U

ðSÞ¼!

n þ !

þ !

ð7Þ

r2R

<s;s

ðs; s

Þ; ð8Þ

1570 IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 27, NO. 10, OCTOBER 2005

Fig. 1. Segment types of the “Candy” model defined with respect to R

Fig. 2. Internal bad orientation R

. fs

g and fs

g hold ðC

Þ as p

and p

are in the circle of center p

and radius L

=2. The pair fs

g is

concerned by R

, while fs

g is not.

Fig. 3. External bad orientation R

. s

, s

, and s

have an extremity in

the influence zone Zðs

Þ (two small circles). So, fs

g, fs

g, and

g hold ðC

Þ. The pair fs

g does not hold ðC

Þ; fs

g does not

hold ðC

Þ; only fs

g is concerned by R

where n, n

, and n

are defined as before, the !

are

positive (except !

which can be negative) and constant

weights, R denotes the set of possible relations between

segments, <; >

is a pair of interacting segments with

respect to the relation r, and g

ð:; :Þ is the potential function

with respect to r. One can notice that, if we take the

two relations of bad orientation of the first model and a

potential function equal to 1 in any case, we obtain the

“Candy” model. Thus, the “Quality Candy” model can be

seen as a generalization of the “Candy” model.

First of all, we have chosen to redefine the relation of

connection adding a constraint to accelerate the optimiza-

tion algorithm. From now on, two segments are said to be

connected if the corresponding angle is not sharp. For

example, in Fig. 5, s

and s

are not connected. Moreover, in

order to promote small curvature and continuity of the line

network, the quality of each connection is introduced in the

density via a potential function. This potential function,

denoted g

, is defined for each pair of connected segments

as the average of g



which depends on the

absolute difference of orientation 

(modulo =2) between

and s

and g



which depends on the distance d

between

two connected extremities of s

and s

ðs

Þ¼



ð

Þþg



ðd

with



ð

Þ¼

ð

;

max

Þ if 

<

max

2 if not





ðd

Þ¼ðd

;Þ:

ð9Þ

Here, g



gives an attractive ( negative) weight to pairs of

segments ðs

Þ whose absolute difference of orientation 

is lower than a threshold  (such as ðs

Þ in Fig. 5), and a

repulsive ( positive) one in the other case (such as ðs

in Fig. 5). g



is attractive in any case. The attractive terms of

these two functions are given by the quality function 

below:

ð:; MÞ : ½0;M!½0; 1

x 7! ðx; MÞ¼

1 þ M

1 þ x

 1



ð10Þ

This is a decreasing and positive function on ½0;M. The

quality value is thus maximal (equal to one) for a difference

of orientation or a distance equal to 0.

This new potential allows us to work without the relation

of external bad orientation R

, as the repulsive part of the

connection potential concerns the same type of interaction.

Then, we keep exactly the same relation for internal bad

orientation, but redefine the potential function. Pairs of

segments forming a too sharp angle are forbidden for

stability reasons. So, an infinite weight is given to them (i.e.,

“hard-core” potential). For the other pairs in internal bad

orientation, we use the same quality function  to define a

repulsive weight based on the difference of orientation

between the two segments. So, for each pair <s

the formula is the following:

ðs

Þ¼

1 if 

1  ð

;=2  

min

Þ if not ;



ð11Þ

where 

min

is the minimal difference with respect to the

right angle from which two segments are considered as

disoriented, and c is a positive constant close to zero that

corresponds to the minimal difference of orientation which

is allowed for two close segments (i.e., segments that hold

ðC

Þ).

Finally, (7) becomes:

ðSÞ¼!

n þ !

þ !

ðs

þ !

ðs

Þ:

ð12Þ

This model is locally stable which guarantees the ergodic

convergence o f the RJMCMC algorithm described in

Section 5.2. The stability proof is given in [26].

4DATA MODELING

4.1 Construction of the Data Term

In order to extract line networks from any remotely sensed

image, we need a realistic and robust modeling of the data.

This section describes the construction of the data term. It is

based on the following assumptions:

. H

: The gray level variation between a road and the

background is large.

. H

: The local average of the road gray level is

homogeneous.

In order to check that a segment s

is well-fitted to the

data, we consider the set of pixels V

corresponding to the

object s

in the image—which is composed of a fixed

number n

of strips according to the supposed constant

width of the line network—and two adjacent strips

corresponding to the nearby background. These two strips

are positioned at a distance d from V

in order to allow a

range of widths as illustrated in Fig. 6. Considering the

pixel values in each region x as a sample of a population

composed of n

independent values, a Student’s t-test is

LACOSTE ET AL.: POINT PROCESSES FOR UNSUPERVISED LINE NETWORK EXTRACTION IN REMOTE SENSING 1571

Fig. 4. A drawback of the “Candy” model.

Fig. 5. Different types of connection. fs

g: attractive connection.

g: not considered as a connection. fs

g: repulsive connection

with respect to the orientation.

used to determine if the averages of the two samples are

significantly different. This statistical test is adapted to

inference based on small samples as in our case. The

formula for the t-test is a ratio between the absolute value of

the difference of the two sample means and a measure of

the variability of the sample. Here is the t-test expression for

two samples x and y:

t  testðx; yÞ¼

jx  yj

ﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃ

e

; ð13Þ

where

x, e

, and n

respectively refer to the sample mean,

the sample standard deviation, and the number o f

observations corresponding to the sample x. Above some

critical value, we can consider that the two samples x and y

come from two populations with different means. The

statistical value we consider for mean difference hypothesis

is the minimum of the test value between V

and a

border region:

ðs

Þ¼ min

l2f1;2g

t  testðR



and the statistical value for homogeneity hypothesis H

the maximal t-test between two inside strips (if n

> 1):

ðs

Þ¼ max

j;k2f1;::;n

g;j6¼k

t  testðb



;

where b

is the jth inside strip. If n

equals 1, there is no

homogeneity test and we fix T

ðs

Þ at 1. Then, the statistical

value T

corresponding to s

is the ratio of these two quantities,

with the additional condition that T

ðs

Þ is lower than 1 in

order to avoid promoting excessively very homogeneous

regions:

ðs

max 1 ;T

ðs

Þ½

: ð14Þ

Moreover, we proceed to a thresholding and a conversion of

the test values from ½0; 1 to ½1; 2. These two boundaries

were chosen in order to obtain final configurations with the

properties listed in Section 4. Finally, the potential value is

the following:



2ifT

1  2

t

if t

1ifT

;

ð15Þ

where t

and t

) are two positive thresholds

emp irically chosen. The potential 

associated to the

segment s

is a dual potential: it can take attractive

(negative) values, as well as repulsive (positive) values.

The thresholds t

and t

are robust in the sense that results

are similar if we modify their values a little bit and that we

can use the same thresholds for two different images with

similar radiometric properties. Finally, the data term energy

of a configuration S is defined as the sum of the potentials

associated to the segments belonging to S. The data term is

thus given by:

ðSÞ/exp 



; ð16Þ

where 

is a positive and constant weight.

We have built a data term based on realistic and general

hypotheses to represent various types of networks. More-

over, we perform a reliable testing of these hypotheses

thanks to the use of statistical tests particularly well

adapted to small samples.

4.2 Offline Computation

Nevertheless, the online computation of these statistical

tests is time consuming, the RJMCMC algorithm having no

memory and often requiring a few millions of proposals of

new elements. We thus suggest to realize a precalculation of

the data term. We consider segments of minimal length

min

, positioned at every pixel of the image lattice, for a

fixed number of orientations N. For every considered

orientation



k; k¼1;...;N

, we assign to every pixel p of the

image lattice the value w

ðpÞ which minimizes the potential

value given by (15) as follows:

ðpÞ¼ min

¼L

min

;



;p2V



: ð17Þ

For a segment s

with orientation 

, the potential is defined

by the mean of the precomputed values (for the orientation



of the discrete space which is the closest to 

) on the set

of pixels V

corresponding to s



cardðV

p2V

ðpÞ ;k¼ arg min

j





j: ð18Þ

The expression of the data term is the same as in (6),

replacing 



ðSÞ/exp 



: ð19Þ

Let us note that this term is less precise than the previous

one because a single length of segment is considered for the

statistical tests and because of the orientation discretization.

5OPTIMIZATION

5.1 Simulated Annealing

To extract the line network from an image, we aim to find a

configuration

SS which maximizes the unnormalized process

density h given by:

hðSÞ¼h

ðSÞ h

ðSÞ; ð20Þ

where h

is the prior density and h

is the data term. This is a

nonconvex problem for which a direct optimization is not

possible given the large size of the state space that is

n¼0



, where 

is the set of configurations of n segments.

We propose to estimate this maximum by a simulated

1572 IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 27, NO. 10, OCTOBER 2005

Fig. 6. A mask corresponding to a road segment.

annealing, which consists of successive simulations of the

process distribution 

specified by the density h

1=T

, with T

gradually decreasing to 0. A proof of convergence is given in

[27] when the decrease of temperature T is logarithmic. In

practice, temperature decreases geometrically in order to

reduce the computing time.

5.2 Simulation of Spatial Point Processes

The algorithm chosen to simulate the unnormalized

measure 

is a Reversible Jump Markov Chain Monte

Carlo (RJMCMC) algorithm with a Metropolis-Hastings-

Green dynamics [20], [21]. It consists in simulating a

discrete Markov chain S

which performs small jumps

between the spaces 

. The measure of interest occurs as the

stationary measure of the chain. This iterative algorithm

does not depend on the initial state. At each step, a

transition from the current state S to a new state S

proposed according to a proposal kernel QðS ! :Þ. The

transition is accepted with a probability ðS; S

Þ given by

the Green’s ratio. This acceptance ratio is computed so that

the detailed balance condition is verified, condition under

which this algorithm converges to 

. This strong condition

is given by:



ðdSÞ P ðS; dS

Þ¼



ðdS

Þ P ðS

;dSÞ; ð21Þ

where A and B are two sets of the tribu associated to  and

P is the transition kernel of the Markov chain S

. Supposing

that 

ð:Þ QðS ! :Þ has a finite density, D, with respect to a

symmetrical measure on   , (21) is satisfied if:

ðS;S

ÞDðS; S

Þ¼ðS

;SÞDðS

;SÞ: ð22Þ

As shown in [28] for the finite space state case, it is optimal

to make the probability  as large as possible to reduce the

autocorrelation of the Markov chain. Thus, we take:

ðS;S

Þ¼min 1;RðS; S

; ð23Þ

where R is the Green’s ratio given by:

RðS;S

Þ¼

DðS

;SÞ

DðS;S

ð24Þ

and (22) is verified.

One interesting point of the Metropolis-Hastings-Green

algorithm is that the proposal kernel Q can be decomposed

into several kernels q

, each corresponding to a reversible

move, as it has been proposed in [21]. Although it is

sufficient to define the uniform birth-and-death [20] in

order to simulate a marked point process (theoretically, S

should converge to the same measure), it is important to

define relevant moves in order to accelerate the conver-

gence of the Markov chain. For example, a simple move,

such as a translation or a rotation, is more efficient than a

death followed by a birth, leading to the same result.

Furthermore, a birth-and-death within a neighborhood of

an object of the current configuration is often relevant,

especially if objects are supposed to be in interaction.

5.3 Proposal Kernels

In this section, several proposal kernels are described and

the explicit formula of the associated Green’s ratio is given.

5.3.1 Uniform Birth-and-Death (UBD)

It is the simplest proposal kernel which allows to make

small jumps between spaces of different sizes. It consists of

a uniform birth of a segment in F ½L

min

max

½0;—

proposed with a probability p

—and a uniform deat h

(inverse proposal) in the set of segments S.

In the case of the birth of a segment s

, the Green’s ratio is

given by:

RðS;S [ s

Þ¼

ðS [ s

ðSÞ

jF j

nðSÞþ1

; ð25Þ

where p

(respectively, p

¼ 1  p

) is the probability of

choosing a birth (respectively, a death) and  is the intensity

of the reference Poisson process. In the case of the death of a

segment s

, the Green’s ratio is given by:

RðS;S n s

Þ¼

ðS n s

ðSÞ

nðSÞ

jFj

: ð26Þ

5.3.2 Simple Moves

The second kind of move usually proposed is the

modification of a randomly chosen object according to a

symmetrical transformation. Let T¼fT

: a 2 Eg be a

family of symmetrical transformations parameterized by a

vector a 2 E. If the modification of an object s is done by

applying T

where a is uniformly chosen in E,

RðS; ðS n sÞ[s

Þ¼

ðS; ðS n sÞ[s

ðSÞ

: ð27Þ

For instance, T can be defined as the family of rotations

defined in ½



; 



. A rotation T



consists in changing the

orientation  of the considered segment by adding



2½



; 



:

 ð þ d



Þ½;

where ½: denotes the modulo function. In the same way,

families of translations and dilations are defined.

5.3.3 Improved Simple Moves

The proposal of a move described in the previous section is

not relevant when the considered segment is connected: the

connection often disappears with such a move. That is why

we define a new subkernel which depends on the segment

state with respect to the connection. First, we randomly

choose a segment s in the configuration. If s is free, we

uniformly choose a rotation, a translation, or a dilation,

which are described in Section 5.3.2. If s is single, we

randomly choose between three possible moves:

. Translation of the unconnected extremity e

:We

uniformly chose a vector ½dx; dy in a given square

½; ½;  and translate e

by adding the

vector ½dx; dy. If the new length is not in ½L

min

max

,

the operation is iterated.

. Translation of the connected extremity e

:We

uniformly chose a vector ½dx; dy in a square of side

lower than  and e

by adding the vector ½dx; dy.If

the new segment is not connected through e

anymore, the operation is iterated.

LACOSTE ET AL.: POINT PROCESSES FOR UNSUPERVISED LINE NETWORK EXTRACTION IN REMOTE SENSING 1573

. Translation of a connection: First, we choose a pair of

connected segments to which this segment belongs;

secon d, we uniformly choose a vector ½dx; dy in

½; ½; ; finally, we translate both con-

nected extremities by adding the vector ½dx; dy. If the

new lengths are not in ½L

min

max

, the operation is

iterated.

If s is double, we uniformly choose one of its extremities

and propose one of the two last above-mentioned moves for

a single segment. Note that if there is a change of segment

state, we have to refuse the proposal (because in this case,

the inverse move will never be proposed). The Green’s ratio

is the same as the one given by (27) for simple moves.

5.3.4 Birth-and-Death Based on Data (BDD)

Rather than uniformly proposing a new segment, it would be

relevant to use data information to propose segments that are

well-positioned more often. Following this idea, we use the

offline computation of the statistical values w

—presented in

Section4.2—todefine abirth-and-death(BD) kernel. Westore,

for every pixel p

i; i¼1;...;P

and every orientation



k; k¼1;...;N

, the

potential 

—given by (15)— corresponding to the segments

¼ð

min

;



Þ whose midpoints

are positioned in the

square ½p

 of F corresponding to p

. We then obtain, for every

orientation 

, a map C

defining an inhomogeneous (i.e.,

nonuniform) birth kernel:

ðp

Þ¼

M  

j¼1

ðM  

; ð28Þ

where the constant M is chosen so that C

defines a strictly

positive probability measure. Here, we choose M ¼ 3. The

weaker the potential associated to a pixel p

, the stronger the

probability C

ðp

Þ of proposing a segment midpoint in ½p

.

The procedure is then the following one: Firstly, the length

and the orientation of the new segment are uniformly drawn

in the mark space. Secondly, a pixel p is randomly chosen

according to the map C

corresponding to the closest

orientation of the discrete orientation space. Finally, the

segment midpoint is uniformly drawn inside ½p. Then, a

death part is defined in order to obtain a reversible move. It

consists of uniformly removing a segment from the config-

uration. Let us note that this difference between birth and

death treatments is counterbalanced by the Green’s ratio.

In the case of the birth of a segment s

, whose midpoint is

in ½p and orientation corresponds to 

, in the discrete space

the Green’s ratio is given by:

RðS; S [ s

Þ¼

ðS [ s

ðSÞ

j½pj

ðpÞðnðSÞþ1Þ

; ð29Þ

where p

and p

are, respectively, the probability of

choosing a birth and a death based on data. In the death

case, the Green’s ratio is given by:

RðS;S n s

Þ¼

ðS n s

ðSÞ

nðSÞ C

ðpÞ

jpj

: ð30Þ

5.3.5 Birth-and-Death within a Neighborhood (BDN)

To accelerate the process, it is important to make proposals

which are coherent with the model. Here, the segments are

supposed to be connected. So, proposing a birth near an

extremity of a segment in the current configuration seems to

be relevant. That is the reason why we have introduced a

BD kernel within a neighborhood with respect to the

connection. The birth part consists in uniformly choosing a

segment s

in the current configuration S and an endpoint

among the endpoints of s

which are inside F (compact

set corresponding to the data image) and then proposing a

new segment from this endpoint. The procedure to compute

this new segment is the following:

. one endpoint is uniformly generated in the ball of

center e

and radius ;

. the orientation is uniformly selected either in ½0; 2

when the connection only depends on the distance

between segment endpoint (“Candy” model), or in

½  =2;þ =2, when the connection is only

defined for segments forming a large angle (“Quality

Candy” model);

. the length is uniformly drawn in ½L

min

max

.

The death part consists in uniformly choosing a segment s

among connected segments.

The Green’s ratio for the birth of a segment s

is then the

following:

RðS; S [ s

Þ¼

ðS [ s

ðSÞ



nðSÞ

jF j n

ðS [ s

Þ n

ðS [ s

;

ð31Þ

where p

and p

are, respectively, the probabilities of

choosing a birth and a death in a neighborhood, n

ðS [ s

is the number of neighbors of s

in the configuration ðS [

Þ with respect to the connection and  is the ratio

between the Lebesgue measure of the mark space

M ¼½L

min

max

½0;, and the measure of the space

where the length and the orientation of the new segment

are drawn. Thus, when the connection only depends on

endpoint distance, this ratio is equal to 2 and, when the

connection is only defined for a large angle, this ratio is

equal to 1. For the death case, it becomes:

RðS; S n s

Þ¼

ðS n s

ðSÞ

jFj n

ðSÞ n

ðSÞ



nðS n s

: ð32Þ

6PARAMETER CHOICE

This section addresses the problem of parameter choice,

which is a crucial issue given the large number of

parameters involved in the model construction. Indeed,

we distinguish between physical or radiometric parameters

and weight parameters.

The physical parameters are used in the definition of

objects (L

min

, L

max

), in the construction of the mask used to

compute the data term (n

, d), and in the definition of

interactions (, 

min

, 

max

). The choice of the physical

parameters is relatively easy. For example, the number of

bands n

in the pixel mask directly depends on image

resolution and the type of the line network (river, road,

highway, etc.). The parameter  has to be lower than L

min

for a good definition of the connection. Moreover, it is

important not to take  too small: First, the network

flexibility is reduced for a small ; second, the network

extension is often refused for a small  as the acceptance

1574 IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 27, NO. 10, OCTOBER 2005

probability of a segment birth within a neighborhood

decreases with  (see (31)). In practice, we take  equal to

2 pixels for low resolution images ( 5 meters) and equal to

5 pixels for high resolution images ( 1 meter).

The radiometric parameters are the thresholding para-

meters t

and t

used to compute data potential (see (15)).

We choose t

and t

according to the contrast between the

line network and the nearby background.

The choice of the weight parameters that control the

importance of the various terms in the density is more

delicate. The estimation methods refer to these parameters

(the physical parameters are supposed to be known). In

complete data, the parameters of the “Candy” prior model

are estimated by stochastic gradient in [25]. This algorithm

converges to a local optimum located in the neighborhood

of the initial set of parameters. The interest of the method is

reduced as the line network is supposed to be known. In

incomplete data, the weight parameters have to be

estimated in parall el with the configuration

SS which

maximizes the density h. The most well-known estimation

algorithm is the Expectation-Maximization (EM) algorithm.

Nevertheless, the EM algorithm and its variants are not

always adapted and there is no proof of convergence

towar d the max imum likelihood estimation. Here, we

propose to use a “calibration” approach which guarantees

that the configuration

SS would have good properties. We

first define constraints on local representative configura-

tions according heuristic considerations on the network

topology. We then translate these constraints into a set of

inequalities between the weight parameters. Note that the

relations derived on the parameters are rather simple since

the energy is a linear function of parameters. Finally, this set

of inequalities provides a set of admissible parameters.

At the end of the simulated annealing, we aim to obtain a

configuration with the following properties:

1. No free segment.

2. No pair of single segments not connected to the rest

of the network.

3. No interruption in the network if the addition of a

segment allows to obtain a branch of small curvature.

4. No branches composed of several consecutive

segments all of which perform badly in the data

statistical tests.

5. No additional segment at the end of a branch which

does not fit the data or which leads t o strong

curvature.

6. No too close segments.

Let us recall that, given (12) and (16), the energy of a

configuration S can be written as:

UðSÞ¼!

n þ !

þ !

ðs

þ !

ðs

Þþ



;

ð33Þ

where !

i;i¼1;::4

and 

are positive weights. The properties

on the final configuration can be translated by the following

constraints on the weight parameters ð!

i;i¼0;::4

;

Þ:

1. The energy of a free segment s

is larger than the

energy of the empty configuration. In the worst case,



is equal to 1. Then, we get:

þ !

 

> 0: ð34Þ

2. The energy of two single segments s

and s

is larger

than the energy of the empty configuration. Con-

sidering the worst case (

, 

and g

ðs

Þ are

equal to 1), we have:

þ 2!

 !

 2

> 0: ð35Þ

3. Theenergymustdecreasebyaddingadouble

segment s

linking two single segments such that the

two associated connection potentials are negative.

The worst case is thus reached when 

is maximal

(i.e., equal to 2) and when the connection potentials

are zero. The expression of this constraint is then:

2

þ !

< 2!

: ð36Þ

But, we want the energy to be increased when the

connection potentials are equal to 1, so that:

þ 2

þ !

> 2!

: ð37Þ

4. The energy must increase by adding two double

segments which do not fit the data (i.e., data potential

equals to 2) linking two single segments (worst case:

all connection potentials are equal to 1):

 3!

þ 4

þ 2!

> 2!

: ð38Þ

5. Theenergymustdecreasebyaddingasingle

segment s

to another single segment s

if g

ð:Þ <¼

0 and 

<¼ 0:

< 0: ð39Þ

On the contrary, it must increase if 

¼ 2

 !

þ 2

> 0: ð40Þ

6. The last property (no too close segments) is verified

as soon as !

> 0 due to the “hard-core” imposed on

the relation of bad orientation.

Taking 

¼ x!

, these equations lead to the parameter

tuning rule given in Table 1, where a, b, and c are positive

constants. One example is given with the additional con-

straint that the potential of a free segment is larger (or equal)

than the potential of two single segments in order to widely

accept the birth from a free segment at the beginning of the

algorithm, the initial state being the empty configuration.

7RESULTS

7.1 Quantitative Evaluation with Respect to a

Reference Network

As we aim to compare different models and algorithms, we

need to find criteria of performance that can be obtained by

conducting a quantitative evaluation of the detected line

networks with respect to a reference line network. For each

image for which a reference network is given, segments

provided by our algorithm are matched with the branches

of the reference (supplied in the form of broken lines by a

specialist). A segment is matched with the reference if each

of its extremities is at a distance from the reference network

lower than a threshold fixed by the user (we can take, for

example, a threshold equal to ten pixels). This matching is

illustrated by Fig. 7. It allows the computation of L

and L

the lengths of omitted and overdetected sections, and A, the

LACOSTE ET AL.: POINT PROCESSES FOR UNSUPERVISED LINE NETWORK EXTRACTION IN REMOTE SENSING 1575

area included between these sections which have been

matched. The three criteria for the evaluation of the

extracted network are:

. F ¼ 100

ref

, the ratio of false alarms with respect to

the reference length L

ref

;

. 0 ¼ 100

ref

, the ratio of omissions with respect to the

reference length;

. D ¼

ref

L

, a geometrical error corresponding to the

distance between matched sections, equal to the area

A included between these sections divided by the

length of the reference sections which have been

matched.

7.2 Comparison of the Prior Models

The performance of the “Candy” model in simulating and

detecting line network was reported in [19]: the results are

encouraging, with relatively few false alarms and omissions

for an automatic method. Nevertheless, the line network is

not as smooth as possible and undesirable interruptions are

observed. Our first o bjective was to determine if the

incorporation of quality measures in the density could

improve the results.

First, Fig. 8 provides one sample from each prior model

obtained using the RJMCMC algorithm described in

Section 5. The initial state is the empty configuration. The

chosen proposition kernel is composed by two equiprobable

kernels: uniform birth-and-death (UBD) and birth-and-

death within a neighborhood (BDN). The sample obtained

for the “Candy” model seems good in the sense that all

segments are connected and form long broken lines.

Nevertheless, the drawback of this model, mentioned in

Section 3.2, is confirmed: We observe small breaks and

points of strong curvature. The consequence of this draw-

back is that we obtain a line network which is not as smooth

as a real road network. The “Quality Candy” model seems

to be more appropriate for road detection as the sample

obtained is clearly smoother than the one obtained with the

“Candy” model.

Second, Fig. 9 provides results of road network extraction

from a satellite SPOT image (Panchro, size: 256  256 pixels,

resolution: 10 m) with the two prior models. The extraction

algorithm is a simulated annealing using an RJMCMC

algorithm whose proposal kernel Q

is composed of

three equiprobable kernels: UBD, BDN, and improved

simple moves. The data term used here corresponds to an

online computation: The data term is given by (16) with t

4 and t

¼ 8. Two rates of temperature decrease are tested.

Globally, the results present less than 10 percent of false

alarms. The geometrical error with respect to the manual

extraction is of subpixel magnitude. Nevertheless, the

percentage of omissions is high: more than 30 percent. These

omissions co rrespond to dirt tracks, which are not as

rectilinear and clearly contrasted with respect to the back-

ground as the main roads. For the “Candy” model, if the

temperature decrease is too fast, the main roads are not

completely detected, as shown in Fig. 9d. Indeed, there is a

critical temperature for which it is important to have a slow

decrease in order to reach the global maximum of the

density. That is why the main roads are well detected in

Fig. 9c, obtained with a slower temperature decrease. This

points out that the algorithm result is very dependent on the

temperature decrease. The “Quality Candy” model provides

a smoother line network. Moreover, there are less omissions

for the same temperature decrease. It appears to be not as

sensitive to the temperature decrease as the “Candy” model.

This implies a possible significant reduction of the comput-

ing time.

7.3 Online Computation of the Data Potential

This section presents extraction results using the “Quality

Candy” model with an online computation of the data term.

As in Section 7.2, the extraction algorithm is a simulated

1576 IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 27, NO. 10, OCTOBER 2005

TABLE 1

Parameter Tuning Following the Constraints on the Final Configuration

Fig. 7. Matching between detected and reference line network.

Fig. 8. Samples of the prior models. (a) “Candy” model and (b) “Quality

Candy” model.

annealing using an RJMCMC algorithm with the proposal

kernel Q

(UBD, BDN, and improved simple moves).

First, Fig. 10 gives a result of road extraction from a high

reso lution aerial image provided by the IGN (French

Mapping Institute). The task is not straightforward: Some

trees interfere with good detection of road sections, some

fields have nearly the same grey level as the roads, and field

textures might produce a significant answer to the statistical

tests used to compute the data term. The “Quality Candy”

model is well-adapted to this extraction problem. With an

online computation of the data potential, we obtain about

4 percent of false alarms and 16 percent of omissions, and

the geometrical error D is lower than one pixel. There are no

breaks except in the road crossing where the data potential

is positive. The omissions correspond in majority to

secondary roads which have not been detected due to their

small size and higher curvature than the ones observed for

the main roads.

Second, Fig. 11 presents a result of river extraction from a

satellite image of Guinea provided by the BRGM (French

Geological Survey). Despite the low contrast of the image,

the extraction algorithm provides a line network with few

omissions and overdet ections with respect to reference

network provided by the BRGM. Nevertheless, the detection

is not very accurate due to the sinuosity of this network.

7.4 Offline Computation of the Data Potential

In this section, we evaluate the improvement due to the use

of an offline computation of the data potential. We have

tested two proposal kernels: the proposal kernel Q

(UBD,

BDN, improved simple moves) and the kernel Q

composed

of three equiprobable kernels: birth-and-death based on

data (BDD), BDN, and improved simple moves.

For the aerial image given in Fig. 10a, the use of an

offline computation of the data potential induces a small

loss of quality (see Table 2). Nevertheless, Table 2 shows

that the use of an offline computation is very efficient in

terms of computing time. Moreover, it shows that it is

relevant to use the BDD kernel as the results obtained using

and Q

(for the same model) are of equivalent quality.

For the satellite image given in Fig. 11a, the results

obtained with the two methods are of equivalent quality but

the computing time is much lower when we use an offline

computation (see Table 2). This can be explained by the fact

that the average length of a segment is very close to the

minimal length used for the precalculation.

Globally, these results show the relevance of using a

kernel based on data: We obtain nearly the same result with

as with Q

and the gain in computing time is important

(see Table 2).

8CONCLUSION AND FUTURE WORK

We have proposed in this paper a method for performing

unsupervised line network extraction from remotely sensed

images with medium or high resolution. The “Quality

Candy” prior model is particularly suited to the extraction

of road networks. Indeed, the use of quality coefficients for

the connection relation leads to a continuous line network

with a small curvature. It can be adapted in an encouraging

way to the case of networks which are more sinuous such as

riverine forests. The optimization was performed using

simulated annealing with an RJMCMC algorithm, which is

based on a composed proposal kernel built in order to

accelerate the convergence. The results have shown the

relevance of using an offline computation of the data

potential, in particular, when a proposal kernel based on

this computation is added in the RJMCMC algorithm.

Nevertheless, a small loss in quality can be observed when

an offline computation is performed.

The proposed stochastic model allows us to perform data

fusion in order to benefit from the contribution of several

sources (for instance, multisensor or multitemporal data;

some p reliminary work has been con ducted with radar

images). Moreover, this model could be extended to more

complex objects, such as broken lines, which would adapt

LACOSTE ET AL.: POINT PROCESSES FOR UNSUPERVISED LINE NETWORK EXTRACTION IN REMOTE SENSING 1577

Fig. 9. Road network extraction from a SPOT image for each prior model

and the same data term: t

¼ 4, t

¼ 8. (a) Data (256  256 pixels).

(b) Manual extraction. (c) Candy. (d) Candy. (e) Quality Candy.

(f) Quality Candy.

themselves more easily to sinuous networks (this work is

currently in progress). Regarding optimization, as this study

has shown that the optimization algorithm has an influence

not only on the computing time but also on the final result, we

will focus in a near future on the improvement of the

simulated annealing scheme by using an adaptive rule of

temperature decrease or using parallel simulated annealing.

Finally, work remains to be done to provide a product for

updating and creating maps. To make it easier to exploit the

extraction results, it is essential to develop tools for the

evaluation of the extracted network, based on data and

assumptions on the network topology and geometry. Evalua-

tion criteria could be used to indicate the ambiguous parts of

the detected line network to a user. Moreover, postprocessing

based on these criteria could be relevant.

ACKNOWLEDGMENTS

The authors would like to thank the French Mapping

Institute (IGN) for providing an aerial image, the French

Geological Survey (BRGM) for providing data shown in

Fig. 11 and for partial financial support, as well as Nicolas

Baghdadi from the BRGM for several interesting discussions.

This work was conducted while C. Lacoste was a PhD

student with the Ariana research group.

REFERENCES

[1] G. Vosselman and J. de Knecht, “Road Tracing by Profile

Matching and Kalman Filtering,” Automatic Extraction of Man-

Made Objects from Aerial and Space Images, pp. 265-274, Apr. 1995.

[2] D. Geman and B. Jedynak, “An Active Testing Model for Tracking

Roads in Satellite Images,” IEEE Trans. Pattern Analysis and

Machine Intelligence, vol. 18, pp. 1-14, 1996.

[3] M.A. Fischler, J.M. Tenenbaum, and H.C. Wolf, “Detection of

Roads and Linear Structures in Low-Resolution Aerial Imagery

Using a Multisource Knowledge Integration Technique,” Compu-

ter Graphics and Image Processing, vol. 15, pp. 201-223, 1981.

[4] N. Merlet and J. Zerubia, “New Prospects in Line Detection by

Dynamic Programming,” IEEE Trans. Pattern Analysis and Machine

Intelligence, vol. 18, no. 4, pp. 426-431, Apr. 1996.

[5] W.M. Neuenschwander, P. Fua, L. Iverson, G. Sze

kely, and O.

Kubler, “Ziplock Snakes,” Int’l J. Computer Vision, vol. 25, no. 3,

pp. 191-201, 1997.

1578 IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 27, NO. 10, OCTOBER 2005

Fig. 10. Road network extraction from an aerial image (resolution: 50 cm) with an online computation of the data potential. (a) Data (892  652 pixels).

(b) Manual extraction. (c) Extracted line network.

Fig. 11. Hydrographic network extraction from a satellite image (SPOT XS2) with the parameter values given in Table 1 and with t

¼ 3 and t

¼ 7

and with an online computation of the data potential. (a) Data (682  674 pixels). (b) Manual extraction. (c) Extracted line network.

TABLE 2

Quantitative Comparison between the Online (D

) and Offline (D

) Computations of the Data Potential

[6] A. Gru

n and H. Li, “Road Extraction from Aerial and Satellite

Images by Dynamic Programming,” ISPRS J. Photogrammetry and

Remote Sensing, vol. 50, no. 4, pp. 11-20, Aug. 1995.

[7] I. Couloigner and T. Ranchin, “Mapping of Urban Areas: A

Multiresolution Modeling Approach for Semi-Automatic Extrac-

tion of Streets,” Photogrammetric Eng. and Remote Sensing, vol. 66,

no. 7, pp. 867-874, July 2000.

[8] M.A. Serendero, “Extrac tion D’informations Symboli ques en

Imagerie SPOT: Re

seaux de Communication et Agglome

rations,”

PhD Thesis (in French), Univ. of Nice—Sophia Antipolis, 1989.

[9] R. Duda and P. Hart, Pattern Classification and Scene Analysis. New

York: John Wiley & Sons, 1973.

[10] D. Haverkamp, “Extracting Straight Road Structure in Urban

Environments Using IKONOS Satellite Imagery,” Optical Eng.,

vol. 41, no. 9, pp. 2107-2110, Sept. 2002.

[11] U. Bhattacharya and S. Parui, “An Improved Backpropagation

Neural Network for Detection of Road-Like Features in Satellite

Imagery,” Int’l J. Remote Sensing, vol. 18, pp. 3379-3394, Apr. 1997.

[12] D. Wang, D. He, L. Wang, and D. Morin, “Extraction du Re

seau

Routier Urbain a

l’Aide d’Images SPOT HRV,” Int’l J. Remote

Sensing, vol. 17, no. 4, pp. 827-833, 1996.

[13] A. Zlotnick and P. Carnine, “Finding Road Seeds in Aerial

Images,” Computer Vision, Graphics, and Image Processing, vol. 57,

pp. 243-260, 1993.

[14] M. Barzohar and D.B. Cooper, “Automatic Finding of Main Roads

in Aerial Images by Using Geometric-Stochastic Models and

Estimation,” IEEE Trans. Pattern Analysis and Machine Intelligence,

vol. 18, no. 7, pp. 707-721, July 1996.

[15] F. Tupin, H. Maitre, J.-F. Mangin, J.-M. Nicolas, and E. Pechersky,

“Detection of Linear Features in SAR Images: Application to Road

Network Extraction,” IEEE Trans. Geoscience and Remote Sensing,

vol. 36, no. 2, pp. 434-453, 1998.

[16] P. Doucette, P. Agouris, A. Stefanidis, and M. Musavi, “Self-

Organized Clustering for Road Extraction in Classified Imagery,”

ISPRS J. Photogrammetry and Remote Sensing, vol. 55, pp. 347-358,

2001.

[17] I. Laptev, T. Lindeberg, W. Eckstein, C. Steger, and A. Baumgart-

ner, “Automatic Extraction of Roads from Aerial Images Based on

Scale Space and Snakes,” Machine Vision and Applications, vol. 12,

pp. 23-31, 2000.

[18] A. Baumgartner, C. Steger, H. Mayer, W. Eckstein, and H. Ebner,

“Automatic Road Extraction Based on Multi-Scale, Grouping, and

Context,” Photogrammetric Eng. and Remote Sensing, vol. 65, no. 7,

pp. 777-785, July 1999.

[19] R. Stoica, X. Descombes, and J. Zerubia, “A Gibbs Point Process

for Road Extraction in Remotely Sensed Images,” Int’l J. Computer

Vision, vol. 57, no. 2, pp. 121-136, 2004.

[20] C.J. Geyer and J. Møller, “Simulation and Likelihood Inference for

Spatial Point Process,” Scandinavian J. Statistics, Series B, vol. 21,

pp. 359-373, 1994.

[21] P. Green, “Reversible Jump Markov Chain Monte-Carlo Compu-

tation and Bayesian Model Determination,” Biometrika, vol. 57,

pp. 97-109, 1995.

[22] M. van Lieshout, Markov Point Processes and Their Applications.

Imperial College Press, 2000.

[23] R. Stoica, “Processus Ponctuels pour L’extraction des Re

seaux

Line

iques dans les Images Satellitaires et Ae

riennes,” PhD Thesis

(in French), University of Nice—Sophia Antipolis, Feb. 2001.

[24] D. Ruelle, “Superstable Interactions in Classical Statistical

Mechanics,” Comm. Math. Physics, vol. 18, pp. 127-159, 1970.

[25] M. van Lieshout and R. Stoica, “The Candy Model Revisited:

Markov Properties and Inference,” Research Report PNA-R0115,

CWI, Amsterdam, The Netherlands, 2001.

[26] C. Lacoste, X. Descombes, and J. Zerubia, “A Comparative Study

of Point Processes for Line Network Extraction in Remote

Sensing,” Research Report 4516, INRIA, Sophia Antipolis, France,

July 2002.

[27] M. van Lieshout, “Stochastic Annealing for Nearest-Neighbour

Point Processes with Application to Object Recognition,” Research

Report BS-R9306, CWI, Amsterdam, The Netherlands, 1993.

[28] P.H. Peskun, “Optimum Monte Carlo Sampling Using Markov

Chains,” Biometrika, vol. 60, pp. 607-612, 1973.

Caroline Lacoste received t he engineering

degree in mathemati cal modeling an d the

master of science degree in applied mathe-

matics from the National Institute of Applied

Sciences (INSA) of Toulouse in 2001. She

received the PhD degree in signal and image

processing in 2004 from the University of

Nice—Sophia Antipolis. In 2004, she joined the

INSA of Lyon where she is a teaching fellow at

the “first cycle” department and a research fellow

at the CREATIS laboratory (CNRS/INSERM/INSA/UCBL). Her research

interests include inverse problems, image pr ocessing, stochastic

geometry, and Monte Carlo methods.

Xavier Descombes received the bach elor’s

degree in telecommunications from the Ecole

Nationale Superieure des Telecommunications

(ENST) Paris, France, in 1989, the master of

science in mathematics from the University of

Paris VI in 1990, the PhD degree in signal and

image processing from the ENST in 1993, and

the “habilitation” in 2004 from the University of

Nice—Sophia Antipolis. He has been a posdoc-

toral researcher at ENST in 1994, at the

Katoliek e Universitat Leuven in 1995, at the Institut National de

Recherche en Informatique et en Automatique (INRIA) in 1996, and a

visiting scientist in the Max Planck Institute of Leipzig in 1997. He is

currently a permanent researcher at INRIA. His research interests

include Markov Random Fields, stochastic geometry, and stochastic

modeling in image processing.

Josiane Zerubia received the MSc degree from

the Department of Electrical Engineering at

ENSIEG, Grenoble, France in 1981, the doctor

engineer degree in 1986, a PhD in 1988, and an

“Habilitation” in 1994, all from the University of

Nice—Sophia Antipolis, France. She has been a

permanent research scientist at INRIA since

1989 and the director of research since July

1995. She was head of a remote sensin g

laboratory (PASTIS, INRIA Sophia Antipolis)

from mid-1995 to 1997. Since January 1998, she has been in charge

of a new research group working on remote sensing (ARIANA, INRIA-

CNRS-University of Nice). She has been adjunct professor at Sup’Aero

(ENSAE) in Toulouse since 1999. Before, she was with the Signal and

Image Processing Institute of the University of Southern California

(USC) in Los Angeles as a postdoctoral researcher. She also worked as

a researcher for the LASSY (University of Nice and CNRS) from 1984 to

1988 and in the research lab of Hewlett Packard in France and in Palo

Alto, California from 1982 to 1984. She was part of the IEEE IMDSP

Technical Committee (SP Society) from 1997 to 2003 and associate

editor of the IEEE Transactions on Image Processing from 1998 to

2002. She has been member-at-large of the board of governors of the

IEEE SP Society since 2002, area editor of IEEE Transactions on Image

Processing since 2003 and, guest coeditor of a special issue of IEEE

Transactions on Pattern Analysis and Machine Intelligence in 2003. She

has also been a member of the editorial board of the French Society for

Photogrammetry and Remote Sensing (SFPT) since 1998. She has

been cochair of two workshops on Energy Minimization Methods in

Computer Vision and Pattern Recognition (EMMCVPR ’01, Sophia

Antipolis, France, and EMMCVPR ’03, Lisbon, Portugal), cochair of a

workshop on Image Processing and Related Mathematical Fields (IPRM

’02, Moscow, Russia), and chair of a workshop on Photogrammetry and

Remote Sensing for Urban Areas, Marne La Vallee, France, 2003. Her

current research interest is image processing using probabilistic models

or variational methods. She also works on parameter estimation and

optimization techniques. She is a fellow of the IEEE.

. For more information on this or any other computing topic,

please visit our Digital Library at www.computer.org/publications/dlib.

LACOSTE ET AL.: POINT PROCESSES FOR UNSUPERVISED LINE NETWORK EXTRACTION IN REMOTE SENSING 1579

Learning Point Processes and Convolutional Neural Networks for Object Detection in Satellite Images

Article

Full-text available

Mar 2024

Convolutional neural networks (CNN) have shown great results for object-detection tasks by learning texture and pattern-extraction filters. However, object-level interactions are harder to grasp without increasing the complexity of the architectures. On the other hand, Point Process models propose to solve the detection of the configuration of objects as a whole, allowing the factoring in of the image data and the objects’ prior interactions. In this paper, we propose combining the information extracted by a CNN with priors on objects within a Markov Marked Point Process framework. We also propose a method to learn the parameters of this Energy-Based Model. We apply this model to the detection of small vehicles in optical satellite imagery, where the image information needs to be complemented with object interaction priors because of noise and small object sizes.

Accurate Extraction of Ground Objects from Remote Sensing Image Based on Mark Clustering Point Process

Article

Full-text available

Jul 2022
ISPRS

The geometric features of ground objects can reflect the shape, contour, length, width, and pixel distribution of ground objects and have important applications in the process of object detection and recognition. However, the geometric features of objects usually present irregular geometric shapes. In order to fit the irregular geometry accurately, this paper proposes the mark clustering point process. Firstly, the random points in the parent process are used to determine the location of the ground object, and the irregular graph constructed by the clustering points in the sub-process is used as the identification to fit the geometry of the ground object. Secondly, assuming that the spectral measurement values of ground objects obey the independent and unified multivalued Gaussian distribution, the spectral measurement model of remote sensing image data is constructed. Then, the geometric extraction model of the ground object is constructed under the framework of Bayesian theory and combined with the reversible jump Markov chain Monte Carlo (RJMCMC) algorithm to simulate the posterior distribution and estimate the parameters. Finally, the optimal object extraction model is solved according to the maximum a posteriori (MAP) probability criterion. This paper experiments on color remote sensing images. The experimental results show that the proposed method can not only determine the position of the object but also fit the geometric features of the object accurately.

Clustering Point Process Based Network Topology Structure Constrained Urban Road Extraction From Remote Sensing Images

Article

Full-text available

Jan 2022

To extract complicated road network from remote sensing images on urban scenes, this article presents a clustering point process (CPP) based network topology structure constrained road extraction algorithm. Firstly, the CPP is constructed to model the feature points, such as endpoints, bends, and crossroads in a road system. Based on that, an initial network topology structure is constructed by connecting the points with lines. Then, according to the network structure characteristic and the spectral characteristic of road, a network topology structure constraining model and a spectral measurement constraining model are constructed, respectively. By combining the models above, a road extraction model is built under the framework of Bayes’ theorem. Finally, to simulate from the road extraction model and extract an optimal road network, a solution strategy, reversible jump Markov Chain Monte Carlo (RJMCMC) simulation algorithm with related transfer operations, is designed according to the CPP and network topology structure. Several high-resolution remote sensing images on urban scenes are tested. According to a buffer evaluation method, and compared with the comparing algorithms, accuracy and extraction rate of results from the proposed algorithm are increased by 10.86% and 8.75% on average, respectively. It is proved that the proposed algorithm can extract the complicated road network effectively.

AUTOMATIC DETECTION OF ROAD EDGES FROM AERIAL LASER SCANNING DATA

Article

Full-text available

Jun 2019

When aerial laser scanning (ALS) is deployed with targeted flight path planning, urban scenes can be captured in points clouds with both high vertical and horizontal densities to support a new generation of urban analysis and applications. As an example, this paper proposes a hierarchical method to automatically extract data points describing road edges, which are then used for reconstructing road edges and identifying accessible passage areas. The proposed approach is a cell-based method consisting of 3 main steps: (1) filtering rough ground points, (2) extracting cells containing data points of the road curb, and (3) eliminating incorrect road curb segments. The method was tested on a pair of 100 m × 100 m tiles of ALS data of Dublin Ireland’s city center with a horizontal point density of about 325 points/m2. Results showed the data points of the road edges to be extracted properly for locations appearing as the road edges with the average distance errors of 0.07 m and the ratio between the extracted road edges and the ground truth by 73.2%.

Learning point process models for vehicles detection using CNNs in satellite images

Conference Paper

Nov 2023

CNN-Based Energy Learning for MPP Object Detection in Satellite Images

Conference Paper

Aug 2022

Point process and CNN for small object detection in satellite images

Conference Paper

Oct 2022

Generating impact maps from bomb craters automatically detected in aerial wartime images using marked point processes

Article

Jun 2022

Even more than 75 years after the Second World War, numerous unexploded bombs (duds) linger in the ground and pose a considerable hazard to society. The areas containing these duds are documented in so-called impact maps, which are based on locations of exploded bombs; these locations can be found in aerial images taken shortly after bombing. To generate impact maps, in this paper we present a novel approach based on marked point processes (MPPs) for the automatic detection of bomb craters in such images, some of which are overlapping. The object model for the craters is represented by circles and is embedded in the MPP-framework. By means of stochastic sampling, the most likely configuration of objects within the scene is determined. Each configuration is evaluated using an energy function that describes the consistency with a predefined object model. High gradient magnitudes along the object borders and homogeneous grey values inside the objects are favoured, while overlaps between objects are penalized. Reversible Jump Markov Chain Monte Carlo (RJMCMC) sampling, in combination with simulated annealing, provides the global optimum of the energy function. Our procedure allows the combination of individual detection results covering the same location. Afterwards, a probability map for duds is generated from the detections via kernel density estimation and areas around the detections are classified as contaminated, resulting in an impact map. Our results, based on 74 aerial wartime images taken over different areas in Central Europe, show the potential of the method; among other findings, a clear improvement is achieved by using redundant image information. We also compared the MPP method for bomb crater detection with a state-of-of-the-art convolutional neural network (CNN) for generating region proposals; it turned out that the CNN outperforms the MPPs if a sufficient amount of representative training data is available and a threshold for a region to be considered as crater is properly tuned prior to running the experiments. If this is not the case, the MPP approach achieves better results.

Automatic Road Extraction from Remote Sensing Images Based on Rectangle Marked Point Process

Article

Jan 2021

Aiming at solving inaccurate and incomplete extraction of road in remote sensing images, this paper proposes an automatic extraction algorithm based on Rectangle Marked Point Process (RMPP). First, the RMPP is designed to model the road surface, which aims to obtain the centerline and width of road correctly. Secondly, under the framework of Bayesian theory, the proposed road extraction model is built by combining network reconstruction model and spectral measurement model. The former is to constrain the relationships between rectangles according to the structure characteristic of the road. The latter is to constrain the consistency between rectangle and road body in image. Also, to obtain optimal results, related transfer kernels of RJMCMC (Reversible Jump Markov Chain Monte Carlo) based simulation algorithm are designed accordingly. Finally, testing of the proposed method and comparing methods are carried out with different remote sensing datasets. Experimental results from the proposed algorithm show that the completeness, correctness and quality can reach 98%, 94% and 92%, respectively. Compared with the results from the comparing method qualitatively and quantitatively, it can be verified that the proposed method can not only extract the high-quality road networks from different datasets but also can obtain the width of the road simultaneously.

Aide à la décision pour le remplacement valvulaire aortique percutané

Thesis

Oct 2019

Vincent Auffret

La sténose aortique serrée est la valvulopathie acquise de l’adulte la plus fréquente affectant jusqu’à 10% des octogénaires. Sa prise en charge percutanée est en plein essor et confronte les cliniciens à des problèmes nouveaux constituant un champ de recherche important. Notre travail s’inscrit dans le cadre des gestes médico-chirurgicaux assistés par ordinateur et vise à proposer des solutions d’aide à la décision basées sur l’assistance informatique. Cette Thèse est ainsi composée de 4 parties. Une première partie porte sur la problématique médicale dans laquelle s’inscrit le remplacement valvulaire aortique percutané (TAVI) ainsi que le contexte du TAVI en France et présente un article évaluant l’évolution des caractéristiques des patients et des procédures à l’échelle nationale entre 2010 et 2015 dans les registres nationaux FRANCE 2 et FRANCE TAVI. Ce premier chapitre identifie des problématiques médicales auxquelles les opérateurs sont confrontés au quotidien notamment la sélection optimale des candidats et la minimisation des complications de la procédure dans le contexte d’une réduction du profil de risque des patients traités. La seconde partie s’intéresse à l’étude de populations, par des méthodes statistiques classiques, pour établir des facteurs prédictifs de résultats du TAVI ou de survenue d’une complication donnée afin d’aider le clinicien dans sa planification de la procédure. Cette partie est articulée autour de 3 articles portant sur les facteurs prédictifs d’accidents vasculaires cérébraux, les troubles conductifs et les facteurs prédictifs de « mauvais résultats » après TAVI. Nous démontrons l’intérêt de ce type d’analyse qui resteront nécessaires à l’avenir mais abordons également leurs limites qui expliquent pourquoi d’autres pistes doivent être explorées pour stocker, trier, restituer les informations pertinentes à l’opérateur voire les augmenter pour faciliter ses décisions notamment en préopératoire.L’objet de la 3ème partie est d’aborder un système d’aide à la décision par ordinateur de type « case-based reasoning » (CBR) qui pourrait tirer bénéfice de l’identification de ces facteurs pronostiques et à terme les intégrer dans une interface globale et ergonomique d’aide à la décision. Nous avons travaillé dans le cadre du projet européen H2020 EurValve sur l’élaboration d’un CBR dont la problématique se concentre pour l’instant sur le choix optimal de la voie d’abord, du type et de la taille de prothèse. Notre travail s’est concentré sur une étape analytique de la conception de ce type de système portant sur l’étude et l’amélioration de la mesure de similarité utilisée pour rapprocher le cas à traiter (problème) de ses plus proches voisins (cas déjà traités et leur « solution » thérapeutique). Enfin, une dernière partie porte sur l’augmentation des informations disponibles pour l’aide à la décision en préopératoire par la simulation numérique spécifique patient. Après un état de l’art des méthodes utilisées dans le domaine du TAVI, nous avons travaillé à l’élaboration et au paramétrage d’un modèle de simulation de l’insertion du guide rigide dans le ventricule gauche (une étape de la procédure qui peut conditionner le positionnement de la prothèse et donc le résultat final). Afin de réaliser une première validation de cette simulation exploitant l’imagerie tomodensitométrique 3D préopératoire, l’approche proposée repose sur l’extraction de la région d’intérêt dans le volume 3D (segmentation) et sa mise en correspondance avec l’imagerie fluoroscopique 2D peropératoire par le biais d’un recalage 3D/2D. Nos travaux sur ces méthodes de traitement de l’image nécessaires à la mise en œuvre et la validation de notre stratégie de simulation sont discutés dans cette partie. Enfin nous présentons une application clinique potentielle du modèle de simulation portant sur l’influence de la forme du guide et de ses conditions d’insertion sur sa stabilité et les forces de pression s’exerçant sur le ventricule gauche

Automatic road extraction based on multi-scale, grouping, and context

Article

Full-text available

Jul 1999
PHOTOGRAMM ENG REM S

An approach for the automatic extraction of roads from digital aerial imagery is proposed. It makes use of several versions of the same aerial image with different resolutions. Roads are modeled as a network of intersections and links between these intersections, and are found by a grouping process. The context of roads is hierarchically structured into a global and a local level. The automatic segmentation of the aerial image into different global contexts, i.e., rural, forest, and urban area, is used to focus the extraction to the most promising regions. For the actual extraction of the roads, edges are extracted in the original high resolution image (0.2 to 0.5 m) and lines are extracted in an image of reduced resolution. Using both resolution levels and explicit knowledge about roads, hypotheses for road segments are generated. They are grouped iteratively into larger segments. In addition to the grouping algorithms, knowledge about the local context, e.g., shadows cast by a tree onto a road segment, is used to bridge gaps. To construct the road network, finally intersections are extracted. Examples and results of an evaluation based on manually plotted reference data are given, indicating the potential of the approach.

Automatic Extraction of Man-Made Objects from Aerial and Space Images

Chapter

Full-text available

Jan 1995

Road tracing is a promising technique to increase the efficiency of road mapping. In this paper a new road tracing algorithm is presented. Road positions are computed by matching the average grey value profile of a reference road segment with profiles taken from the image. The road parameters are estimated by the recursive Kaiman filter. By utilizing the prediction step of the Kaiman filter the road tracer is able to continue following the road despite temporary failures of the profile matching that are due to road crossings, exits and cars.

The Candy model revisited: properties and inference

Article

Jan 2003

Simulation procedures and likelihood inference for spatial point processes

Article

Jan 1994

Reversible jump Markov chain Monte Carlo computation and Bayesian model determination (with Discussion

Article

Jan 1997

Stochastic Annealing for Nearest-Neighbour Point Processes with Application to Object Recognition

Article

Jun 1994

Lieshout van MNM

We study convergence in total variation of non-stationary Markov chains in continuous time and apply the results to the image analysis problem of object recognition. The input is a grey-scale or binary image and the desired output is a graphical pattern in continuous space, such as a list of geometric objects or a line drawing. The natural prior models are Markov point processes found in stochastic geometry. We construct well- defined spatial birth-and-death processes that converge weakly to the posterior distribution. A simulated annealing algorithm involving a sequence of spatial birth-and-death processes is developed and shown to converge in total variation to a uniform distribution on the set of posterior mode solutions. The method is demonstrated on a tame example.

An Improved backpropagation neural network for detection of road-like features in satellite imagery

Article

Nov 1997

This paper presents an application of backpropagation neural network for the detection of linear structures in remote-sensing images. The purpose of the approach is two-fold. First, to exploit the advantages of a neural network classifier over the tranditional ones. Second, to avoid the strategic phases of enhancement and thresholding. Once the network is learnt, the classification scheme is real-time. Two critical issues in the present approach are the selection of the network architecture and the rate of convergence of learning. Solutions to these two problems are proposed. Experimental results on IRS and SPOT images are presented. Satisfactory classification results have been obtained using the network.

L'extraction du réseau routier urbain à partir d'images SPOT HRV

Article

Mar 1996

The extraction of urban road networks from satellite sensor images with a fine spatial resolution is difficult because of the disturbances generated by buildings and parking lots, the existence of intersections and sharp curves, and the diversity of road widths. The present study proposes a procedure for this task and the result extracted from a SPOT HRV panchromatic image. The proposed method can adequately detect not only straight roads of various widths but also sharp curved roads and intersections.

Pattern Classification and Scene Analysis

Article

Jan 1973

Simulation and likelihood inference for spatial point processes

Article

C. J. Geyer

Point processes for unsupervised line network extraction in remote sensing

Abstract and Figures

Recommended publications

A Marked Point Process of Rectangles and Segments for Automatic Analysis of Digital Elevation Models

Road network extraction in remote sensing by a Markov object process

A Comparative Study of Point Processes for Line Network Extraction in Remote Sensing

A Polyline Process for Unsupervised Line Network Extraction in Remote Sensing