Conference PaperPDF Available

Range-based Obstructed Nearest Neighbor Queries

June 2016

June 2016

DOI:10.1145/2882903.2915234

Conference: the 2016 International Conference

Authors:

Huaijie Zhu

Sun Yat-Sen University

Xiaochun Yang

Northeastern University (Shenyang, China)

In this paper, we study a novel variant of obstructed nearest neighbor queries, namely, range-based obstructed nearest neighbor (RONN) search. A natural generalization of continuous obstructed nearest-neighbor (CONN), an RONN query retrieves the obstructed nearest neighbor for every point in a specified range. To process RONN, we first propose a CONN-Based (CONNB) algorithm as our baseline, which reduces the RONN query into a range query and four CONN queries processed using an R-tree. To address the shortcomings of the CONNB algorithm, we then propose a new RONN by R-tree Filtering (RONN-RF) algorithm, which explores effective filtering, also using R-tree. Next, we propose a new index, called O-tree, dedicated for indexing objects in the obstructed space. The novelty of O-tree lies in the idea of dividing the obstructed space into non-obstructed subspaces, aiming to efficiently retrieve highly qualified candidates for RONN processing. We develop an O-tree construction algorithm and propose a space division scheme, called optimal obstacle balance (OOB) scheme, to address the tree balance problem. Accordingly, we propose an efficient algorithm, called RONN by O-tree Acceleration (RONN-OA), which exploits O-tree to accelerate query processing of RONN. In addition, we extend O-tree for indexing polygons. At last, we conduct a comprehensive performance evaluation using both real and synthetic datasets to validate our ideas and the proposed algorithms. The experimental result shows that the RONN-OA algorithm outperforms the two R-tree based algorithms significantly. Moreover, we show that the OOB scheme achieves the best tree balance in O-tree and outperforms two baseline schemes.

CONN search

…

Illustration for Lemma 1

…

Wrong crossing

…

Figures - uploaded by Xiaochun Yang

Content may be subject to copyright.

Content uploaded by Xiaochun Yang

Content may be subject to copyright.

Range-based Obstructed Nearest Neighbor Queries

Huaijie Zhu†, Xiaochun Yang†, Bin Wang†, Wang-Chien Lee‡

†Northeastern University, China

‡Pennsylvania State University, University Park, PA 16802, USA

zhuhjneu@gmail.com, {yangxc, binwang}@mail.neu.edu.cn, wlee@cse.psu.edu

ABSTRACT

In this paper, we study a novel variant of obstructed near-

est neighbor queries, namely, range-based obstructed near-

est neighbor (RONN) search. A natural generalization of

continuous obstructed nearest-neighbor (CONN), an RONN

query retrieves the obstructed nearest neighbor for every

point in a speciﬁed range. To process RONN, we ﬁrst pro-

pose a CONN-Based (CONNB) algorithm as our baseline,

which reduces the RONN query into a range query and four

CONN queries processed using an R-tree. To address the

shortcomings of the CONNB algorithm, we then propose

a new RONN by R-tree Filtering (RONN-RF) algorithm,

which explores eﬀective ﬁltering, also using R-tree. Next,

we propose a new index, called O-tree, dedicated for index-

ing objects in the obstructed space. The novelty of O-tree

lies in the idea of dividing the obstructed space into non-

obstructed subspaces, aiming to eﬃciently retrieve highly

qualiﬁed candidates for RONN processing. We develop an

O-tree construction algorithm and propose a space division

scheme, called optimal obstacle balance (OOB) scheme, to

address the tree balance problem. Accordingly, we propose

an eﬃcient algorithm, called RONN by O-tree Acceleration

(RONN-OA), which exploits O-tree to accelerate query pro-

cessing of RONN. In addition, we extend O-tree for indexing

polygons. At last, we conduct a comprehensive performance

evaluation using both real and synthetic datasets to validate

our ideas and the proposed algorithms. The experimen-

tal result shows that the RONN-OA algorithm outperforms

the two R-tree based algorithms signiﬁcantly. Moreover, we

show that the OOB scheme achieves the best tree balance

in O-tree and outperforms two baseline schemes.

Keywords

Nearest neighbor; Range-based nearest neighbor; Obstacle

1. INTRODUCTION

With the growing popularity of smart mobile devices and

the rapid advance in wireless communication and position-

Permission to make digital or hard copies of all or part of this work for personal or

classroom use is granted without fee provided that copies are not made or distributed

for proﬁt or commercial advantage and that copies bear this notice and the full cita-

tion on the ﬁrst page. Copyrights for components of this work owned by others than

ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or re-

publish, to post on servers or to redistribute to lists, requires prior speciﬁc permission

and/or a fee. Request permissions from permissions@acm.org.

SIGMOD ’16 June 26–July 1, 2016, San Francisco, USA

⃝2016 ACM. ISBN 123-4567-24-567/08/06. . . $15.00

DOI: http://dx.doi.org/10.1145/2882903.2915234

ing technologies, various spatial queries have been proposed

to support the needs of mobile users. Range nearest neigh-

bors (RNN) query [1, 3, 5, 10, 11, 14, 15] is a popular spatial

query that retrieves the nearest neighbors to a spatial region

where the mobile user is possibly located. It has been re-

garded as a fundamental query in support of various real

applications, which have been discussed widely in the liter-

atures [1, 3, 5, 10] and can be categorized into the following

four use scenarios.

•Privacy-preserving. Mobile users are not willing to re-

veal their exact location information to location-based

service providers, due to the concern of location privacy

leaking. Spatial cloaking, a well-received privacy pre-

serving technique, proposes to blur the user’s exact lo-

cation into a spatial region in order to satisfy the user’s

speciﬁed privacy requirements [9, 11, 12, 13, 14, 15, 16,

24, 27]. RNN facilitates search of the nearest neighbors

with respect to the spatial region.

•Continuous queries. For services that facilitate contin-

uous lookup of nearest neighbors while a user is moving

around an area, it is ineﬃcient to process individual

NN queries against the whole data set at the server. To

submit an RNN query to prefetch all possible nearest

neighbors for this area as a candidate set is a better

choice. As a result, NN queries issued for objects in

this area can be processed locally against the candidate

set, saving both computation and communication cost

signiﬁcantly [14, 15, 16].

•Batch processing. Consider the situation where very

close NN queries are issued from many diﬀerent users.

Instead of processing these queries separately, it’s more

eﬃcient to group spatially adjacent NN queries and pro-

cess them in a batch. An RNN query with a spatial

range bounding all the NN query points in the batch

can be issued to obtain a candidate set and then the in-

dividual NN queries can be resolved with the candidate

set returned by RNN [14].

•Location uncertainty. Due to the limitation of under-

lying positioning techniques (especially those network-

based techniques relying on cellular and wiﬁ access points)

as well as the sampling imprecision caused by continu-

ous motion, network delays, and frequent location up-

dates (even for devices equipped with GPS), the loca-

tion uncertainty of a mobile user remains an issue. To

support such a user, an RNN query with a spatial re-

gion where the user location is guaranteed (to cope with

the location uncertainty) can be issued to retrieve the

NNs to the user [15, 16].

2053

Nevertheless, these prior RNN works mostly assume a

plain space with no obstacles, and thus use Euclidean dis-

tance, the shortest distance between a data object and a

query range, as the distance measure. While their contribu-

tions are signiﬁcant to the ﬁeld, there exists a gap to real-life

applications, as the space we live in usually has obstacles in-

stead of being plain. For example, a traveler drives around

mountains and lakes; troops take detours to avoid enemy’s

ground; wireless communication messages may avoid rout-

ing through an area that suﬀers electromagnetic interfer-

ence. Indeed, a recent trend in research of spatial queries is

to consider the obstructed space where the obstructed dis-

tance [23], i.e., the shortest path between two points with-

out crossing any obstacle, is adopted as the distance mea-

sure. For example, research studies on the obstructed nearest

neighbor (ONN) query, which retrieves k(k≥1) NNs of a

given query point in the obstructed space, are reported [23,

25]. Moreover, to support mobile applications, continuous

obstructed nearest neighbor (CONN) query for ﬁnding near-

est neighbors to a given line segment in the obstructed space

is proposed [6]. The idea of visibility graph (VG) is also pro-

posed to simplify the calculation of obstructed distance [4].

In addition, the problem of spatial clustering in the presence

of obstacles is studied [21, 22]. The realistic needs and these

research works for supporting mobile users in the obstructed

space inspired our research presented in this paper.

Considering the problem of RNN under the context of ob-

structed space, in this paper, we propose a new spatial query,

namely, range-based obstructed nearest neighbor (RONN)

query.1Given a set of data objects P, and a set of obstacles

O, an RONN query, speciﬁed by a query range R, retrieves

all the nearest neighbors corresponding to every point in R

based on obstructed distance.2An example of the RONN

query is illustrated in Fig. 1. As shown, the circled points

denote the data objects, the red line segments logically rep-

resent the obstacles and the rectangle at the center speciﬁes

a query range.3The result of this RONN search consists of

the blue points. Notice that point p7is not in the result set

owing to obstacle o1.p3is not in the result set since p4is

nearer to the query range than p3. A major challenge faced

in processing RONN lies in the adoption of the obstructed

distance as a measure, which makes the existing techniques

for conventional RNN inapplicable.

In this paper, we ﬁrst propose a CONN-Based (CONNB)

algorithm as the baseline to tackle the RONN problem. By

reducing an RONN query to a range query and four CONN

queries [6], we exploit an R-tree of data objects and obstacles

to answer the query. Moreover, observing some shortcom-

ings of the CONNB algorithm, we propose a new RONN

by R-tree Filtering (RONN-RF) algorithm, which explores

1To the best knowledge of the authors, the problem of RNN

has not yet been addressed under the context of the ob-

structed space.

2Following the convention in spatial database research, we

use a rectangle to represent the query range, which is as-

sumed to be in parallel to x-axis or y-axis. Alternative rep-

resentations of the query range will be explored in future

works.

3Although an obstacle can be in any shape (e.g., triangle,

pentagon, etc.), line segment is the basic unit of shapes, e.g.,

rectangular and polygon. for simplicity, we ﬁrst follow the

common assumption adopted in the literature of obstructed

spatial queries to use line segment for obstacles. Later we

also consider the polygons as an extension.

p3p4

Query

Figure 1: RONN search

eﬀective ﬁltering strategies to facilitate eﬃcient query pro-

cessing. Nevertheless, to process RONN, there exist two

issues in R-tree due to the presence of obstacles. First, it re-

quires multiple traversals of the R-tree for both RONN-RF

and CONNB, i.e., as to be detailed later, RONN-RF needs

to traverse R-Tree twice and CONNB requires ﬁve scans of

the R-tree. Second, R-tree indexes both data objects and

obstacles in the same Euclidean space, resulting in expen-

sive online computation of obstructed distance during query

processing.

To address the above two issues, we propose a new in-

dex, namely, O-tree, which organizes data objects and ob-

stacles in a way that non-obstructed subspaces can be pro-

duced to alleviate the computation of obstructed distance.

The novelty of O-tree lies in the idea of dividing the ob-

structed space into non-obstacle subspaces, aiming to eﬃ-

ciently retrieve highly qualiﬁed candidates for accelerating

RONN processing. O-tree is constructed by sequentially se-

lecting obstacles to ﬁnd pivot points for space division re-

cursively. Eventually, R-trees are embedded in the O-tree

to handle the non-obstructed subspaces.

In order to build a balanced O-tree, we propose a no-

tion of tree balance, namely, obstacle balance. Based on our

analysis, we observe that the tree balance is primarily af-

fected by the orders of obstacles being selected to build the

O-index. Accordingly, we propose a space division scheme,

namely, optimal obstacle balance (OOB). Moreover, we pro-

pose an eﬃcient algorithm, called RONN by O-tree Accelera-

tion (RONN-OA), which exploits O-tree to accelerate query

processing of RONN. In addition, we extend O-tree for in-

dexing polygons. At last, we conduct a comprehensive per-

formance evaluation using both real and synthetic datasets

to validate our ideas and evaluate the proposed algorithms.

The contributions made in this research are ﬁve-fold:

•We formalize the Range-based Obstructed Nearest Neigh-

bor (RONN) query, a new variant of spatial queries in

the obstructed space. To the best of our knowledge, this

work is the ﬁrst attempt to tackle the RONN problem.

•We propose two original RONN algorithms based on R-

tree. The ﬁrst one (CONNB) answers an RONN query

with one range query and four CONN queries. The

second one (RONN-RF) explores eﬀective heuristics to

facilitate eﬃcient ﬁltering by R-tree.

•As an alternative to R-tree, we propose a novel in-

dex (O-tree) by dividing the obstructed space into non-

obstructed subspaces. This new index prevails R-tree in

supporting eﬃcient RONN query processing. In addi-

tion, we propose a space division scheme (OOB) for con-

structing the O-tree and study the tree balancing issue

as an optimization problem. Moreover, we extend the

O-tree to index obstacles as polygons (see Appendix).

•Based on O-tree, we propose a new algorithm (RONN-

OA) for eﬃcient query processing of RONN.

2054

•We conduct extensive experiments to evaluate the pro-

posed algorithms and schemes. The experimental re-

sult shows that the RONN-OA and RONN-RF algo-

rithms signiﬁcantly outperform the baseline algorithm

(CONNB), with 360% and 300% improvement, respec-

tively. Meanwhile, our study shows that OOB is the

scheme of choice for space division in O-tree.

2. RELATED WORK

A lot of research eﬀort related to our work, including

range based spatial queries,ONN queries, and spatial cloak-

ing query (for location privacy preservation), have been made

in the ﬁeld of spatial databases.

2.1 Range-based Spatial Queries

Many spatial queries have been studied over the years,

e.g., range queries, nearest neighbor (NN) queries, continu-

ous nearest neighbor (CNN) queries [18, 26, 20], range near-

est neighbor (RNN) queries [10], and range based skyline

(RSky) queries [16]. Among them, the “range-based” spatial

queries are the most relevant to our work.

The notion of kRNN queries has been proposed to retrieve

knearest neighbors for every point in a speciﬁed range [10].

New algorithms are developed based on either planar ge-

ometry in two-dimensional space or linear programming in

high-dimensional spaces. In addition, a solution-based aux-

iliary index, called EXO-tree, is developed to support a va-

riety of NN queries, including traditional NN, CNN, and

RNN searches. However, the design of EXO-tree does not

consider the obstructed space and thus is not applicable to

RONN.

Another relevant work is the RSky queries in mobile envi-

ronment [16]. Two algorithms, namely, I-SKY and N-SKY,

are proposed to ﬁnd skyline objects in static and highly dy-

namic datasets, respectively. The authors also further con-

sider the problem under the contexts of probabilistic and

continuous queries. To process probabilistic RSky queries,

they propose new pruning techniques to improve the compu-

tational eﬃciency. Moreover, they compute the valid scope

for each skyline object to process continuous RSky queries

eﬃciently. Again, the above RNN and RSky queries are

considered only in non-obstructed space. In this work, we

design a novel index for obstructed space to enhance query

processing of RONN.

2.2 ONN Queries

Due to the impact of obstacles, a growing number of re-

search lines considering the obstructed spatial space, includ-

ing obstructed spatial queries, visible spatial queries, and

obstructed spatial clustering, have been explored.

In obstructed spatial queries, Zhang et al. [25] propose

algorithms for answering classical spatial queries, such as

range queries, NN search, distance join queries, closest pair

queries and distance semi-join queries, in the obstructed

space. Xia et al. study the obstructed nearest neighbor

(ONN) query which ﬁnds the kNNs of a given query point

based on the obstructed distance [23]. This work adopts

R-tree to index the data objects and obstacles in order to

incrementally compute the obstructed distance by ﬁltering

out a large number of obstacles. Gao and Zheng study the

continuous obstructed nearest neighbor (CONN) search [6].

As our baseline algorithm is to reduce an RONN to a range

query and four CONN queries, we discuss some key ideas

of this work in Section 3. RONN is fundamentally diﬀerent

from CONN as the RONN query is based on a range rather

than a line segment.

Research works on visible spatial queries also consider the

existence of obstacles. However, these queries adopt Euclid-

ian distance instead of obstructed distance as the distance

measure, i.e., two objects are visible to each other iﬀ the

straight line segment connecting them does not cross any

obstacle. A visible nearest neighbors query returns the NNs

visible to a given query point [19]. Gao explores the contin-

uous visible nearest neighbor query which ﬁnds the visible

NN of every point along a query line segment [7].

In addition, the problem of spatial clustering in the pres-

ence of obstacles has attracted considerable attention in re-

cent years. It divides a set of two-dimensional data objects

into smaller homogeneous groups (i.e., clusters) by taking

into account the impact of obstacles or obstructed distance.

A number of clustering algorithms with obstacle constraints

have been reported in the literature [21, 22].

Although these works have considered the obstructed space

or the impact of the obstacles, the RONN query has not been

studied previously.

2.3 Spatial Cloaking Query

In location privacy preservation, several works propose to

perform spatial cloaking for protecting users’ exact location

from leaking by specifying a cloaked range, instead of the

exact query location. The spatial cloaking query calculates

a safe region and provides all data objects inside the safe

region (i.e. candidates) instead of a single exact answer [17].

Casper [17] blurs a user’s exact location into a cloaked

spatial range, in form of a rectangle, based on privacy re-

quirements speciﬁed by the user. Accordingly, the server

processes the query based on the cloaked spatial ranges.

Gruteser, et al. [9] propose a Quadtree-based algorithm for

spatial cloaking. It achieves identity anonymity in LBS by

spatio-temporal cloaking based on a k-anonymity model,

i.e., the cloaked location is made indistinguishable from the

location information of at least k-1 other users. Gedik and

Liu [8] propose a novel spatio-temporal cloaking algorithm,

called CliqueCloak, which provides k-anonymity in LBSs for

mobile users. The most close work to ours is [27]. Zhu et

al. [27] construct a cloaked range corresponding to the exact

location using a pyramid tree to address the obstructed near-

est neighbor problem under the context of obstructed space.

Diﬀerent from the above cloaked ranged query techniques,

RONN returns the nearest neighbors to every possible point

in the cloaked range. We show the comparison results in

Section 7.

3. PRELIMINARIES

In this section, we ﬁrst introduce some background and

deﬁnitions for our research. Then we formulate the RONN

problem.

3.1 Background and Deﬁnitions

In this work, we focus on the obstructed space. We assume

that there exists a set of data objects Pand a set of obstacles

Oin the space. We assume that people moving from one

point to another in the obstructed space follow the shortest

path while avoiding the obstacles. In this space, a proper

distance measure (other than Euclidean distance) is needed

for obstructed spatial queries. Thus, we deﬁne the notion of

2055

obstacle-free path and use it to deﬁne the obstructed distance

between two points.

Deﬁnition 1. (OBSTACLE-FREE PATH [23]). Consider

two points p, p′in the obstructed space. An obstacle-free

path,P(p, p′) = ⟨d0, d1, ..., dn⟩(where diis a point in the

space and d0=p, dn=p′), consists of nline segments

of straightline subpaths ⟨d0, d1⟩,⟨d1, d2⟩, ..., ⟨dn−1, dn⟩that

connect pand p′.4

Deﬁnition 2. (OBSTRUCTED DISTANCE [23]). Let the

distance of an obstacle-free path P(p, p′) be |P(p, p′)|=

∑i∈[0,n−1] dist(di, di+1) where dist(di, di+1 ) is the Euclidean

distance between diand di+1. The obstructed distance be-

tween pand p′, denoted by ||p, p′||, is the length of the short-

est obstacle-free path (shortest path for short) from pto p′,

i.e., @P(p, p′),|P(p, p′)|<||p, p′||.

Based on the notion of obstructed distance, we now deﬁne

the obstructed nearest neighbor to a query point.

Deﬁnition 3. (OBSTRUCTED NEAREST NEIGHBOR).

For a given query point qin the obstructed space, the data

object p∈Pis the obstructed nearest neighbor (ONN) of q

iﬀ ∀p′∈P−p, ||p, q|| ≤ ||p′, q||.

By capturing the anticipated moving trajectory of a mo-

bile user as a line segment, the idea of ONN can be extended

to CONN as deﬁned below.

Deﬁnition 4. (CONTINUOUS OBSTRUCTED NEAREST

NEIGHBOR). Given a query line segment q, the continuous

obstructed nearest neighbors corresponding to q, denoted

by CON N(q), is a subset of objects in Psuch that ∀point

q∈q,ON N (q)∈C ONN (q).

Notice that the CONN of a line segment deﬁned above

is the union of ONN objects with respect to the (inﬁnite

number of) points on the line segment q. Alternatively,

CON N (q) can be compactly represented as the union of

CONNs corresponding to a sequence of line segments q1,q2,

..., qtsuch that (i) qis the concatenation of q1, q2, ..., qt,

(ii) there exists only one CONN, i.e., the ONN, for each qi

(where i= 1..t), and (iii) ON N (qi)̸=ONN (qi+1) (where

i= 1..t −1).

Accordingly, the points s1, s2, ..., st−1that split qas q1,

q2, ..., qt, termed as split points, are deﬁned below.

Deﬁnition 5. (SPLIT POINTS FOR CONN). Let the query

line segment qbe denoted by ab, where aand bare the two

end points of q. Consider the q1, q2, ..., qtassociated with

CON N (q) where q1=as1, q 2=s1s2, ..., qt=st−1b. The

points s1, s2, ..., st−1are split points of ab. Moreover, let pi

and pi+1 be the ONNs to all the points along si−1siand

sisi+1, respectively. Then the obstructed distance from si

to piand pi+1 are the same, i.e., ||pi, si|| =||pi+1, si||.

Example 1. Fig. 2 illustrates CONN. As shown, {p1,p2,

..., p6}are data objects, {o1,o2,...,o4}are obstacles, and ab

is the query line segment. The output of the CONN search is

{⟨p1, as1⟩,⟨p2, s1s2⟩,⟨p6, s2s3⟩,⟨p3, s3b⟩}, where s1,s2and

s3are the split points. The split point s1for point p1and

p2can be computed by a quadratic algorithm [6].

The notion of CONN and split points are used to describe

some of our algorithms built upon CONN.

4Note that there exists a straightline path between two

points if and only if there is no obstacle blocking the path.

p1p2

s2s3

Figure 2: CONN search

3.2 Problem Formulation

In this section, we formally deﬁne the RONN query.

Problem Statement. Given a set of data objects P and

a set of obstacles O, a range-based obstructed nearest neigh-

bor (RONN)query Q (speciﬁed by a rectangular range R),

retrieves all the obstructed nearest neighbors of points in the

query range, i.e., RONN(Q) = {ON N (p)|p∈R}.

4. R-TREE BASED QUERY PROCESSING

In this section, we propose two R-tree based algorithms

for RONN query processing.

Given an RONN query Q, speciﬁed by a query range R,

any object located inside Ris automatically an RONN to

the query. Thus, an idea to answer an RONN is to ﬁrst issue

a range query to ﬁnd all “internal” RONNs located within

Rand then ﬁnd the “external” RONNs located outside R.

Intuitively, as indicated in Lemma 1, these external RONNs

of Rare the ONNs to the boundary of R.

lemma 1. An object pis an external RONN of Rif and

only if 1) p is not located in R; and 2) p is an ONN for at

least one point on the boundary of R.

Proof. For the necessary condition, it is inherent from

the RONN deﬁnition that if pis an external RONN to R,

then (1) and (2) above are satisﬁed.

Next, we prove the suﬃcient condition by contradiction.

Assume that there exists an RONN poutside Rwhich is

not the ONN to any point on the boundary of R. Since

pis an RONN, pis the ONN for at least one point min

R(see Fig. 3 for illustration). Let nbe the intersection

point of line segment pm and the boundary of Rwhere n

is located. Based on the assumption, as pis not the ONN

of n(which is a point on the boundary), there must be

another data object p′such that ||n, p′|| <||n, p||. Thus,

||n, p′||+||n, m|| <||n, p||+||n, m||. As a result, we have

||m, p′|| <||m, p||, which contradicts our assumption that

pis the ONN of m. Therefore, the suﬃcient condition must

hold.

Combine the cases of necessary and suﬃcient conditions,

the proof completes.

4.1 CONN-Based Algorithm

According to Lemma 1, this RONN query can be answered

by issuing four CONN queries (using each edge of Ras the

query line segment) and one range query. The union set of

data objects returned from the above ﬁve queries answers

the RONN query. This CONN-based (CONNB) algorithm,

serves as a baseline for comparison in our evaluation.

In CONNB, R-tree has been employed to index both data

objects and obstacles. Accordingly, a quadratic algorithm

proposed in [6] can be used to return the CONNs for each

edge of R. As a result, CONNB is also quadratic.

2056

p' p

Figure 3: Illustration

for Lemma 1

o1o2

Figure 4: Maximum

Obstructed Distance

4.2 R-tree Filtering Algorithm

The CONNB algorithm directly invokes range and CONN

queries and thus does not explore some potential ﬁltering

strategies during query processing. Notice that some in-

termediate RONNs acquired during query processing may

potentially disqualify certain RONN candidates located far

away from the query range R. Thus, an idea to facilitate

eﬃcient ﬁltering is to explore the RONNs using a distance-

browsing approach to obtain RONNs in accordance with

their minimal Euclidean distance to R. Accordingly, in this

section, we propose a new RONN algorithm, called R-tree

Filtering Algorithm (RONN-RF), which built upon R-tree

to facilitate distance-based browsing and ﬁltering.

To achieve our goal, RONN-RF maintains a number of

maximum obstructed distances, one for each edge of the

query range, from the set of intermediate RONNs to the

query range Rin order to ﬁlter out unqualiﬁed candidates.

While using R-tree facilitates browsing of data objects in

ascending order of their minimal Euclidean distance to R, a

major challenge faced in answering the RONN query is the

use of obstructed distance as a measure. Here we introduce

the notion of max obstructed distance of an object set to a

line segment, as deﬁned below, for discussion of RONN-RF.

Deﬁnition 6. (MAXIMUM OBSTRUCTED DISTANCE

OF AN RONN SET TO LINE SEGMENT). Given a set

of intermediate RONNs Pand a line segment ab on the

obstacle space, we say that ||m, pm|| is the maximum ob-

structed distance from Pto ab, denoted by max odis(P, ab),

iﬀ there exist a point mon ab such that ∀point ion ab,

||i, ON NP(i)|| ≤ ||m, ON NP(p)|| where ONNP(i) denotes

the point in Pserving as the ONN of point i.

Example 2. Consider Fig. 4, where O={o1, o2}is an

obstacle set, P={p1, p2}is a set of intermediate ONNs

obtained during query processing, and ab is a side edge of

the query range. As shown, p1and p2are the RONNs for

all points located on as and sb, respectively. In this ex-

ample, we have max odist(p1, as) = ||p1, s|| <||p2, b|| =

max odist(p2, sb), which thus is max odis(P, ab).

In RONN-RF, four maximum obstructed distances (MODs),

one for each edge of the query range, are maintained to ﬁlter

out unqualiﬁed objects from examination and to determine

the terminating condition for query processing. RONN-RF

ﬁrst issues a range query to obtain the internal RONNs,

which are used to initialize the four MODs.5Next, RONN-

RF traverses the R-tree to visit the data objects in ascending

5These MODs are incrementally updated as the RONN an-

swer set is reﬁned during query processing.

order of their minimal Euclidean distances to query range R

in order to determine whether they are ONNs to each edge.

During the traversal of R-tree, a heap His used to store the

R-tree entries (including minimum bounding boxes (MBBs),

data objects, and obstacles) in order to process them in as-

cending order of their minimum distances to R. Initially,

Hcontains only the root entry. By de-heaping the entry e

with the shortest distance to R, RONN-RF checks whether

the minimal Euclidean distances from eto Ris greater than

all the current MODs. If this is true, ewill be eliminated

and a new entry is de-heaped. Otherwise, if eis an MBB

(i.e., internal index node), all its child entries are pushed

into Hfor further processing. On the other hand, if eis a

leaf node of R-tree, there are two cases: (i) eis an obstacle:

it is inserted into the visibility graph V G, which is used for

computing the split points on edges for RONNs; and (ii) e

is a data object: it updates RONN results and MODs corre-

sponding to each edge by ﬁnding split points on edges of the

range boundary (to be discussed further later). The above

query processing steps (i.e., de-heaping a new entry from H

and so on) are repeated until Hbecomes empty.

The operation of ﬁnding split points (called F indSP ) plays

an important role in query processing of RONN-RF. We fol-

low the idea of CONN algorithm [6] to compute the split

points for the query boundary. Notice that, instead of se-

quentially ﬁnding RONNs corresponding to each edge (as

CONNB does), RONN-RF examines a data object corre-

sponding to all edges simultaneously. This results in signif-

icant performance improvement, in addition to the major

saving obtained from ﬁltering out unqualiﬁed RONN candi-

dates, because RONN-RF needs to scan R-tree only once in-

stead of four times. Let ri(i= 1..4) denote the corner points

of the query range Rand Eij denote the edge rirj. Given

the intermediate RONN result set (RSij) and the visibility

graph (VG) of obstacles visited so far, we are able to decide

whether the data object under examination pis an RONN

to Eij . If it is, we are able to obtain a list of split points

that divide Eij into line segments being corresponding to

RONNs in RSij. Moreover, the MOD for Eij is updated.

p3p4

N1N2

N3N4

p4p6o2

p1p3

N1,N2

N2,N6,N7

N3,N6,N4,N7

N6,N4,p5,p6,N7

o1,N4,p5,p6,N7

Insert o2into

visibility graph

N4,p5,p6,N7

o2,p5,p6,N7

p5,p6,N7

p6,N7

Insert o1into

visibility graph

Heap status

Figure 5: An example of RONN-RF

Fig. 5 shows an example for illustration of the RONN-RF

algorithm. As shown, there are eight data objects p1, p2,...

,p8and two obstacles o1and o2, maintained in an R-tree

index. In the index, the root node contains two internal

nodes N1and N2which cover N5, N6, N7, and N3, N4, re-

spectively. In turn, N5contains data objects p1,p2, and

p3,N7contains data objects p7and p8, and N3contains

2057

data objects p4,p5, and p6. On the other hand, N6and

N4contains o1and o2, respectively. Given a query range R

(i.e., the red rectangle), RONN-RF ﬁrst issues a range query

to obtain the internal RONN p4. Accordingly, it initializes

the four MODs corresponding to each edge of Rusing p4.

Next, to identify external RONNs by traversing R-tree, we

initialize Hwith N1and N2, sorted in ascending order of

their Euclidean distances to R.6We proceed to pop the top

entry N1from H. As N1is an internal node, we compare

the Euclidean distances of its child entries N5,N6and N7

with the maintained MODs and insert N6and N7back to

H. Notice here that N5is ﬁltered by the maintained MODs.

Similarly, N2is de-heaped from Hand then N3, N4are in-

serted back. Continue with the process, N3is de-heaped

and data objects p5and p6are inserted back. Here, since

p4is already in the RONN result set, it is not put back to

H. After that, we pop N6which contains an obstacle o1.

Thus, we insert o1into the visibility graph. Similarly, we

handle N4and insert o2into visibility graph. Then p5is

de-heaped as a data object and found to be an RONN (by

updating current split points via VG). MODs corresponding

to each edge of Rare updated. Similarly, p6are included

in the result set. Finally, we pop N7and insert p7into H

because the Euclidean distance of p8is greater than some

of the maintained MODs. p7is de-heaped as a data object

but it can not update the current RONN results due to the

blocking of obstacle o1. At this stage, Hbecomes empty

so the query processing ends. The ﬁnal RONN result set

includes p4,p5and p6.

As shown in the above example, we maintain four MODs,

one for each edge. An RONN candidate is ﬁltered only if all

its Euclidean distances to the edges of query boundary are

greater than the corresponding MODs. The ﬁltering condi-

tion, as speciﬁed in Theorem 1, eﬀectively reduces the search

space of RONN solutions and thus alleviates signiﬁcant com-

putation overhead. Algorithm 1 presents the pseudo-code

of RONN-RF. Notice that the main issue on R-tree is that

the minimal Euclidian distance of accessing order is not very

eﬀective due to the aﬀecting of obstacles, i.e., the minimal

Euclidian distance of N7is very small but this is not the

RONN result.

Theorem 1. Given a query range Rand the current RONN

result set RS, an entry eof R-tree is ﬁltered iﬀ for each edge

ab of R,min dist(e, ab)> max odis(RS, ab).

In Theorem 1, notice that min dist(e, ab) is Euclidean

distance, which is easy to compute. Moreover, it can ﬁl-

ter distant data objects eﬀectively because the Euclidean

distance represents the lowerbound of obstructed distance

from an entry eto the query range R.

Comparing to CONNB algorithm, the RONN-RF algo-

rithm has two advantages: 1) RONN-RF needs only one

traversal of R-tree to ﬁnd the external RONNs while CONNB

requires four scans of the R-tree; and 2) In traversal of the

R-tree, RONN-RF explores eﬀective heuristics to prune the

RONN search space, while CONNB do not exploit interme-

diate (internal and external) RONNs for ﬁltering in query

processing.

6Notice that both N1and N2have Euclidean distances to

some edges of Rshorter than the current RONNs’ MODs to

these edges.

Algorithm 1: R-tree Filtering Algorithm (RONN-RF)

Input: Data objects and obstacles R-tree T, min-heap

H= (root(T),0), and query range R

Output: RONN results list RS

1inRes ←Rangequery(T, R);/* internal RONNs*/

2RS ←inRes;RSk←inRes ;k= 1,2,3,4 /*storing

ONNs for four edges*/

3overk←0; k= 1,2,3,4 /*Terminating signal for four

edges, 0 means query processing for the corresponding

edge is not over yet*/

4 while true do

5k←1;/*Starting from edge 1*/

6 for Each edge Eij of Rdo

7RSk←FindSP(Eij , RSk); /*Finding split

points*/

8MODk←max odis(Eij , RSk);

9 if H̸=∅and H.head.key > M ODkthen

10 overk= 1. /* Processing for edge kis over*/

11 if over1&over2&over3&over4or H.

=∅then

12 return; /*There is no entry which will be an

ONN for any edge*/

13 k+ +;

14 while H̸=∅do

15 de-heap the top entry (e, key) of H;

16 if eis a data object then

17 k←1;

18 for key < M ODkdo

19 insert einto RSk;

20 k+ +;

21 break;

22 else

23 for each child entry ei∈edo

24 if min dist(ei, R)< MAX(M ODk)

then

25 insert eiinto Hbased on

min dist(ei, R).

26 RS ←RS ∪RSk;

27 return RS;

5. O-TREE FOR OBSTRUCTED SPACE

As shown earlier, R-tree is used in CONNB and RONN-

RF to index both data objects and obstacles in the ob-

structed space. While R-tree is a well-received index struc-

ture for spatial databases, however, it faces two issues in pro-

cessing RONN queries due to the presence of obstacles. One

is the need for multiple traversals of R-tree in both RONN-

RF and CONNB. When doing internal RONN (range query)

and external RONN query, RONN-RF needs to traverse R-

tree twice and CONNB requires ﬁve scans of the R-tree.

This inspires us to explore processing ideas that requires

only one scan of the (indexed) data to process internal and

external RONN queries all together. The other issue is the

expensive online computation of obstructed distance. Note

that R-tree is built based on the Euclidean space, i.e., Eu-

clidean distance is used as the measure among data objects,

obstacles and minimum bounding boxes (MBBs) in the in-

dex. As a result, R-tree-based RONN algorithms have to

2058

deal with expensive online computation of obstructed dis-

tance. Recall the example in Fig. 5. While p1, p2and p3

(actually N5) have been eﬀectively pruned early by RONN-

RF, p7and p8have to be accessed since their Euclidean

distances to Ris small. This incurs expensive computation

of their obstructed distances to Rbefore they are eventually

excluded from the answer. Thus, we aim to reduce access of

data objects in online computation of obstructed distance.

To address these issues, we propose a novel index, called

O-tree, to support eﬃcient RONN query processing in the

obstructed space. The novelty of O-tree lies in the idea of

dividing the obstructed space into non-obstructed subspaces

such that we can use R-tree to index the data objects asso-

ciated with these subspaces. Our idea is to divide the space

into four subspaces along the line of a selected obstacle and

some perpendicular lines, e.g., its bisector. Recursively we

divide the subspaces into more subspaces to build an in-

dex structure. The indexed subspaces may ideally consist

of non-obstructed spaces which can facilitate eﬃcient com-

putation and pruning in query processing. While the idea

is simple and elegant, the construction of O-tree still faces

some design challenges, e.g., tree balance.

In the following, we ﬁrst discuss the issues of space division

in O-tree construction and propose a scheme to address this

issue. At last, we present the O-tree construction algorithm.

5.1 Space Division

As mentioned earlier, we aim to construct the O-tree by

dividing an obstructed space into less obstructed subspaces

(towards non-obstructed subspaces). Without loss of gen-

erality, we ﬁrst consider an obstructed subspace with only

one obstacle. Fig. 6(a) shows an example of such obstructed

space. By dividing the space along the lines of the obstacle

and its perpendicular bisector, we have four non-obstructed

subspaces.

Following the design of conventional space-division based

spatial indexes, e.g., Quad-tree or k-d tree, we use a sub-

tree to cover the data objects located in each subspace (e.g.,

three black circles are covered in Region R2). Nevertheless,

this design inherently faces an issue in spatial query process,

i.e., for a query point (a query range in the case of RONN)

located close to the boundary of a region, the data objects

located in neighbor regions need to be accessed. For exam-

ple, in Fig. 6(a), not only the black circles in R2but also

all the other grey circles in R1,R3and R4need to be ac-

cessed to ﬁnd the RONNs, which would obviously defeat the

ultimate goal of our O-tree. To address this issue, we trade

some storage overhead for processing eﬃciency in O-tree by

including, for a giving region (subspace), all data objects

that should be considered to answer any RONN query spec-

iﬁed in the region. This set of additional data ob jects can be

obtained by precomputing CONNs to the boundary line seg-

ments of the region during O-tree construction. Therefore,

for the example in Fig. 6(a), O-tree stores the data objects

denoted by both black circles in R2and the grey circles,

p1, p2, ..., p5. As such, all the candidate data objects for a

given query range Rcan be quickly retrieved by accessing

the regions intersected with R. This feature of O-tree is also

applicable to the general case of a space with multiple ob-

stacles. As shown in Fig. 6(b), only data objects associated

with R24 and R3need to be examined in query processing

(to be detailed in Section 6).

R3R4

p1p2

(a) One obstacle

R24

R3R4

p10

p13

R21

R22

R23

(b) Multiple obstacles

Figure 6: An example of space division

Figure 7: No strict

obstacle balance

o21

o31

Figure 8: Illustration

for equal perpendicu-

lar line

5.2 Optimal Obstacle Balance Scheme

Two key questions that may naturally arise during the

discussion of space division earlier are:

•Which obstacle should be ﬁrst selected to divide the

space?

•For a given obstacle, is its perpendicular bisector the

best choice to divide the space?

To answer these two questions, a fundamental yet closely

related question is “How the divided spaces and thus the re-

sultant O-tree are structured, if we choose obstacles in diﬀer-

ent orders with diﬀerent“pivot points” to divide the space?”

We argue that a balanced O-tree is desirable and thus pro-

pose a space division strategy to achieve tree balance. In

O-tree, as obstacles are used to determine the pivot points

for space division, we aim to maintain the obstacle balance,

which is deﬁned as follows:

Deﬁnition 7. (OBSTACLE BALANCE) Selecting one ob-

stacle (obstacle line and its perpendicular line) to divide the

space into four subspaces. In each dividing step, the obsta-

cle balance is satisﬁed by the condition |ni−ni+1|<= 1 (ni

denotes the number of obstacles in each subspace), i.e., the

diﬀerence of height of each branch is less than 1.

According to the above strict obstacle balance deﬁnition,

can we always construct an obstacle balance tree? The an-

swer is not. In fact, we can not always construct a balanced

tree based on selection of obstacles and their perpendicu-

lar lines. (Sometimes there is no answer for the problem).

Fig. 7 illustrates an example. From Fig. 7, we can see that if

we choose o4(or o3) to divide the space, there is no suitable

perpendicular line to satisfy strict obstacle balance. The

same situation still exists when choosing other obstacles to

divide the space.

Since we can not always build a strictly balanced tree,

we aim to build an optimal approximate balanced tree. Ac-

2059

cordingly, in order to construct an optimal balanced tree, we

propose a new space division scheme, called optimal obstacle

balance (OOB) scheme, for O-tree. This scheme corresponds

to an optimization problem which can be solved using enu-

meration.

Deﬁnition 8. (OPTIMIZATION PROBLEM) Select one

obstacle (i.e., obstacle line ) and one of its perpendicular

lines to divide the space into four subspaces in each space

dividing step. The goal is to minimize the ob jective function

∑(|ni−nj|2) (i < j, i, j = 1,2,3,4) where ni, njdenote the

number of obstacles into two diﬀerent subspaces.

The OOB scheme to choose an obstacle and a perpendic-

ular line:

(1) there are nchoices of obstacles to divide the space.

This can be tried one at a time, so it takes O(n) time.

(2) for a chosen obstacle, there are inﬁnite choices to se-

lect a perpendicular line, which is obviously infeasible to

enumerate one by one. So we aim to try some ﬁnite (a

constant m) choices in selecting the best perpendicular line.

Let’s show an example in Fig. 8 for illustration. We project

the endpoints of each obstacle on the chosen obstacle line.

The projection of the endpoint o21 on obstacle line o4is h1,

while the projection of the endpoint o31 is h2. We can see

that there are also inﬁnite perpendicular lines (such as l1

and l2) to choose from between h1and h2. However, the

objective score of choosing l1(or l2) is the same as that

of choosing perpendicular line through h1(or h2). In other

words, a certain range of perpendicular lines always equal to

one perpendicular line through the projection of endpoints.

We only need to compute the objective scores of the perpen-

dicular lines through the projections of all endpoints to ﬁnd

the minimal score. As there are 2(n−1) perpendicular lines

through all the endpoints, this step takes O(2(n−1)) time.

In total, this problem can be solved in O(n∗2(n−1)) =

O(n2) time.

5.3 O-tree Index Structure

Based on the above discussion on space division, O-tree

index ﬁrst exploits obstacles to divide the obstructed space

into a number of non-obstructed subspaces, which cover only

data objects. Accordingly, we use R-tree to index the data

objects in these Euclidean subspaces.

In summary, the O-tree index is structured as follows.

The root node, denoting the whole obstructed space, points

to four child nodes. Each of these child nodes denotes a

region in the obstructed space. The internal nodes of O-

tree maintain entries of (reg, obs, optr1, optr2, optr3, optr4),

while the leaf nodes contain entries of the form (reg, rptr)

where reg denotes a 2-dimensional polygon region, obs is

the obstacle dividing the space, and optriand rptr denote

pointers to O-tree and R-tree nodes, respectively. Fig. 9

illustrates the space divisions and the corresponding O-tree.

Obstacle o1ﬁrstly divides the region into four subregions and

the root has four child nodes (R1,R2,R3,R4). Next, R3is

divided into four subregions (R31,R32,R33 ,R34 ) utilizing

obstacle o2, and the other three subregions directly form the

leaf node. Internal node R3stores obstacle o2,optr pointing

to four O-tree children R31,R32,R33 , and R34 . Leaf node

R34 stores rptr to an R-tree which contains N1and N2. In

turn, N1contains data objects p3and p4, while N2contains

p5and p6. Similarly, R33 stores rptr to another R-tree which

contains N2.

R1R4

Heap status

N1; R34; N2

R33; R34

R44; N2

R32

R33

R34

R31

O-tree

p5; p6

p1p2p3p1p2p8

p2p7p8

R33

R32

R31

p4p7

R34

p5p6p5p6

p3p4

N1N2

: : :

isInResult[] objects[]

: : :

Two arrays

Figure 9: An example of O-tree

Reducing Data Redundancy Using Arrays for O-

tree. As shown in Fig. 9, data objects p1and p2are stored

twice in R-trees of nodes R2and R3, respectively. The same

case is for data objects p5and p6. As there exists some data

redundancy for O-tree index, two potential issues arise: (i)

index size increases and (ii) possible redundant traversals of

the same data object in query processing.

To reduce the index size, we store all the data objects in

an array objects and store the array index like id for each

object in our O-tree index. As shown in Fig. 9, there is an

array objects for all the data objects. This reduces index

size to some extent. As for traversal of data objects in the

O-tree, if data object pis examined, we can check if its

duplicate p′is already in the result set. We also build an

array isInResult corresponding with objects array in Fig. 9.

isInResult marks whether a data object is in result. As we

can obtain the id of data object pfrom O-tree index, whether

pis already in the result can be checked quickly. If pis not

already in result, we mark it with isInResult[id] = true.

5.4 Building O-tree

In this section, we present our O-tree construction algo-

rithm that performs space division recursively and ﬁlls its

leaf nodes with R-trees containing data objects. The pseudo

code of space division algorithm is shown in Algorithm 2.

Algorithm 2: Space division Algorithm

Input: A set of obstacles O, An O-tree node denoted

by Node (which contains information of its

covered region, denoted by reg)

Output: The constructed O-tree denoted by Node

1 if O̸=∅then

2Use OOB scheme to decide one obstacle obs;

3Divide Region into four subregions and create four

children optrs;

4 for each optr do

5Find all the obstacles O′inside the region of

optr;

6Space division(O′,optr);

2060

At ﬁrst, we initialize the root node of O-tree with reg that

covers the whole obstructed space. That is, the region of

root covers all the points and obstacles. Then we recursively

invoke the space division algorithm with Space division(O,

root) to build up the O-tree. As shown in Algorithm 2, if

the current O-tree node has obstacles in its covered region,

i.e., O̸=∅, we utilize OOB scheme to choose one obstacle

obs (Line 2). Then we divide the current node’s region into

four subregions and create four children nodes (Line 3). At

last, we recursively invoke the Space division algorithm to

build up subtrees for the subregions. Finally, the subtrees

are put together as an O-tree (Lines 4-6). After the O-tree

is constructed, we then ﬁll the leaf nodes of O-tree index

with corresponding R-trees by invoking CLN algorithm.

We analyze the time complexity of Space division Algo-

rithm. For each region, if there are nobstacles to divide

the space, utilizing OOB scheme to do the optimal space

division takes O(n2), and for its four children regions, it

takes O(4( n

4)2). So for all the space divisions, it takes

O(n2+ 4( n

4)2+ 4( n

42)2+... + 4( n

4h)2)=O(n2), where his

the height of the tree.

Algorithm 3: Complete Leaf Nodes Algorithm (CLN)

Input: A set of points P, a set of obstacles Oand

O-tree node denoted by Node

Output: A O-tree

1 if N ode is not Leaf Node then

2 for each child optr of Node do

3CLN(P, O, optr);

4 else

5Create an R-tree root node pointed by rptr for

Node;

6Find all the data objects inside region of ON ode

and insert them into the R-tree via rptr;

7 for each edge Eij of the region of Node do

8res ←CONN(Eij , O, P );

9insert res into the R-tree via rptr;

The pseudo code of the CLN algorithm is presented in

Algorithm 3. The basic idea is to hang corresponding R-

tree which contains ONNs of boundary and points inside

with each leaf node. To complete the leaf node of O-tree

with R-tree, we recursively traverse O-tree from the root.

For each O-tree node, we check whether it is a leaf node. If

not, we move on its children (Lines 1-3). If it is a leaf node,

we create an R-tree for the O-tree node by inserting all the

points inside this leaf node’s region into the R-tree (Lines

5-6). At last, we compute the ONNs for each edge of the

node’s region [6] and insert them into the R-tree (Lines 7-9).

6. QUERY PROCESSING ON O-TREE

In this section, we ﬁrst introduce a new RONN query pro-

cessing algorithm, called RONN-OA, based on O-tree. Next,

we analyze the time complexity of CONNB, RONN-RF and

RONN-OA algorithms.

6.1 RONN-OA Algorithm

Similar to RONN-RF, RONN-OA maintains four maxi-

mum obstructed distances (MODs), one for each edge of

the query range R, to ﬁlter out unqualiﬁed objects and to

determine the terminating condition. RONN-OA ﬁrst is-

sues a range query to obtain the internal RONNs, which

are used to initialize the four MODs. It is worth noting

that the range query also returns the leaf nodes of O-tree

that intersects with the query range. So the result of range

query of RONN-OA Algorithm is diﬀerent from that of range

query of RONN-RF. Meanwhile, the obstacles visited in the

process are retrieved to build a visibility graph (VG) for

later RONN processing. Next, with the leaf nodes of O-tree

obtained, RONN-OA continues to traverse the correspond-

ing R-trees to access the data objects to ﬁnd the external

RONNs. During the traversal of R-trees, a heap His used

to store the R-tree entries. Initially, Hcontains only the

root entries of R-trees associated with the retrieved O-tree

leaf nodes. By de-heaping the entry ewith the minimal

distance to R, RONN-OA checks whether the minimal Eu-

clidean distances from eto Ris greater than all the current

MODs. If this is true, ewill be eliminated and a new en-

try is de-heaped. Otherwise, if eis an MBB (i.e., internal

node of R-tree), all its child entries are examined for ﬁlter-

ing based on Theorem 1 (as RONN-RF does) and inserted

back to Hfor later processing. On the other hand, if eis a

data object, RONN-OA examines it as an RONN candidate,

and then updates RONN results and MODs as appropriate

(corresponding to each edge by ﬁnding split points on edges

of the range boundary, as discussed earlier for RONN-RF).

The above query processing steps (i.e., de-heaping a new en-

try from Hand so on) are repeated until Hbecomes empty.

The pseudo-code of RONN-OA algorithm can be found in

the Appendix A.

Recall the example in Figure 9. Given a query range R

as speciﬁed by the red rectangle, RONN-OA ﬁrst issues a

range query to obtain the internal RONN p4. At the same

time, it will obtain the corresponding leaf nodes R11 and

R14 that intersect with R. So we only need to traverse R11

and R14 for external RONN query. This ﬁlters a lot of data

objects to accelerate RONN query processing. Obstacles o1

and o2are also inserted into the visibility graph V G dur-

ing the process of performing the range query. Accordingly,

RONN-OA initializes the four MODs corresponding to each

edge of Rusing p4. Next, to traverse the R-trees correspond-

ing to R11 and R14, we initialize Hwith N1and N2, sorted

in ascending order of their Euclidean distances to R. We

proceed to pop the top entry N1from H. As N1consists of

p3and p4, we check and ﬁlter p3by the maintained MODs.

Meanwhile, we ﬁnd p4is already the RONN result so it is

not put back to H. Similarly, N2is de-heaped from Hand

then N3,N4are inserted back. Continue with the process,

N3is de-heaped and data objects p5and p6are inserted

back. Then we pop p5and ﬁnd it to be an RONN. Thus,

we update MODs corresponding to each edge of R. Finally,

p6is examined and included in the result set. At this stage,

Hbecomes empty so the query processing ends. The ﬁnal

RONN result set includes p4,p5and p6. For the example,

it is worth noting that p7and p8, which need to be exam-

ined before eventually being excluded from the result set in

RONN-RF, are not even considered in the processing due to

the strength of O-tree.

Theorem 2. The RONN-OA algorithm retrieves exactly

the ONN of every point on a given query range, i.e., the

algorithm has no false misses and no false hits.

Please see the proof in Appendix B.

2061

6.2 Complexity Analysis

In this section, we analyze the time complexity of the

three algorithms proposed in this paper. Let |OT |be the

size of O-tree, |T|be the size of R-tree used in RONN-RF,

|T′|be the total size of R-trees maintained in the leaf nodes

of O-tree, and |V G|be the number of nodes in visibility

graph. Moreover, let nCONNB , nRONN −RF and nRONN−OA

denote the number of data objects accessed during the ex-

ternal RONN search in CONNB, RONN-RF, and RONN-

OA, respectively. The time complexity of these algorithms

can be derived as follows.

Based on [6], the time complexity of the CONN algorithm

is O(nCONNB ×log |T|×|V G| × log |V G|). Thus, we can

easily deduce that the time complexity of our CONNB is

O(log |T|+4×nCONNB ×log |T|×|V G|×log |V G|)=O(nCONNB ×

log|T|×|V G| × log|V G|).

The time complexity of RONN-RF is O(log|T|+nRONN −RF ×

log |T|×|V G| × log |V G|)= O(nRONN −RF ×log|T| × |V G| ×

log|V G|) where O(log |T|) is for range query. Notice that

nRONN−RF is the number of data objects accessed during ex-

ternal RONN query, which is much smaller than 4×nCONNB

because RONN-RF traverses R-tree only once and ﬁlters a

lot of unqualiﬁed results during traversal of R-trees.

Finally, for RONN-OA, it takes O(log|OT |) for obtaining

the internal RONNs while obtaining the obstacles to con-

struct the visibility graph V G for the query. Additionally,

it traverses some R-trees to visit data objects in order to ﬁnd

external RONNs. For the nRONN−OA data objects accessed,

RONN-OA takes O(nRONN−OA ×log |T′|×|V G|× log |V G|).

Therefore, the time complexity is O(log |O T |+nRONN−OA ×

log |T′|×|V G| ×log |V G|)= O(nRONN −OA ×log|T′|×|V G|×

log|V G|). From the above analysis, we may observe that the

three algorithms have about the same order of complexity.

However, the number of data objects accessed during query

processing is a key factor contributing to the actual com-

putational cost which is to be evaluated experimentally in

Section 7.

7. EXPERIMENTS

We evaluate the performance of the proposed algorithms

for the RONN query. All the algorithms were implemented

in C++, while the experiments were conducted on an Intel

Core 2 Duo 3.40 GHz PC with 4GB RAM.

We conduct experiments on three real datasets.

•Greece. It consists of 22,650 MBRs of rivers (polylines)

as obstacles and 5,921 locations of cities and villages (as

the data objects).7

•Tiger Census Blocks. It contains 556,696 census blocks

(polygons) of Iowa, Kansa, Missouri, and Nebraska.7

•California Roads. It contains the MBRs of 2,249,727

streets (polylines) of California.7

In addition, we construct synthetic datasets using Spa-

tialDataGenerator.7We generated a synthetic dataset Uni-

form that contains 80,000 data objects and 20,000 line seg-

ments and rectangles as obstacles, respectively, following

uniform distribution. Also we generate another 80,000 uni-

formed distributed data objects as the data of Tiger Cen-

sus Blocks dataset. Finally, we randomly extract 500,000

streets from California Roads and generate two sets of 1

million data objects following uniform and zipf distributions

to form two large-scale datasets UniformCA and ZipfCA,

7http://www.chorochronos.org.

respectively. All datasets are normalized to a 10,000 ×

10,000 space in order to ﬁt the search range.

Table 1: Parameter ranges and defaults values

Parameters Range

♯of obstacles (|O|) 1,000, 2,000, 4,000, 8,000, 16,000, 500,000

♯of data objects (|P|) 5,000, 20,000, 10,000, 40,000, 80,000, 1,000,000

query size (qsize)100, 200, 300, 400, 500

We conduct the performance evaluation in two aspects:

(1) evaluating the eﬃciency of RONN algorithms – we com-

pare the latency of the proposed RONN algorithms under

various parameters (summarized in Table 7, numbers in bold

are the default settings); and (2) evaluating the eﬃciency of

O-tree – we compare the height, number of leaf nodes, in-

dex size and construction time of O-tree with respect to the

space division strategies under comparison. Three diﬀer-

ent kinds of obstacles are indexed: (1) line segment obsta-

cles (i.e., polylines in datasets); (2) rectangle obstacles (i.e.,

MBRs in the datasets); and (3) polygon obstacles. We ran-

domly generate ﬁve groups of RONN queries with diﬀerent

query sizes where each group of queries consists of 50 RONN

queries using SpatialDataGenerator. 8In each experiment,

we test one parameter at a time (other parameters are ﬁxed

at their default values). The reported experimental results

are obtained by averaging the processing time of queries.

7.1 Efﬁciency of RONN Algorithms

In this section, we evaluate the eﬃciency of the proposed

RONN algorithms, including CONNB, RONN-RF and RONN-

OA. For CONNB and RONN-RF algorithms, data and ob-

stacle sets are indexed with an R*-tree [2], where the page

size is ﬁxed at 4KB. For RONN-OA, we construct the O-

tree index by adopting the proposed optimal obstacle bal-

ance (OOB) scheme for space division. We measure the

average latency as the performance metric corresponding to

three diﬀerent parameters: (a) numbers of obstacles |O|; (b)

numbers of data objects |P|; and (c) query size qsize. Ac-

cordingly, we test the three RONN algorithms using diﬀerent

datasets as described above.

Eﬀect of the number of obstacles. In this experiment,

we compare the average running time of three algorithms for

processing RONN queries by increasing the number of obsta-

cles. The average running time for ﬁnding internal RONNs

and external RONNs, are depicted in Fig. 10. Fig. 10(a)

shows the result on Uniform Segment dataset. As shown, it

takes very little time to ﬁnd the internal RONNs (too small

to be seen), as the main cost in answering RONN queries is

spent on external RONN search. RONN-OA and RONN-RF

outperform CONNB signiﬁcantly, with 360% and 300% im-

provement, respectively. On average, RONN-OA takes 50ms

for each query. The excellent performance was achieved

by the eﬀective pruning facilitated by O-tree. As a re-

sult, RONN-OA searches a small number of R-trees for only

once. RONN-RF also performs much better than CONNB

because it traverses R-tree only once (instead of four times

as CONNB does). The ﬁltering in RONN-RF also helps. As

expected, the performance becomes worse when the number

of obstacles increases, especially in Greece Segment dataset,

but all three algorithms scale well. The results obtained

using the other three datasets are shown in Figs. 10(b)-

(d). The observed trend and conclusion are consistent with

8Query size denotes the size of edges in query rectangle.

2062

100

150

200

250

16,0008,0004,0002,0001,000

Running time (ms)

#. of obstacles (|P|=40,000, qsize=100)

CONNB

RONN-RF

RONN-OA

Internal RONN

(a) Uniform Segment

100

150

200

250

300

16,0008,0004,0002,0001,000

Running time (ms)

#. of obstacles ( |P|=40,000, qsize=100)

CONNB

RONN-RF

RONN-OA

Internal RONN

(b) Uniform Rectangle

100

150

200

250

16,0008,0004,0002,0001,000

Running time (ms)

#. of obstacles (|P|=5,921, qsize=100)

CONNB

RONN-RF

RONN-OA

Internal RONN

100

150

200

250

300

350

400

450

16,0008,0004,0002,0001,000

Running time (ms)

#. of obstacles (|P|=5,921, qsize=100)

CONNB

RONN-RF

RONN-OA

Internal RONN

(d) Greece Rectangle

Figure 10: Performance vs. |O|

100

150

200

250

8*104

4*104

2*104

104

0.5*104

Running time (ms)

#. of P (|O|=8000, qsize=100)

CONNB

RONN-RF

RONN-OA

(a) Uniform Segment

100

150

200

250

300

350

400

8*104

4*104

2*104

104

0.5*104

Running time (ms)

#. of P (|O|=8000, qsize=100)

CONNB

RONN-RF

RONN-OA

(b) Uniform Rectangle

100

5*103

4*103

3*103

2*103

103

Running time (ms)

#. of P (|O|=8000, qsize=100)

CONNB

RONN-RF

RONN-OA

100

120

140

5*103

4*103

3*103

2*103

103

Running time (ms)

#. of P (|O|=8000, qsize=100)

CONNB

RONN-RF

RONN-OA

(d) Greece Rectangle

Figure 11: Performance vs. |P|

100

150

200

250

300

500400300200100

Running time (ms)

Query size(|P|=40,000, |O|=8,000)

CONNB

RONN-RF

RONN-OA

(a) Uniform Segment

100

150

200

250

300

500400300200100

Running time (ms)

Query size(|P|=40,000, |O|=8,000)

CONNB

RONN-RF

RONN-OA

(b) Uniform Rectangle

100

150

200

250

300

350

400

500400 300 200 100

Running time (ms)

Query size(|P|=5,921, |O|=8,000)

CONNB

RONN-RF

RONN-OA

100

200

300

400

500

500400 300 200 100

Running time (ms)

Query size(|P|=5,921, |O|=8,000)

CONNB

RONN-RF

RONN-OA

(d) Greece Rectangle

Figure 12: Performance vs. query size

0.2

0.4

0.6

0.8

500400300200100

Propotion of objects accessed

Query size (|P|=5921, |O|=8,000)

RONN-RF

RONN-OA

(a) Greece Segment

0.2

0.4

0.6

0.8

500400300200100

Propotion of objects accessed

Query size(|P|=5921, |O|=8,000)

RONN-RF

RONN-OA

(b) Greece Rectangle

Figure 13: Proportion of data objects accessed

Fig. 10(a). Compare with the obstacles represented as line

segments, all the three algorithms spent longer time when

obstacles are represented as rectangles. This is because more

vertices of obstacles need to be inserted to their correspond-

ing visibility graphs.

Eﬀect of the number of data objects. Fig. 11 com-

pares the performance of RONN algorithms by varying the

number of data objects. For the Uniform dataset with seg-

ment and rectangle obstacles, respectively, all the three al-

gorithms answer the query within half second. Both RONN-

RF and RONN-OA are very eﬃcient, cost less than 0.1 sec-

ond when |P|was 80,000. RONN-OA outperforms the two

R-tree based algorithms signiﬁcantly under various number

of data objects because O-tree index provides good ﬁltering

ability. Moreover, the processing time of all three algorithms

increases as the number of data objects increases. Also com-

paring Fig. 11(a) and Fig. 11(b), the three algorithms spent

longer time when obstacles were represented as rectangles.

The results on Greece dataset show that the three algorithms

take less time compared with the Uniform dataset because

the number of data objects in Uniform dataset is smaller.

Eﬀect of query size. We then compare the RONN algo-

rithms by varying query sizes and show the result in Fig. 12.

For Uniform dataset, the results in Fig. 12(a) shows the

superiority of RONN-OA over the two R-tree based algo-

rithms. The search time of RONN-OA only takes about 35

ms, but the CONNB algorithm takes more than 150 ms,

while RONN-RF takes almost 45 ms, because O-tree eﬀec-

tively divides the obstructed space into non-obstacle space.

As a result, points in the non-obstacle spaces can be eﬀec-

tively ﬁltered using R-trees. As Figs. 12(a) and 12(b) show,

it takes more time to process rectangle dataset than line seg-

ment dataset. Similar results on Greece dataset are observed

in Figs. 12(c) and 12(d).

Proportion of the number of data objects accessed.

As mentioned in the complexity analysis earlier, the num-

ber of data objects accessed during query processing is a

2063

16,0008,0004,0002,0001,000

Height

#. of obstacles

Random

Obs-Ctr

OOB

(a) Uniform Segment

16,0008,0004,0002,0001,000

Height

#. of obstacles

Random

Obs-Ctr

OOB

(b) Uniform Rectangle

16,0008,0004,0002,0001,000

Height

#. of obstacles

Random

Obs-Ctr

OOB

16,0008,0004,0002,0001,000

Height

#. of obstacles

Random

Obs-Ctr

OOB

(d) Greece Rectangle

Figure 14: The height of O-Tree vs. ♯of O

key factor. We compare our RONN-OA and RONN-RF al-

gorithm on proportion of the number of data objects ac-

cessed (dividing the number of data objects accessed by

CONNB algorithm) by varying query size in Fig. 13. In

Fig. 13(a), RONN-OA and RONN-RF, signiﬁcantly outper-

forms CONNB, and RONN-OA outperforms RONN-RF be-

cause O-tree index can ﬁlter some data objects whose Eu-

clidean distance to query range is small but obstructed dis-

tance to query range is big while R-tree can not ﬁlter such

kind of data objects. The similar results are shown in on

Rectangle dataset in Fig. 13(b).

7.2 Construction of the O-tree Index

As discussed in Section 5, space division is an essential

issue in construction of O-tree. To achieve tree balance, we

propose the optimal obstacle balance (OOB) scheme. In this

section, we evaluate the performance of the OOB scheme, in

comparison with the following two baseline schemes:

•Random. This scheme randomly chooses an obstacle for

space division in construction of O-tree.

•Obs-Ctr. Obs-Ctr is to simply use the perpendicular

bisector of selected obstacle as the perpendicular line

for space division. It chooses the obstacle with the best

score to divide the current space.

To compare these space division schemes, we use the heights

and leaf nodes of the corresponding O-trees as performance

metrics, as they reﬂect how balanced these O-trees are, i.e.,

the smaller the height and the number of leaf nodes are, the

better the tree balance is. Additionally, we compare the size

of the resulted index and their construction time.

Fig. 14 depicts the tree height of the space division schemes

by varying the numbers of obstacles in four datasets. We

can see that the OOB scheme outperforms the other two

schemes in all circumstances using diﬀerent datasets with

diﬀerent sizes of obstacle sets. The result also shows that

Obs-Ctr is better than Random. When increasing the num-

ber of obstacles, the tree height for all schemes increases

slowly in the experiment on uniform obstacle sets. How-

ever, for skewed datasets, i.e., Greece dataset, the impact of

the number of obstacles is obviously more signiﬁcant, espe-

cially on the Random and Obs-Ctr schemes. OOB performs

the best because it achieves better obstacle balance in each

space division step. In addition, the Obs-Ctr scheme per-

forms better than the Random scheme in general, as they

do achieve certain degree of obstacle balance. Moreover,

by comparing Fig. 14(a) and Fig. 14(b), we observe that the

tree height resulted from various schemes are higher on Rect-

angle dataset than on the Segment dataset, because these

schemes deal with more obstacles on Rectangle dataset.

To evaluate those schemes more comprehensively, we fur-

ther compare the number of leaf nodes in their resulted O-

trees. With the growing numbers of obstacles, the number of

leaf nodes in the tree is larger. And the number of leaf nodes

using Random scheme increases faster than that using OOB

or Obs-Ctr, showing OOB schemes achieving better tree bal-

ance. The corresponding experiment results and additional

experimental results about index size and construction time

can be found in Appendix D.

In summary, the height and the number of leaf nodes show

that the proposed OOB scheme achieves the best tree bal-

ance among these three schemes.

8. CONCLUSION

In this paper, we formulate a new query, namely, range-

based obstructed nearest neighbor (RONN) query, for ﬁnding

nearest neighbors to a given spatial range in the obstructed

space. We carry out a systematic study on the RONN query.

First, we propose the CONNB algorithm as a baseline for

the tackled RONN problem. To address the shortcomings of

the CONNB algorithm, we further propose a new RONN-

RF algorithm, that explores heuristics and ﬁltering strate-

gies via seamless integration of range and CONN queries

to facilitate eﬃcient query processing. Next, we propose

a new index, called O-tree, dedicated for indexing objects

in the obstructed space. The novelty of O-tree lies in the

idea of dividing the obstructed space into non-obstacle space,

aiming to exploit eﬃcient query processing in non-obstacle

subspaces. We propose a space division scheme OOB to ad-

dress the tree balance problem. Moreover, we propose an

eﬃcient algorithm, called RONN-OA, which exploits O-tree

to accelerate query processing of RONN. At last, a compre-

hensive performance evaluation is conducted to validate the

proposed ideas and demonstrate the eﬃciency and eﬀective-

ness of the proposed index and algorithms.

This work may lead towards several new directions for

future work, e.g., diﬀerent shapes of query range, mobile

obstacles, as well as parallel processing techniques.

9. ACKNOWLEDGMENTS

This work is partially supported by the NSF of China for

Outstanding Young Scholars under grant No. 61322208, the

National Basic Research Program of China (973 Program)

under grant No. 2012CB316201, the NSF of China for Key

Program under grant No. 61532021, and the NSF of China

under grant Nos. 61272178 and 61572122. We are grateful

to the anonymous reviewers for their constructive comments

on this paper. Xiaochun Yang is the corresponding author

of this work.

2064

10. REFERENCES

[1] J. Bao, C.-Y. Chow, M. F. Mokbel, and W.-S. Ku.

Eﬃcient evaluation of k-range nearest neighbor

queries in road networks. MDM, pages 115–124, 2010.

[2] N. Beckmann, H.-P. Kriegel, R. Schneider, and

B. Seeger. The R*-tree: An eﬃcient and robust access

method for points and rectangles. SIGMOD, pages

322–331, 1990.

[3] N. Chan and T. Dang. On eﬃcient processing of

complicated cloaked region for location privacy aware

nearest-neighbor queries. International Conference on

Information Communication Technology, pages

101–110, 2013.

[4] M. de Berg, M. van Kreveld, M. Overmars, and

O.Schwarzkopf. Computational Geometry: Algorithms

and Applications, Second Edition. Springer-Verlag,

Reading, Massachusetts, 2000.

[5] P. Galdames and Y. Cai. Eﬃcient processing of

location-cloaked queries. IEEE INFOCOM,

131(5):2480–2488, 2012.

[6] Y. Gao and B. Zheng. Continuous obstructed nearest

neighbor queries in spatial databases. SIGMOD, pages

577–558, 2009.

[7] Y. Gao, B. Zheng, W.-C. Lee, and G. Chen.

Continuous visible nearest neighbor queries. EDBT

09, pages 144–155, 2009.

[8] B. Gedik and L. Liu. A customizable k-anonymity

model for protecting location privacy. Distributed

Computing Systems (ICDCS), pages 763–774, 2005.

[9] M. Gruteser and D. Grunwald. Anonymous usage of

location-based services through spatial and temporal

cloaking. ACM MobiSys, pages 763–774, 2003.

[10] H. Hu and D. L. Lee. Range nearest-neighbor query.

TKDE, 18(1):78–91, 2006.

[11] C. H. J and J. R. Eﬃcient processing of moving

k-range nearest neighbor queries in directed and

dynamic spatial networks. Mobile Information

Systems, 2016.

[12] P. Kalnis, G. Ghinita, K. Mouratidis, and

D. Papadias. Preventing location based identity

inference in anonymous spatial queries. TKDE,

26(3):1719–1733, 2007.

[13] A. Khoshgozaran, C. Shahabi, and H. Shirani-Mehr.

Enabling location privacy; moving beyond

k-anonymity, cloaking and anonymizers. Knowledge

Information Systems, 26(3):435–465, 2011.

[14] H.-I. Kim and J.-W. Chang. k-nearest neighbor query

processing algorithms for a query region in road

networks. Journal of Computer Science and

Technology, 28(4):585–596, 2013.

[15] E. Lee, H. Cho, T. Chung, and K. Ryu. Moving range

k nearest neighbor queries with quality guarantee over

uncertain moving objects. Information Sciences,

325:324–341, 2015.

[16] X. Lin, J. Xu, and H. Hu. Range-based skyline queries

in mobile environments. TKDE, 25(4):835–849, 2013.

[17] M. F. Mokbel, C. Y. Chow, and W. G. Aref. The New

Casper: Query processing for location services without

compromising privacy. VLDB, pages 763–774, 2006.

[18] K. Mouratidis, M. Yiu, D. Papadias, and

N. Mamoulis. Continuous nearest neighbor monitoring

in road networks. VLDB, pages 43–54, 2006.

[19] S. Nutanong, E. Tanin, and R. Zhang. Visible nearest

neighbor queries. DASFAA, pages 876–883, 2007.

[20] Y. Tao, D. Papadias, and Q. Shen. Continuous nearest

neighbor search. VLDB, pages 287–298, 2002.

[21] A. K. Tung, J. Hou, and J. Han. Spatial clustering in

the presence of obstacles. ICDE, pages 322–331, 1990.

[22] X. Wang and H. J. Hamilton. Clustering spatial data

in the presence of obstacles. International Journal on

Artiﬁcial Intelligence Tools, 14(1-2):177–198, 2005.

[23] C. Xia, D. Hsu, and H. Tung. A fast ﬁlter for

obstructed nearest neighbor queries. BNCOD, pages

203–215, November 2004.

[24] C. Zhang and Y. Huang. Cloaking locations for

anonymous location based services: a hybrid

approach. Geoinformatica, pages 159–182, 2009.

[25] J. Zhanga, D. Papadias, K. Mouratidis, and M. Zhu.

Spatial queries in the presence of obstacles. EDBT,

pages 366–384, 2004.

[26] B. Zheng, W.-C. Lee, and D. L. Lee. On searching

continuous k nearest neighbors in wireless data

broadcast systems. IEEE Transactions on Mobile

Computing, 6(7):748–761, 2007.

[27] H. Zhu, J. Wang, B. Wang, and X. Yang. Location

privacy preserving obstructed nearest neighbor

queries. Journal of Computer Research and

Development, 51(1):115–125, 2014.

APPENDIX

A. PSEUDO-CODE OF RONN-OA

Algorithm 4 depicts the pseudo-code of the RONN-OA

Algorithm. It ﬁrst issues an internal RONN query (see Al-

gorithm 5) (Line 1). The corresponding leaf nodes intersect

with the query range are returned. Notice that this is done

simultaneously while traversing O-tree to perform the range

query. With support of O-tree, CONN-OA avoids travers-

ing O-tree from the root again (unlike what RONN-RF does

to process the external RONN query). Lines 7-27 are very

similar to the RONN-RF Algorithm. The main diﬀerence is

that we initiate the min-heap with the root entries of R-trees

associated with the found leaf nodes of O-tree in Line 1.

The range query used in RONN-OA is depicted in Algo-

rithm 5. We recursively traverse O-tree to the leaf nodes.

For each node visited, we ﬁrst check whether its region in-

tersects with the query range R(Line 1). If yes, we con-

tinue to check whether the node is a leaf node. There are

two conditions: (i) if the node is a leaf node, we will tra-

verse its associated R-tree to ﬁnd data objects located in the

query range and add them into the result set Res. Mean-

while, the leaf node itself is added to a leaf node list, namely,

LNodeList (Lines 2-4). (ii) if the node is not a leaf node,

we will recursively search each of its child node (Lines 5-7).

B. PROOF OF THEOREM 2

Proof. We prove Theorem 2 by showing that the RONN-

OA algorithm returns result with no false misses and no false

hits.

(1) The result returned by the RONN-OA algorithm has

no false misses. According to the RONN-OA algorithm, we

know that an object pcould be an answer to RONN only if

(i) it is inside a region overlapped with the query range, or

2065

Algorithm 4: RONN-OA Algorithm

Input: A O-tree OT , query range R, and min-heap H

Output: RONN results list RS

1RQ(R, OT ,InternalRes,LN odeList);

2 for k= 1 to 4 do

3RSk←InternalRes;overk←0;

4RS ←InternalRes;

5 for Each node n∈LN odeList do

6insert( n.rptr.root,mindist(n, R)) into H;

7 while true do

8k←1;

9 for Each edge Eij of Rdo

10 RSk←FindSP(Eij , RSk);

11 MODk←max odis(RSk);

12 if H.head.key > M ODkthen

13 overk= 1.

14 if over1&over2&over3&over4or H.

=∅then

15 return;

16 k+ +;

17 while H̸=∅do

18 de-heap the top entry (e, key) of H;

19 if eis a data object then

20 check whether eis in result RS; /* using

isInResult array*/

21 k←1;

22 for key < M ODkdo

23 insert einto RSk;k+ +;

24 break;

25 else

26 for each child entry ei∈edo

27 if mindist(ei, R)< MAX(M ODk)then

28 insert(ei, mindist(ei, R)) into H.

29 RS ←RS ∪RSk;

30 return RS;

Algorithm 5: O-tree::Range Query Algorithm (RQ)

Input: A query region R, a O-tree Node N ode, and a

Node list LN odeList.

Output: A set of data objects and LN odeList

1 if isIntersect(Node.r eg, R)then

2 if Node is leaf node then

3Res ←rangequery(R,Node.rptr);

4insert Node into LNodeList;

5 else

6 for Each optr of Node do

7RQ(R,optr,Res,LNodeList);

8return Res and LNodeList;

(ii) it belongs the ONNs to the boundary line segments of a

overlapped regions. Assume data object pdoes not satisfy

the above two conditions but pis an answer to RONN. Then

pis the nearest neighbor to any point inside the query range.

Let qis a such point inside the query range. Moreover, q

must exists inside a certain region Rithat are overlapped

with the query range. From Lemma 1, we know that the

ONN to qmust be inside Rior belong to the ONNs to

the boundary of Ri. This contradicts with the assumption.

Therefore, the returned results by RONN-OA has no false

misses.

(2) The RONN-OA algorithm inserts all the data objects

that satisfy the above two conditions into the heap for check-

ing if it is an RONN answer. Hence, the algorithm has no

false hits.

From (1) and (2) above, the proof completes.

C. EXTENSION ON O-TREE

In this section, we extend the O-tree index for polygons.

Fig. 15 shows an example of indexing a pentagon o1. For

space division, we can take each edge of the polygon as a sep-

arate obstacle line to divide the space. Then we can use the

division schema for obstacles represented as line segments

to divide the space. Firstly, we use the edge ab to divide the

region into four subregions and the root has four child nodes

(R1,R2,R3,R4). As R1can be divided into four subregions

(R11,R12 ,R13,R14 ) utilizing the edge ae,R4can be divided

into four subregions using cd, and the other two subregions

can directly form the leaf node. At last, R11 and R42 can

be divided into four subregions by de and bc, respectively.

An example of space division is shown on the top side of

the ﬁgure, while the corresponding O-tree is depicted on the

bottom side. Internal node R1stores obstacle ae, and optr

pointing to three O-tree children R11,R12, and R13 . Notice

that O-tree does not store R14 since R14 locates inside the

pentagon o1. Internal node R11 stores obstacle de, and optr

pointing to two O-tree children R111 and R114. Leaf node

R12 stores rptr pointing to an R-tree which contains data

objects p3and p4. The other nodes is similar to Internal

node R11 and leaf node R12 . Notice that when we use ab to

divide the whole space, we choose the perpendicular bisector

through the endpoint of de and cd to avoid splitting de and

cd into more obstacles.

R1R4

R13

R12

R11 R44

R43

R41

R114

R111 R433 R434

p3p4

p4p5

p4p5p6

p7p7

Heap status

p5, R41

R111, R114 , R41

These leaf nodes

by range query.

R12

R13

R14

R434

R433

R111

R114

R42 R41

R44

. . . . . .

hb, ci

hc, di

ha, ei

hd, ei

can be obtained

Figure 15: Index about polygon

Next, we introduce the RONN processing for polygon-

shaped obstacles based on O-tree. Firstly, we issue a range

query to obtain the interval RONN p4. At the same time,

it obtains the corresponding leaf nodes R111 ,R114 and R44

that intersect with R. Then we follow the similar steps of

RONN-OA shown in the heap status part of Fig. 15 until H

2066

becomes empty so the query processing ends. Actually, the

other data objects are ﬁltered by internal RONN p4except

p5and p6while searching O-Tree. Finally, the RONN result

set includes p4,p5and p6.

While computing the ONNs with polygon obstacles, we

observe a problem. As shown in Fig. 16, there is a rectangle

obstacle o, and data objects p1and p2. In our O-tree index

of this rectangle obstacle, we take each edge of the rectangle

as a separate line segment obstacle to perform space division

and build O-tree index. However, in query processing, when

we compute the obstructed distance of p2to p1, we choose

the distance of shortest path of ⟨p2, b, d, p1⟩instead of actual

shortest path ⟨p2, a, p1⟩. Actually, we can not go across from

bto d. This is the wrong crossing problem.

Figure 16: Wrong crossing

To avoid wrong crossing, we adopt an idea of ∆ shifting

to address the problem. As shown in Fig. 16, when going

through p2to b(ending points of obstacle), we assume it

goes through p2to b′which is obtained by shifting bwith ∆

distance (Note that ∆ is relatively small here).

D. EXTRA EXPERIMENTAL RESULTS

To evaluate the experiments comprehensively, we list some

extra experimental results.

D.1 Number of Leaf Nodes of O-tree

To evaluate the three space division schemes for building

an O-tree more comprehensively, we further compare the

number of leaf nodes in O-tree. As depicted in Fig.17(a),

when increasing the numbers of obstacles, the number of leaf

nodes in the tree become larger. In addition, the number

of leaf nodes using Random scheme increases faster than

that using OOB and Obs-Ctr. OOB scheme achieves the

best tree balance. As expected, more division needed to be

considered in Uniform Rectangle dataset, so the number of

leaf nodes is much larger than that on Uniform Segment

dataset. Figs.17(c) and 17(d) show the similar results on

Greece datasets.

D.2 Index Size and Construction Time

Then, we show the index size and construction time of

O-tree in Figs. 18(a)-(b) with respect to diﬀerent numbers

of obstacles on Uniform Segment dataset. For index size

in Fig.18(a), O-tree generated using the OOB scheme only

takes less than 10 MB, much smaller than the other two

schemes. This result is consistent with the results on the

height and the number of leaf nodes of O-trees. Addition-

ally, the Random scheme results in largely varied sizes of

O-trees. The construction times of index resulted from all

space division schemes are shown in Fig. 18(b). Random is

faster than the other two schemes because it takes no time

to decide how to divide the space. However, this results an

unbalanced tree which does not support RONN processing

well. The other schemes take under 12 minutes to ﬁnish

the index construction. In the last index comparison part of

experiments, we compare O-tree performance of space divi-

sion schemes when we assume each obstacle is an rectangle

on Uniform dataset, and the experimental results are shown

in Figs. 18(c)-(d) . O-tree index with line segment obstacles

is more eﬃcient for space division and building index in most

cases. This is because if we take each rectangle obstacle as

four line segment obstacles, many obstacles will be divided

into more parts, and thus more diﬃcult to deal with than

the line segment obstacles.

D.3 Overlap of data objects stored in leaf nodes

We further evaluate the number of overlapped data ob-

jects among leaf nodes in O-tree. We compare the three

space division schemes to see their eﬀects on the number of

overlapped data objects. Fig. 19 shows the comparison re-

sults. When increasing the number of obstacles, the number

of overlapped objects increases. When the number of ob-

stacles is less than 4,000, using OOB division scheme only

produces less than 0.3 overlap. Random division scheme

generates more overlap since it does not consider the den-

sity of the obstacles.

D.4 Comparison with Cloaking Query

Cloaking query in obstructed space [27] is the most related

work to ours. Cloaking query is mainly proposed for pro-

tecting the query point so that the untrusted server cannot

infer the correct query point among queries in the cloaked

region. Due to the lack of the exact location information at

the server, the anonymous query processor calculates a safe

region and provides all data objects inside the safe region

(i.e. candidates) instead of a single exact answer [17].

Diﬀerent from cloaking query, an RONN query returns

nearest neighbors to every query in the query region. A good

point is that RONN can also protect the real query point

and guarantees its returned result covering the exact answer

to the protected query point in the query range. RONNA

returns less number of points than the candidates returned

by the cloaked query, therefore saving the communication

cost as well as the post-processing time at client.

Fig. 18 shows the comparison between RONN-OA and

cloaking query in obstructed space [27]. We use the number

of returned points to represent communication cost. Fig. 20(a)

shows that RONN incurs much less communication cost than

cloaking query does. Fig. 20(b) shows that cloaking query

runs a little faster than RONN-OA since it directly returns

those candidates within the safe region without any exami-

nation. Notice that, more candidates returned from server

require more cost for post-processing at client since the ﬁnal

answer should be reﬁned and returned to the user. Fig. 20(c)

shows the processing time at client.

D.5 Results on Polygon Obstacles

We also test the running time of our algorithms on Tiger

Census Blocks datasets with polygon obstacles. They con-

tain uniform and zipf distributed data objects datasets, re-

spectively. Fig. 21 shows that RONN-OA and RONN-RF

outperform CONNB signiﬁcantly. The running time on uni-

form distributed dataset is faster than that on zipf dis-

tributed dataset, because the skewed data distribution makes

the maximum obstructed distances (MODs) (i.e. ﬁltering

distance) longer than uniform data. Moreover, more data

objects need to be examined.

2067

16,0008,0004,0002,0001,000

#. of leaf nodes (x103)

#. of obstacles

Random

Obs-Ctr

OOB

(a) Uniform Segment

16,0008,0004,0002,0001,000

#. of leaf nodes (x103)

#. of obstacles

Random

Obs-Ctr

OOB

(b) Uniform Rectangle

100

150

200

16,0008,0004,0002,0001,000

#. of leaf nodes (x103)

#. of obstacles

Random

Obs-Ctr

OOB

100

150

200

250

300

16,0008,0004,0002,0001,000

#. of leaf nodes (x103)

#. of obstacles

Random

Obs-Ctr

OOB

(d) Greece Rectangle

Figure 17: The number of leaf nodes

16,0008,0004,0002,0001,000

Index size (MB)

#. of obstacles

Random

Obs-Ctr

OOB

(a) Uniform Segment

16,0008,0004,0002,0001,000

Construction time (minutes)

#. of obstacles

Random

Obs-Ctr

OOB

(b) Uniform Segment

16,0008,0004,0002,0001,000

Index size (MB)

#. of obstacles

Random

Obs-Ctr

OOB

16,0008,0004,0002,0001,000

Time (minutes)

#. of obstacles

Random

Obs-Ctr

OOB

(d) Uniform Rectangle

Figure 18: Index size and construction time on Uniform Segment obstacles

0.5

1.5

2.5

16,0008,0004,0002,0001,000

Overlap number/|P|

|P|/|O|

OOB

Obs-Ctr

Random

Figure 19: Data overlap

100

500400300200100

Communication cost

Query size(|P|=40,000, |O|=8,000)

RONN-OA

Cloaking query

(a) Communication cost

100

500400300200100

Running time (ms)

Query size(|P|=40,000, |O|=8,000)

RONN-OA

Cloaking query

(b) Running time at server

100

500400300200100

Processing time (ms)

Query size(|P|=40,000, |O|=8,000)

RONN-OA

Cloaking query

Figure 20: Comparison with Cloaking query

200

400

600

800

1000

500400300200100

Running time (ms)

Query size(|P|=40,000, |O|=8,000)

CONNB

RONN-RF

RONN-OA

(a) Uniform Tiger

500

1000

1500

2000

2500

3000

3500

4000

500400300200100

Running time (ms)

Query size (|P|=40,000, |O|=8,000)

CONNB

RONN-RF

RONN-OA

(b) Zipf Tiger

Figure 21: Performance on polygon obstacles

100

120

500400300200100

Running time (sec)

Query size (|P|=1000KB, |O|=500KB)

CONNB

RONN-RF

RONN-OA

(a) (UniformCA) dataset

100

120

140

160

500400300200100

Running time (sec)

Query size (|P|=1000KB, |O|=500KB)

CONNB

RONN-RF

RONN-OA

(b) (ZipfCA) dataset

Figure 22: Performance on large datasets

D.6 Results on Large Datasets

We also evaluate the three algorithms on large datasets

UniformCA and ZipfCA and show the experimental results

in Fig. 22. The running time for the three algorithms were

within 2 minutes on UniformCA dataset, faster than the

time on ZipfCA.

Our RONN-OA and RONN-RF outperform the baseline

algorithm CONNB with 405% and 300% improvement, re-

spectively, which is consistent to the result on the other

tested datasets. For example, when query size was 300, the

running time was 75.3 seconds, 18.9 seconds, and 14 seconds

for CONNA, RONN-RF, and RONN-OA, respectively. We

can see that RONN-OA improves RONN-RF by 35% on the

large dataset UniformCA.

2068

On VR Spatial Query for Dual Entangled Worlds

Conference Paper

Nov 2019

With the rapid advent of Virtual Reality (VR) technology and virtual tour applications, there is a research need on spatial queries tailored for simultaneous movements in both the physical and virtual worlds. Traditional spatial queries, designed mainly for one world, do not consider the entangled dual worlds in VR. In this paper, we first investigate the fundamental shortest-path query in VR as the building block for spatial queries, aiming to avoid hitting boundaries and obstacles in the physical environment by leveraging Redirected Walking (RW) in Computer Graphics. Specifically, we first formulate Dual-world Redirected-walking Obstacle-free Path (DROP) to find the minimum-distance path in the virtual world, which is constrained by the RW cost in the physical world to ensure immersive experience in VR. We prove DROP is NP-hard and design a fully polynomial-time approximation scheme, Dual Entangled World Navigation (DEWN), by finding Minimum Immersion Loss Range (MIL Range). Afterward, we show that the existing spatial query algorithms and index structures can leverage DEWN as a building block to support kNN and range queries in the dual worlds of VR. Experimental results and a user study with implementation in HTC VIVE manifest that DEWN outperforms the baselines with smoother RW operations in various VR scenarios.

On VR Spatial Query for Dual Entangled Worlds

Preprint

Aug 2019

Direction-Aware Nearest Neighbor Query

Article

Full-text available

Feb 2019

Although the direction aspect is as important as the distance aspect when retrieving spatial objects, the studies of direction-aware queries are limited. The direction-based surrounder query (DBS query) cannot guarantee a good directionally diversity. In this paper, we propose the direction-aware nearest neighbor query (DNN query), which can recommend the nearest objects and ensure that the results have a good directionally diversity. Given a query point q and an angular threshold θ, the DNN query searches for the nearest neighbors on various directions around q. The DNN query can apply to the scenario where the user searches for the nearest objects around him. It can also apply to the scenario where the user searches for the photos of a geographic object according to the locations where the photos are taken. It can find photos that capture different views of the query object. In this paper, we propose the point-DNN query, where the query object is a point, and extend it to the range-DNN query, where the query object is a rectangle. We design algorithms to answer the DNN queries, and also conduct comprehensive experiments to evaluate the performances of the algorithms.

Obstructed Nearest Neighbor Query Under Uncertainty in the Internet of Things Environment

Article

Full-text available

Feb 2021

The internet of things provides a convenient life for all of us. Physical devices with locational systems connected to the internet can provide navigation and recommendation services in cooperation with the Location Based Services(LBS). The nearest neighbor query is the most common query type in the LBS research area. On the other hand, due to privacy protection or inaccuracies in measurement instructions, data uncertainties are typical in the IoT environment. This uncertainty will affect the results of nearest neighbor queries. The existence of plane obstacles also brings the challenge to the spatial data query. In this paper, the obstacles and data uncertainties in continuous nearest query are studied. We also designed a pruning strategy that significantly reduced the number of objects to be calculated. In addition, we propose a concept of safe region and an algorithm to generate a safe region. We also designed an index method for saving safe regions. The experimental results show that our approach outperforms the state-of-the-art one in efficiency and scalability.

Processing Conflict-Aware $k$ Nearest Neighbor Queries in Euclidean Space

Conference Paper

Feb 2024

Bidirectional k-nearest neighbor spatial crowdsourcing allocation protocol based on edge computing

Article

Feb 2023

Spatial crowdsourcing refers to the allocation of crowdsourcing workers to each task based on location information. K-nearest neighbor technology has been widely applied in crowdsourcing applications for crowdsourcing allocation. However, there are still several issues need to be stressed. Most of the existing spatial crowdsourcing allocation schemes operate on a centralized framework, resulting in low efficiency of crowdsourcing allocation. In addition, these spatial crowdsourcing allocation schemes are one-way allocation, that is, the suitable matching objects for each task can be queried from the set of crowdsourcing workers, but cannot query in reverse. In this article, a bidirectional k-nearest neighbor spatial crowdsourcing allocation protocol based on edge computing (BKNN-CAP) is proposed. Firstly, a spatial crowdsourcing task allocation framework based on edge computing (SCTAFEC) is established, which can offload all tasks to edge nodes in edge computing layer to realize parallel processing of spatio-temporal queries. Secondly, the positive k-nearest neighbor spatio-temporal query algorithm (PKNN) and reverse k-nearest neighbor spatio-temporal query algorithm (RKNN) are proposed to make the task publishers and crowdsourcing workers conduct two-way query. In addition, a road network distance calculation method is proposed to improve the accuracy of Euclidean distance in spatial query scenarios. Experimental results show that the proposed protocol has less time cost and higher matching success rate compared with other ones.

Processing Continuous k Nearest Neighbor Queries in Obstructed Space with Voronoi Diagrams

Article

Dec 2020

With the emergence and growing popularity of and location-based service (LBS) technologies, the continuous k nearest neighbor (CO k NN) query in obstructed space is becoming a very important service. In this article, we study the CO k NN in obstructed space, which retrieves k obstructed nearest neighbors (ONNs) for every point on a query segment q̄ . The state-of-the-art approach, called Euclidean based CONN (E-CONN) , exploits an R-tree to traverse the dataset P in ascending order of their Euclidean distances to q̄ . Taking a different point of view, in this article, we explore the idea of Voronoi diagram to define the notion of obstructed Voronoi diagram (OVD) . The Voronoi cells with obstacles are divided into visible and invisible regions for quickly answering nearest neighbor queries. To facilitate efficient retrieval of Voronoi cells and processing of continuous nearest neighbor (CONN) queries, we propose a new grid-based index, called Voronoi diagram with Obstacles in Grid (VO-Grid) , which indexes Voronoi cells and associated obstacle information with a grid file. Based on VO-Grid, we propose an efficient algorithm, called CONN with VO-Grid Acceleration (CONN-VOA) , to accelerate the CONN query processing. Moreover, we extend CONN-VOA to the CO k NN query, which also explores effective filtering and early termination for reducing redundant access of data objects. A comprehensive performance evaluation using both real and synthetic datasets is conducted to validate the proposed ideas and demonstrate the efficiency of our algorithms. The experimental results show that the CONN-VOA algorithm substantially outperforms E-CONN algorithm.

Range Nearest Neighbor Query with the Direction Constraint

Chapter

Feb 2020

In this paper, we study a direction-aware spatial data query method, i.e., range nearest neighbor query with the direction constraint (Range-DCNN query). Traditional DCNN query retrieves the top-k nearest neighbors within an angular range. Our Range-DCNN query finds all nearest neighbors within an angular range for all points in a rectangle. Dissimilar to the traditional DCNN query, the user’s location in the Range-DCNN query is abstracted to a rectangle rather than a point and the user’s location can be anywhere in the rectangle. In doing so, the user’s precise location will not be leaked, which ensures an effective privacy protection of user’s location. In Range-DCNN query, an observation is made that splitting points can be utilized to obtain all query results without having to search for all points. We propose some properties of locating splitting points. According to these properties, efficient algorithms are designed with the assistance of the R-tree. Extensive experiments have been conducted on both real and synthetic datasets. The experimental results demonstrate that our algorithms are capable of locating all results precisely and efficiently.

A Survey of Moving Objects kNN Query in Road Network Environment

Chapter

Jan 2020

Private Trajectory Data Publication for Trajectory Classification

Chapter

Sep 2019

Trajectory classification (TC), i.e., predicting the class labels of moving objects based on their trajectories and other features, has many important real-world applications. Private trajectory data publication is to anonymize trajectory data, which can be released to the public or third parties. In this paper, we study private trajectory publication for trajectory classification (PTPTC), which not only preserves the trajectory privacy, but also guarantees high TC accuracy. We propose a private trajectory data publishing framework for TC, which constructs an anonymous trajectory set for publication and use in data services to classify the anonymous trajectories. In order to build a “good” anonymous trajectory set (i.e., to guarantee a high TC accuracy), we propose two algorithms for constructing anonymous trajectory set, namely Anonymize-POI and Anonymize-FSP. Next, we employ Support Vector Machine (SVM) classifier to classify the anonymous trajectories. Finally, the experimental results show that our proposed algorithms not only preserve the trajectory privacy, but also guarantee a high TC accuracy.

Efficient Processing of Moving k -Range Nearest Neighbor Queries in Directed and Dynamic Spatial Networks

Article

Full-text available

Jan 2016

A k -range nearest neighbor ( k RNN) query in a spatial network finds the k closest objects to each point in the query region. The essential nature of the k RNN query is significant in location-based services (LBSs), where location-aware queries with query regions such as k RNN queries are frequently used because of the issue of location privacy and the imprecision of the associated positioning techniques. Existing studies focus on reducing computation costs at the server side while processing k RNN queries. They also consider snapshot queries that are evaluated once and terminated, as opposed to moving queries that require constant updating of their results. However, little attention has been paid to evaluating moving k RNN queries in directed and dynamic spatial networks where every edge is directed and its weight changes in accordance with the traffic conditions. In this paper, we propose an efficient algorithm called MORAN that evaluates moving k -range nearest neighbor (M k RNN) queries in directed and dynamic spatial networks. The results of a simulation conducted using real-life roadmaps indicate that MORAN is more effective than a competitive method based on a shared execution approach.

Efficient Processing of Location-Cloaked Queries

Conference Paper

Full-text available

Mar 2012

When requesting location-based services, users can associate their queries with a purposely blurred location such as a circular or rectangular geographic region instead of their exact position. This strategy makes it possible for privacy protection, but presents problems in query processing. Since the server does not know a user's exact position, it has to retrieve query results for each position inside the user's cloaking region. While the server workload dramatically increases, a client downloading all query results will waste its battery power, because most of the data may be irrelevant to its query interest. This paper considers the problems of efficient processing of location-cloaked queries (LCQs). Our key observation is that queries may overlap in their cloaking regions and thus share some query results. In light of this, we propose to pro-cess queries as a batch instead of one by one independently. The technical contributions of this paper are threefold. 1) We propose to decompose queries into subqueries based on their interested region. Since the subqueries with a common region need to be processed only once, the server workload is minimized. 2) We propose a novel scheduling technique that addresses the dilemma between minimizing server latency and ensuring good fairness in query processing. 3) We present a personalized air indexing technique by which a client can filter out and download only the needed query results, thus avoiding the waste of energy in downloading irrelevant data.

The R*-tree: An efficient and robust access method for points and rectangles

Article

Jan 1990

Anonymous usage of location-based services through spatial and temporal cloaking

Article

Jan 2003

On Efficient Processing of Complicated Cloaked Region for Location Privacy Aware Nearest-Neighbor Queries

Conference Paper

Mar 2013

The development of location-based services has brought not only conveniences to users’ daily life but also great concerns about users’ location privacy. Thus, privacy aware query processing that handles cloaked regions has become an important part in preserving user privacy. However, the state-of-the-art private query processors only focus on handling rectangular cloaked regions, while lacking an efficient and scalable algorithm for other complicated cloaked region shapes, such as polygon and circle. Motivated by that issue, we introduce a new location privacy aware nearest-neighbor query processor that provides efficient processing of complicated polygonal and circular cloaked regions, by proposing the Vertices Reduction Paradigm and the Group Execution Agent. In the Vertices Reduction Paradigm, we also provide a new tuning parameter to achieve trade-off between answer optimality and system scalability. Finally, experimental results show that our new query processing algorithm outperforms previous works in terms of both processing time and system scalability.

Location privacy preserving obstructed nearest neighbor queries

Article

Jan 2014

Location privacy has been a hot topic in recent years. However, the methods of existing location privacy preserving only support simple nearest neighbor queries, which do not consider the obstructed space. But in fact, the obstructed space is very popular in our life. Therefore, we study location privacy preserving obstructed nearest neighbor queries. Due to the effect of obstacles, this problem is also very hard. In this paper, we adopt a normal approach based on the third trusted party for privacy preserving obstructed nearest neighbor (ONN) queries. The approach can entertain the location-based service without leaking the user's exact location and obtain the exact answer. In our approach, firstly, the third trusted party constructs a cloaked region corresponding to the exact location and sends the cloaked region to the LBS. Then LBS processes the query region. In the process of query processing, we use two methods to return a set of candidate answers with respect to the cloaking region for the actual user location: 1) Basic approach of query processing, which uses the max obstructed distance of segment to expand the region and returning the results in the expanded region; 2) Improved approach of query processing, which further narrows the expanded region based on the basic approach. At last, the third trusted party returns the actual answer to the users corresponding to user's exact location. Finally, experimental results and proof theory show the effectiveness and correctness of our approach.

Moving range k nearest neighbor queries with quality guarantee over uncertain moving objects

Article

Jul 2015
INFORM SCIENCES

To avoid traffic accidents, drivers must constantly be aware of nearby vehicles. Unfortunately, nearby vehicles often go unnoticed because of various obstacles such as other vehicles, buildings, or poor weather. In this paper, we study Moving range k-nearest neighbor (MRkNN) queries as a tool for continuously monitoring nearby moving objects. A simple approach to processing MRkNN queries is to have each object periodically broadcast information regarding its movements (i.e., location and velocity at a particular time) to other objects. However, this simple technique cannot be used to process MRkNN queries owing to the limited network bandwidth in mobile peer-to-peer environments. Therefore, we address this bandwidth limitation by proposing a probabilistic algorithm, called MINT, for MovIng range k-NN queries with qualiTy guarantee over uncertain moving objects. MINT provides users with approximate answers with a quality guarantee, rather than exact answers, with near optimal communication costs. Using a series of simulations, we demonstrate the efficiency and efficacy of MINT in evaluating MRkNN queries with a quality guarantee.

Range-Based Skyline Queries in Mobile Environments

Article

Apr 2013

Skyline query processing for location-based services, which considers both spatial and nonspatial attributes of the objects being queried, has recently received increasing attention. Existing solutions focus on solving point- or line-based skyline queries, in which the query location is an exact location point or a line segment. However, due to privacy concerns and limited precision of localization devices, the input of a user location is often a spatial range. This paper studies a new problem of how to process such range-based skyline queries. Two novel algorithms are proposed: one is index-based (I-SKY) and the other is not based on any index (N-SKY). To handle frequent movements of the objects being queried, we also propose incremental versions of I-SKY and N-SKY, which avoid recomputing the query index and results from scratch. Additionally, we develop efficient solutions for probabilistic and continuous range-based skyline queries. Experimental results show that our proposed algorithms well outperform the baseline algorithm that adopts the existing line-based skyline solution. Moreover, the incremental versions of I-SKY and N-SKY save substantial computation cost, especially when the objects move frequently.

Computational Geometry, Algorithms And Applications

Book

Jan 2008

k-Nearest Neighbor Query Processing Algorithms for a Query Region in Road Networks

Article

Jul 2013

Recent development of wireless communication technologies and the popularity of smart phones are making location-based services (LBS) popular. However, requesting queries to LBS servers with users’ exact locations may threat the privacy of users. Therefore, there have been many researches on generating a cloaked query region for user privacy protection. Consequently, an efficient query processing algorithm for a query region is required. So, in this paper, we propose k-nearest neighbor query (k-NN) processing algorithms for a query region in road networks. To efficiently retrieve k-NN points of interest (POIs), we make use of the Island index. We also propose a method that generates an adaptive Island index to improve the query processing performance and storage usage. Finally, we show by our performance analysis that our k-NN query processing algorithms outperform the existing k-Range Nearest Neighbor (kRNN) algorithm in terms of network expansion cost and query processing time.

Range-based Obstructed Nearest Neighbor Queries

Abstract and Figures

Recommended publications

Efficient Bulk Loading to Accelerate Spatial Keyword Queries

Range-Based Nearest Neighbor Queries with Complex-Shaped Obstacles

Processing Continuous k Nearest Neighbor Queries in Obstructed Space with Voronoi Diagrams

Continuous visible nearest neighbor query processing in spatial databases

Continuous obstructed nearest neighbor queries in spatial databases