ArticlePDF Available

Attributing pedestrian networks with semantic information based on multi-source spatial data

March 2021
International Journal of Geographical Information Science 36(8):1-23

March 2021
36(8):1-23

DOI:10.1080/13658816.2021.1902530

Authors:

Xue Yang

China University of Geosciences （wuhan）

Kathleen Stewart

University of Maryland, College Park

Luliang Tang

Wuhan University

The lack of associating pedestrian networks, i.e. the paths and roads used for non-vehicular travel, with information about semantic attribution is a major weakness for many applications, especially those supporting accurate pedestrian routing. Researchers have developed various algorithms to generate pedestrian walkways based on datasets, including high-resolution images, existing map databases, and GPS data; however, the semantic attribution of pedestrian walkways is often ignored. The objective of our study is to automatically extract semantic information including incline values and the different categories of pedestrian paths from multi-source spatial data, such as crowdsourced GPS tracking data, land use data, and motor vehicle road (MVR) networks. Incline values for each pedestrian path were derived from tracking data through elevation filtering using wavelet theory and a similarity-based map-matching method. To automatically categorize pedestrian paths into five classes including sidewalk, crosswalk, entrance walkway, indoor path, and greenway, we developed a hierarchical strategy of spatial analysis using land use data and MVR networks. The effectiveness of our proposed method is demonstrated using real datasets including GPS tracking data collected by volunteers, land use data acquired from OpenStreetMap, and MVR network data downloaded from Gaode Map.

Inclination computation based on the matched tracking points and the different colors of points and lines in the above two panels present the different trace segments and pedestrian path segments, respectively; (a) the average value computation of elevation data; (b) inclination calculation.

…

Categories of pedestrian paths, (a) the entrance walkway of pedestrian networks; (b) the crosswalk of pedestrian networks.

…

Crowdsourced tracking data collected by volunteers in the City of Wuhan, (a) 3D perspective of the tracking points; (b) 2D perspective of the tracking points.

…

Land use data and MVR networks overlaid with pedestrian networks, (a) land use data; (b) MVR networks.

…

ASTER DEM and ASTER-derived elevation data on pedestrian trajectories in the experimental region; (a) DEM data with 30 m spatial resolution; (b) elevation data of tracking data converted to raster format.

…

Figures - uploaded by Xue Yang

Content may be subject to copyright.

Content uploaded by Xue Yang

Content may be subject to copyright.

Full Terms & Conditions of access and use can be found at

https://www.tandfonline.com/action/journalInformation?journalCode=tgis20

International Journal of Geographical Information

Science

ISSN: (Print) (Online) Journal homepage: https://www.tandfonline.com/loi/tgis20

Attributing pedestrian networks with semantic

information based on multi-source spatial data

Xue Yang, Kathleen Stewart, Mengyuan Fang & Luliang Tang

To cite this article: Xue Yang, Kathleen Stewart, Mengyuan Fang & Luliang Tang (2021):

Attributing pedestrian networks with semantic information based on multi-source spatial data,

International Journal of Geographical Information Science, DOI: 10.1080/13658816.2021.1902530

To link to this article: https://doi.org/10.1080/13658816.2021.1902530

View supplementary material

Published online: 30 Mar 2021.

Submit your article to this journal

Article views: 160

View related articles

View Crossmark data

RESEARCH ARTICLE

Attributing pedestrian networks with semantic information

based on multi-source spatial data

Xue Yang

, Kathleen Stewart

, Mengyuan Fang

and Luliang Tang

School of Geography and Information Engineering, China University of Geosciences, Wuhan, China;

Department of Geographical Sciences, University of Maryland, College Park, MD, USA;

State Key Laboratory

for Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan, China

ABSTRACT

The lack of associating pedestrian networks, i.e. the paths and roads

used for non-vehicular travel, with information about semantic attri-

bution is a major weakness for many applications, especially those

supporting accurate pedestrian routing. Researchers have developed

various algorithms to generate pedestrian walkways based on data-

sets, including high-resolution images, existing map databases, and

GPS data; however, the semantic attribution of pedestrian walkways

is often ignored. The objective of our study is to automatically extract

semantic information including incline values and the dierent cate-

gories of pedestrian paths from multi-source spatial data, such as

crowdsourced GPS tracking data, land use data, and motor vehicle

road (MVR) networks. Incline values for each pedestrian path were

derived from tracking data through elevation ltering using wavelet

theory and a similarity-based map-matching method. To automati-

cally categorize pedestrian paths into ve classes including sidewalk,

crosswalk, entrance walkway, indoor path, and greenway, we devel-

oped a hierarchical strategy of spatial analysis using land use data

and MVR networks. The eectiveness of our proposed method is

demonstrated using real datasets including GPS tracking data col-

lected by volunteers, land use data acquired from OpenStreetMap,

and MVR network data downloaded from Gaode Map.

ARTICLE HISTORY

Received 21 September 2020

Accepted 9 March 2021

KEYWORDS

Pedestrian networks;

semantic attribution; incline

values; pedestrian path

categorization; multi-source

spatial data

1. Introduction

Pedestrians, including individuals who travel on foot or tiny wheels, e.g. wheelchairs,

scooters, skateboards, etc., are usually recognized as a group of road users who are

vulnerable to dierent aspects of road use, such as limited access, injury, and crime,

especially in an outdoor environment (Zhang et al. 2012, Yang et al. 2020). To ensure

the safety and ecient travel of pedestrians, navigation applications and optimiza-

tion schemes should consider not only information about the geometry and con-

nectivity of pedestrian networks but also the semantic attribution of pedestrian

paths, such as incline values and the dierent categories of paths (John et al. 2017,

Sun et al. 2019). For example, a wheelchair user may not be able to climb a path

with over 10% incline; and a cyclist may choose a mountainous route for training or

CONTACT Luliang Tang tll@whu.edu.cn

Supplemental data for this article can be accessed here.

INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE

https://doi.org/10.1080/13658816.2021.1902530

a at route for commuting (John et al. 2017). Urban planners need to know what

kinds of paths connect, for example, a shopping mall with a metro system, and

further estimate the walkability based on the category of pedestrian paths (Elias

2007, Sun et al. 2019). Semantic attribution of pedestrian paths including incline

value and path category is an essential foundation for routing tools, planners, and

geospatial analysts to better understand and support pedestrian movements.

Over the past decade, studies on pedestrian network information extraction have been

conducted using various databases, such as existing map data, social media data, and GPS

tracking data (Yang et al. 2020). Current approaches for generating these networks have

been summarized from three dierent perspectives: buering methods, image proces-

sing, and collaborative mapping (Karimi and Kasemsuppakorn 2013). Buering methods

have been used to extract the structure of sidewalks and crosswalks from existing MVR

networks (Kim et al. 2009, Ballester et al. 2011, Tal and Handy 2012, Guo et al. 2017). Image

processing methods have been another choice for extracting pedestrian networks with

dierent kinds of path types. Since the rst two methods require signicant post-

processing work such as eliminating route segments where people cannot walk and

lling gaps caused by missing segments that were shielded by trees, buildings, etc.,

a collaborative mapping approach was developed (Kasemsuppakorn and Karimi 2009,

2013b, Martin and Rob 2013). The crowdsourced geographic data used in collaborative

mapping are derived from a combination of local knowledge, eld notes, and ‘Armchair

mapping

’ as well. Using crowdsourced tracking data to extract pedestrian networks is

one type of collaborative data collection, however, most research has only focused on

topology and geometry detection, and has ignored semantic information extraction

associated with pedestrian networks, such as incline values and the type or category of

paths (Xie and Ou 2018, Yang et al. 2020). The incline values used for current routing tools

(e.g. OpenTripPlanner

) have been mainly extracted from publicly available elevation

datasets, such as Shuttle Radar Topology Mission (SRTM) and Advanced Spaceborne

Thermal Emission and Reection Radiometer (ASTER) data. However, John et al. (2017)

found that these elevation data sources were associated with three issues: a high cost of

data acquisition, data being available only for a limited set of locations, and insucient

horizontal resolution or vertical accuracy. Meanwhile, path category information used for

walkability analyses is still mostly dependent on manual identication (Elias 2007,

Kasemsuppakorn and Karimi 2013, Sun et al. 2019).

In this study, we present an approach for automatically extracting information on

the category and ne-granular incline value of pedestrian paths from multi-source

spatial data including crowdsourced GPS tracking data, land use data, and motor

vehicle road (MVR) network data. The steps include computing pedestrian road

incline values using three-dimensional (3D) GPS tracking data; and pedestrian path

categorization through combining land use and MVR network data, with existing

pedestrian networks. To improve the reliability for computing incline values, two

steps involving the preprocessing of crowdsourced GPS tracking data were needed.

The rst step was to lter the GPS trajectory elevation data using an approach based

on wavelet theory. Then, we matched the GPS data to the existing pedestrian

networks based on a similarity measurement algorithm. The incline values were

calculated based on existing gradient calculation formulas. The types or categories

of pedestrian paths, such as the sidewalk, crosswalk, entrance walkway, indoor path,

2X. YANG ET AL.

and greenway, were detected based on the hierarchical strategy of spatial analysis

(HSSA) method that used land use data and MVR network data. It should be

emphasized that pedestrian path categories including sidewalk and crosswalk are

dened based on the spatial relation between pedestrian paths and other physical

infrastructures such as MVR networks, buildings, and green land; the detailed deni-

tions are shown in Section 3.3. The main contributions of this paper include: (1) how

spatial datasets can be used to extract semantic information relating to pedestrian

networks including path incline values and path categories that lls a gap in studies

involving detailed pedestrian network mapping; (2) how including paths with poten-

tial risks for pedestrians identied based on road incline values benets the approach

by highlighting risks for users of these paths, especially for mobility-restricted indi-

viduals; and (3) the identication of categories of pedestrian paths based on the

proposed HSSA method with an average precision and recall of 87.22% and 91.63%

respectively.

2. Related work

Studies on road information extraction from spatial datasets, e.g. GPS tracking data,

images, videos, etc., have been attracting more attention in recent years. Many of these

works have been conducted using detailed MVR information mining, such as multi-level

MVR information extraction (Uduwaragoda et al. 2013, Ding et al. 2014, Tang et al. 2016,

Yang et al. 2018a, 2018b, Zhang et al. 2020); and MVR change detection (Rade et al. 2018,

Tang et al. 2019a). The details of multi-level MVR information extraction included road

shapes, connectivity, and semantic attribution acquisition, such as road boundary, lane

marking, turns, road type, road width, etc. In contrast, studies on pedestrian network

generation have been relatively few and most of them only explore automatic acquisition

techniques from the perspective of geometry and connectivity (Ballester et al. 2011,

Kasemsuppakorn and Karimi 2013, 2013b, Xie and Ou 2018, Yang et al. 2020). For many

pedestrian-related applications, semantic attribution of pedestrian networks including

incline values and category of the path is also important, e.g. for accurate pedestrian

routing especially for wheelchair users, cyclists, and elderly persons, etc. As few studies

have been conducted to detect incline values and categories of pedestrian networks from

dierent spatial datasets, the problems of information detection with insucient accuracy

and low automation remain (John et al. 2017).

With the evolution of the Web and the generalization of positioning techniques,

massive amounts of spatial data have been produced, e.g. GPS tracking data, social

media data with location pins, and crowdsourced road maps shared via OSM

(OpenStreetMap); and used for extracting various location-related information (Chin

et al. 2008, Ben et al. 2016, Zhang and Ye 2017, Gao et al. 2020). In contrast to the most

common sources of elevation data, e.g. data collected from satellite missions such as

TanDem-X, airborne LiDAR, and terrestrial surveying (John et al. 2017), crowdsourced

tracking data with elevation information are collected by soliciting contributions from

volunteers and are a low-cost and ecient way to extract and create semantic attributions

of pedestrian networks. While individuals are guaranteed to follow pedestrian paths,

a DEM might return elevation values that are a few meters away from the path and that

could already dier by several meters if the terrain is hilly. John et al. (2017) proposed

INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE 3

extracting road incline values from crowdsourced tracking data based on a segmentation

strategy. Specically, they split streets into segments at the intersection points with other

streets and then extracted the average incline value for a street segment using the

matched tracking points. However, this coarse-grained approach does not provide

detailed information on the slopes of roads, especially for long roads with greater

inclination uctuations. Thus, using an average incline value to represent the changing

characteristics of pedestrian road surfaces is insucient for pedestrian-based geospatial

applications.

3. Methodology for semantic attribution extraction of pedestrian paths

3.1. Scope and overall idea

Semantic information for pedestrian paths such as incline values and path categories can

be extracted from multi-source spatial data, as shown in Figure 1.

In this study, two kinds of spatial data were used to extract the semantic information

relating to pedestrian networks. The rst was crowdsourced GPS tracking data collected

by volunteers using mobile positioning devices (e.g. mobile phones, hand-held GPS

Figure 1. The architecture of semantic information extraction for the pedestrian network using multi-

source spatial data.

4X. YANG ET AL.

devices) that were used to extract incline values for the pedestrian paths. Specically,

a crowdsourced trajectory is comprised of a set of corresponding tracking points, denoted

as T= (p

, . . ., p

), where n is the number of tracking points belonging to the trajectory.

Each tracking point is represented by p (x, y, z, t), where x, y, z, and t are the longitude,

latitude, elevation, and time stamps, respectively, for a tracking point. The second kind of

spatial data used in this study was land use data and MVR network data, which were

acquired from OSM and Gaode Map respectively. The land use data used in this study

included boundary information for green spaces and buildings within the study area. The

MVR network data included information on roads that captured the plane position of road

segments and nodes, connectivity of each segment, the number of lanes, driving direc-

tion constraints (e.g. one-way, or two-way), and road length. The land use data and MVR

network data together were applied to automatically categorize pedestrian networks.

3.2. Pedestrian path incline values extraction using crowdsourced tracking data

Existing positioning systems such as GPS (global positioning system) do not work per-

fectly in all locations, especially in urban areas. Some outliers caused by tall buildings,

shadowing, and multi-path issues, are mixed in with the raw positioning results. For

crowdsourced tracking data with 3D positioning information, there are positional errors

for both the spatial locations (x, y) and the elevations (z). To reduce the high-frequency

noise that may be present in the elevation data (John et al. 2017), we applied a wavelet

denoising method because of its eectiveness for removing high-frequency noise (Sardy

et al. 2001). Based on the denoising theory of the wavelet method, high-frequency noises

were mainly present in the high-frequency signal component. Thus, after signal decom-

position is applied, the low frequency component was used to reconstruct a ltered

elevation dataset. The specic value of the resolution levels during wavelet denoising

will be described in detail in the case study section of this paper. The second preproces-

sing step for incline values computation was map-matching. This was used to remove the

outliers present in the spatial locations (x, y); and assisted in computing the incline values

of the pedestrian paths. That is, we needed to get the elevation of the pedestrian path rst

based on the matched tracking points and then compute the incline values.

The base map for the pedestrian networks used during map-matching including

geometry and connectivity information was derived from crowdsourced tracking data

using a method proposed in a previous study (Yang et al. 2020). The geometry and

connectivity for pedestrian networks are usually similar but simpler than those for MVR

networks (Yang et al. 2020). Based on this consideration, we applied a similarity-based

map-matching algorithm after reviewing the existing map-matching methods proposed

for MVR networks (Yang et al. 2018b). Compared with existing map-matching methods

(e.g. probabilistic modeling), a similarity-based method was less complex and oered

more exibility concerning similarity modeling. For this study, the similarity between

tracking points and pedestrian paths was calculated based on two criteria: (1) the vertical

distance between the tracking point and the pedestrian path; (2) the angle dierence

between the tracking vector and the pedestrian path. The calculation of the similarity

between the GPS tracking data and pedestrian paths followed the method proposed by

Yang et al. (2018a). The specic values of weights of ω

and ω

, constant D, and similarity

threshold Ts, for similarity computation during map-matching are discussed further in the

INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE 5

case study section. The detailed steps for the similarity-based map-matching method are

shown in the Appendix. After map-matching, all tracking points were categorized into

two types: (1) successfully matched points and (2) unsuccessfully matched points.

Tracking points successfully matched to pedestrian paths were used to compute the

incline value of the corresponding pedestrian path based on its elevation, and unsuccess-

fully matched points were regarded as planar drifting points and removed.

To keep the details of incline values for pedestrian paths, we partitioned the pedestrian

path segment ps

into a series of sub-segments (denoted as: sps

) based on a partition size

(denoted as: α) during the incline values computation. The value of partition size α for any

pedestrian road segment ps

was determined based on the planar positioning accuracy of

tracking points.

For a sub-segment sps

, there may be some matched tracking points p

, p

, . . . p

, as

shown in Figure 2(a). We computed the average elevation value of these matched

tracking points and this value was used as the elevation information for sub-segment

sps

. The elevation data used in this paper was based on the WGS-84 Ellipsoid height (see

Figure 2(b)). Based on existing methods for computing inclines (John et al. 2017), the

incline value between pedestrian path sub-segment sps

and sps

m+1

was dened as:

im¼100%�Δem

dis cm;cmþ1

ð Þ (1)

where Δe

was the elevation dierence between pedestrian path sub-segment sps

and

sps

m+1

; dis(c

, c

m+1

) was the Euclidean distance between the sub-segment center point c

and c

m+1

, as shown in Figure 2(b). Also, note that the coverage of tracking data for

a pedestrian path mainly depended on the ow of pedestrians who were traveling on

it. This would cause a high coverage of trajectories for some pedestrian paths, while

others had only a few or even no matching points. Therefore, the incline values for some

sub-segments without matching points were inferred based on their adjacent sub-

segments.

3.3. Automatic identication for pedestrian path categories based on HSSA

method

Based on previous studies (Kasemsuppakorn and Karimi 2013, Zhou et al. 2015), the

categories of pedestrian paths in this study were dened as the sidewalk, indoor path,

entrance walkway, greenway, and crosswalk. The denition of indoor paths was essen-

tially the same as used in previous research (Kasemsuppakorn and Karimi 2013, Zhou et al.

2015). Sidewalks were dened as paths that were next to the MVR network. An entrance

walkway was dened as the path between the intersection of pedestrian segments and

the entrance of a building (Figure 3(a)). A greenway was dened as a path that is in

a green space and where the path does not belong to any other type. A crosswalk was

a path that pedestrians use to cross a road from one side to the other. In this study, we

used a circular buer to visualize a crosswalk because it is very hard to get the specic

width of a crosswalk due to the mobility of pedestrians (see Figure 3(b)). The radius

(denoted as r) of the circular buer for a crosswalk was decided based on the road width

(denoted as wml) of MVR networks and pedestrian path width (denoted as ε). Typically,

6X. YANG ET AL.

sidewalks are set on both sides of most MVR networks, and the radius r can be computed

based on Equation (2).

r¼wml þ2ε

2(2)

To detect these categories, we designed an HSSA method that used three steps. As the

rst step entrance walkways and indoor paths were detected based on spatial overlay

analysis of polygon data for buildings and the pedestrian networks. Then, crosswalks and

sidewalks were extracted from the rest of the pedestrian paths by using MVR networks. As

the third step, we detected greenways from the spatial overlay results of polygon data for

Figure 2. Inclination computation based on the matched tracking points and the diﬀerent colors of

points and lines in the above two panels present the diﬀerent trace segments and pedestrian path

segments, respectively; (a) the average value computation of elevation data; (b) inclination

calculation.

INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE 7

green spaces with the remaining pedestrian paths. The detailed operation for the HSSA

method is shown in the Appendix. By applying these steps, the raw pedestrian networks

were enriched with the semantics of the dierent path categories, i.e. indoor path,

crosswalk, entrance walkway, sidewalk, and greenway. Note that the overlay computation

of pedestrian paths with polygon data (e.g. buildings and green spaces) was conducted

by using the built-in function ‘inpolygon’ of the MATLAB 2018a platform. The details of

parameter value setting are discussed in more detail in the case study section.

4. Case study: semantic information extraction for pedestrian networks

The proposed approach for semantic enhancement of pedestrian networks was tested

using real-world multi-source spatial data. These spatial data included: (1) 3D tracking

data generated by pedestrians in the City of Wuhan in China (see Figure 4); (2) land use

data obtained from the OSM platform (see Figure 5(a)); and (3) MVR networks down-

loaded from Gaode platform through a public API, as shown in Figure 5(b). The tracking

data used in this study were collected for two weeks in 2016 by about 83 participants,

using built-in positioning devices in mobile phones. There were about 138,863 tracking

points with approximately 10–15 m planar positioning accuracy and 1–10s sampling

intervals. The study site was in the north of Hongshan district of the City of Wuhan, an

area of approximately 5 square kilometers, which had an undulating terrain and con-

tained many dierent kinds of pedestrian paths, such as sidewalk, crosswalk, entrance

walkway, indoor path, and greenway. The land use information for buildings and green

Figure 3. Categories of pedestrian paths, (a) the entrance walkway of pedestrian networks; (b) the

crosswalk of pedestrian networks.

8X. YANG ET AL.

spaces was represented by polygons, as shown in Figure 5(a). MVR networks were stored

based on an arc-node model, including road attributes and data on the number of lanes,

driving direction constraints (e.g. one-way or two-way), and road length (Figure 5(b)).

4.1. Data preprocessing for crowdsourced tracking data

The method for road incline values computation was tested with preprocessed data

through the steps of elevation data ltering and map-matching. For the methods

described earlier, we used the wavelet denoising tools provided by MATLAB 2018a to

improve the certainty of elevation data. The Symlet wavelet (denoted as ‘Sym5ʹ at

MATLAB platform) was selected as the wavelet basis following an earlier study

(Soleymani et al. 2017). To estimate the quality of data ltering under dierent decom-

position and reconstruction levels, two kinds of approaches were conducted. The rst one

was to compute the elevation dierences between the ltered tracking data and the

ground truth data. Because the high-resolution elevation data was not available for public

use, we used ASTER GDEM (Global Digital Elevation Model) data with a 30 m spatial

resolution to verify the eectiveness of elevation data ltering (Figure 6(a)). To facilitate

the computation of elevation dierences, the tracking data was converted to a raster

format and had the same spatial resolution as the ASTER DEM data (see Figure 6(b)).

The second evaluation step was to compare the elevation of tracking points with or

without wavelet denoising using PSNR (Peak signal to noise ratio) and SSIM (Structural

similarity) indicators. PSNR was used to quantize the distortion of ltered elevation data

under dierent decomposition and reconstruction levels (Sheikh et al. 2006). The higher

the PSNR value, the smaller the dierence between the raw data and processed data. We

adopted SSIM to estimate the structural similarity of the raw and processed data (Wang

et al. 2004). Generally, the value of SSIM ranges from 0 to 1; and the higher the value, the

better the quality of processed data. The values of PSNR and SSIM for ltered data at the

specic level of denoising are computed based on the Equations shown in Appendix.

Table 1 shows the elevation results between the ltered elevations of tracking points and

the DEM data at dierent levels of decomposition and reconstruction. The mean and

Figure 4. Crowdsourced tracking data collected by volunteers in the City of Wuhan, (a) 3D perspective

of the tracking points; (b) 2D perspective of the tracking points.

INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE 9

Figure 5. Land use data and MVR networks overlaid with pedestrian networks, (a) land use data; (b)

MVR networks.

10 X. YANG ET AL.

standard deviation of elevation dierences between DEM data and the corresponding

raw tracking data were about 12.8911 (m) and 51.4372 (m), respectively. The experimental

results showed that the mean and standard deviation of the ltered elevation data were

improved after wavelet denoising. The processed elevation data did not dier greatly for

dierent decomposition and reconstruction levels.

Figure 6. ASTER DEM and ASTER-derived elevation data on pedestrian trajectories in the experimental

region; (a) DEM data with 30 m spatial resolution; (b) elevation data of tracking data converted to

raster format.

INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE 11

In Table 2, we nd that the average running time of tracking data ltering increased

with an increasing decomposition level. For a specic decomposition level, the values of

PSNR and SSIM gradually got larger and smaller respectively with an increasing recon-

struction level. The experimental results showed that the value of PSNR and SSIM of

ltered data was associated with the reconstruction level. The value of PSNR and SSIM for

ltered data at the second reconstruction level was the same for dierent decomposition

levels. That means the quality of ltered tracking point elevation data depended mainly

on the reconstruction level. Combined with the results of Table 1 and running time, the

optimized decomposition level and reconstruction level in this study were each set as 2,

as a tradeo.

Map-matching was the second step for preprocessing the tracking data. The unsuc-

cessfully matched points were regarded as planar outliers and removed. The rest of the

matched points were used to extract the incline values of the pedestrian paths. To fulll

the task of map-matching, we rst calculated the similarity between tracking points and

pedestrian paths. As pedestrians can walk in any direction and can freely change their

direction, therefore, using research results from a previous study (Yang et al. 2018c), the

weights for distance and angle of similarity in the evaluation model were set to 0.91 and

0.09; respectively. Since the width of a pedestrian path in an urban area should be

between 2.5 m and 3.0 m (Yang et al. 2020) and the planar positioning accuracy of

tracking data was about 10–15 m, the constant D was set to 13 m. The similarity threshold

(denoted as Ts) was used to decide whether a tracking point could be matched to the

current pedestrian road segment. Its value was 0.8058 when the distance and angle were

3 m and 0 degrees or 180 degrees, respectively. To obtain an optimal value of Ts, we

randomly selected 20 trajectories from the tracking dataset and computed the values of

Table 1. Elevation diﬀerences between ﬁltered data and DEM data at diﬀerent levels of decomposition

and reconstruction.

Wavelet basis Decomposition level Reconstruction level Mean (m) STD (m)

Sym 5 (Symlet wavelet) 2 1 10.6537 8.7003

2 10.6558 8.6095

3 1 10.6680 8.6552

2 10.6558 8.6095

3 10.6836 8.6387

4 1 10.6680 8.6552

2 10.6558 8.6095

3 10.6836 8.6387

4 10.6517 8.5473

Table 2. Evaluation results of RSNR and SSIM for ﬁltered data at diﬀerent levels of decomposition and

reconstruction.

Wavelet basis Decomposition level Reconstruction level Running Time (s) PSNR SSIM

Sym 5 (Symlet wavelet) 2 1 0.6875 15.3306 0.8741

2 0.7344 26.4584 0.7328

3 1 1.1250 15.3306 0.8741

2 0.9531 26.4584 0.7328

3 0.9063 34.4572 0.6020

4 1 1.3750 15.3306 0.8741

2 1.5000 26.4584 0.7328

3 1.0313 34.4572 0.6020

4 0.8438 40.6113 0.5004

12 X. YANG ET AL.

indicators λ

and λ

of matching results by manual inspection. The values of λ

and λ

were

obtained based on Equations (9) and (10) shown in the Appendix.

Table 3 shows the results for Ts through repeated experiments using 1, 685 tracking

points collected in the City of Wuhan. Based on manual inspection, there were about 932

tracking points that should have been matched to the pedestrian paths. The rest of the

tracking points were regarded as outliers because of signal drifting. In Table 3, the relation

exhibits a parabolic trend between Ts and λ

. The relation between Ts and λ

displayed

a complete reversal trend to that with λ

. The value of λ

started to fall when the value of

Ts was decreased, even as the value of λ

grew. The values of λ

reached a peak when Ts

was set to 0.8–0.9. To reduce the uncertainty of road inclination computation and ensure

the integrity of the matched tracking points, the value of Ts was set as 0.8058, as

a tradeo.

4.2. Inclination computation and analysis for pedestrian paths

The semantic information about pedestrian networks was enhanced by adding the

attributes of pedestrian paths to the original database, including path incline values as

well as the semantic categories of the paths. The incline values of paths were calculated

based on the proposed partitioning strategy. Specially, the partition size α for a pedestrian

path was set to 10 m based on the planar positioning accuracy of the tracking data. The

objective of the inclination computation was to identify steep inclines for those path users

who need this information. Based on the design standards of road grades in China, the

longitudinal slope of the main road for pedestrians in a residential area should be less

than 8%. In a hilly area, the longitudinal slope of main roads for pedestrians should be less

than 12%; otherwise, anti-skid treatment should be done. The main roads for pedestrians

should not have stairs. When stairs were necessary, the longitudinal slope of stairs should

be less than 36%. For each branch of the main road, the longitudinal slope is recom-

mended to be less than 18%. Steps should have anti-skid treatments if the slope of

pedestrian paths exceeds 58%. The experimental data used in this study were collected

in the City of Wuhan where the main terrain is mostly plains, hills, and small to medium

relief mountains (Figure 6(a)). To facilitate the identication of pedestrian paths with

potential risks, we classied the inclination of paths into six levels by combining the

design standards for pedestrian roads with the inclination values (Figure 7(a)). In addition,

the design standards of pedestrian road inclination showed that the optimum slope for

setting steps in the hilly area ranged from 23% to 38%. Pedestrian paths with over 38%

inclination are displayed in Figure 7(b).

Table 3. Evaluation of map-matching results for diﬀerent thresholds.

Ts N NG N

0.5 1,685 932 932 0 55.31% 100%

0.55 1,644 932 932 0 56.69% 100%

0.6 1,581 932 932 0 58.95% 100%

0.65 1,520 932 919 13 60.46% 100%

0.7 1,393 932 907 25 65.11% 98.61%

0.75 1,152 932 898 34 77.95% 96.35%

0.8 870 932 870 62 100% 93.35%

0.85 805 932 805 127 100% 86.34%

0.9 691 932 691 241 100% 74.12%

INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE 13

Figure 7. Visualization of inclination of pedestrian paths, (a) six inclination classes; (b) pedestrian paths

with over 38% inclination.

14 X. YANG ET AL.

The total length of pedestrian paths with incline values was about 39,424 m. In Table 4,

we can see that the highest proportion of pedestrian paths had less than 8% inclination

(57.05%). That means most pedestrian paths in the experimental area were at and

suitable for walking. Approximately 8% to 12% of pedestrian paths had incline values

greater than 8.88% because the experimental region was in a hilly area, which was a little

steeper than pedestrian paths with less than 8% inclination. Also, about 25.77% pedes-

trian paths were shown to need steps, and 13.13% of paths should have the anti-skid

treatment. As shown in Figure 7(b), pedestrian paths with over 38% inclination were

marked by red lines. Pedestrians should take care when using these roads, especially for

individuals who are mobility-restricted, such as wheelchair users or people with walking

aids. Based on these incline values, routing tools can customize walking routes for

pedestrians based on their own needs.

We also randomly selected 30 path segments with over 38% inclination and checked if

these paths have anti-skid treatments for steps by manual visual inspection with street

view images of Baidu Map. The results show that about 60% of pedestrian paths with over

38% inclination were steps and had anti-skid treatments such as handrails guardrail. The

rest of them were relatively at and not the steps. That means the inclination of these

pedestrian paths is overestimated, because of GPS drift and elevation uctuations.

Therefore, the improvement of elevation data using other types of sensors such as built-

in barometers in mobile phones could still benet from further study in the future.

4.3. Categories information extraction for pedestrian paths

In this study, ve types of pedestrian paths were automatically detected using the

proposed HSSA method, including indoor paths, entrance walkways, sidewalks, cross-

walks, and greenways (Figure 8(a)). Indoor paths and entrance walkways were identied

as part of the rst step based on overlay results between pedestrian networks and

buildings (Figure 8(b)).

For sidewalk and crosswalk identication, the values of road width wml

from MVR

networks, and GPS trajectories positioning error ε were required. The MVR networks used

in this study recorded the number of lanes and driving direction constraints (e.g. one-way,

or two-way) for road segments, however, not all road segments had this information,

especially for residential roads. For main urban roads, road width was computed by

multiplying the values for lane numbers and lane width. For some residential roads, we

needed to infer their road widths based on road construction standards and driving

direction constraints. Road construction standards in China indicate that the widths for

one-way and two-way roads range from 3.5 m to 5 m and 8 m to 12 m, respectively. The

Table 4. Statistics of pedestrian paths at diﬀerent levels of

inclination.

Incline range Total length (m) Proportion

Less than 8% 22491.39 57.05%

8%-12% 3500.85 8.88%

12%-18% 3272.19 8.30%

18%-38% 4983.19 12.64%

38%-58% 2132.84 5.41%

More than 58% 3043.53 7.72%

INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE 15

width of each lane in an urban area is about 3.5 m. Based on these standards, the widths

for one-way and two-way roads in this research were set to 5 m and 8 m, respectively. The

value of ε was set to 2.5 m. Figure 8(c) shows the result of sidewalk and crosswalk

identication based on the values of these parameters. We can see from Figure 8(c) that

most of the sidewalks were identied using the proposed HSSA method. Besides, the

extraction of greenways was set in the last step since some paths could be other types

even though they were in an area of green space, as shown in Figure 8(d).

Figure 8. The study results: (a) ﬁve classes of pedestrian path were extracted; (b) indoor path and

entrance walkway detection results; (c) sidewalk and crosswalk detection results; (d) greenway

identiﬁcation results.

Table 5. Statistical results for each detected category of pedestrian paths.

Category of pedestrian paths Total length (m) Proportion (%)

type1: indoor path 2373.63 1.93

type2: entrance walkway 761.65 6.02

type3: sidewalk 16115.62 40.88

type4: crosswalk 821.00 2.08

type5: greenway 5438.93 13.80

other 13913.33 35.29

16 X. YANG ET AL.

The statistics for the dierent path categories within the pedestrian network are

displayed in Table 5. The evaluation indicators included the length of paths in each

category and its proportion to the total length of the entire pedestrian networks. As we

can see from Table 5, sidewalks occupied the highest proportion of paths compared to

other types of pedestrian paths. It should be noted that pedestrian paths identied as

other were either located in separate areas or were missed during category identication.

According to the experimental results, the proportion of other paths was about 35.3%,

and lower than that of sidewalks. We found that the proportion of pedestrian paths of

type1 (indoor paths) was the lowest of all. That is partly because the GPS signal is lost when

pedestrians walk into buildings. How to extract a complete indoor pedestrian map is

another open research challenge that would benet from further study.

To further verify the eectiveness of category identication, we evaluated the detected

results by comparing our results with ground truth data. As shown in Table 6, two

evaluation indicators, i.e. Precision and Recall were calculated using the method of Yang

et al. (2018a). The parameters True positive, False positive, and False negative referred to the

length of pedestrian paths correctly detected, wrongly detected, and missed by the

methods proposed in this paper, respectively. Specially, the lengths of pedestrian paths

correctly detected or missed were respectively measured by using the measurement tools

in QGIS 2.18. The ground truth for pedestrian networks in the study area in the City of

Figure 9. Misclassiﬁcation: (a) incorrect indoor path identiﬁcation; (b) incorrect sidewalk identiﬁcation.

Table 6. Evaluation of category identiﬁcation of pedestrian paths.

Category of pedestrian paths True positive (m) False positive (m)

False negative

(m) Precision (%) Recall (%)

type1: indoor path 2048.73 324.90 318.00 86.30 86.56

type2: entrance walkway 611.40 150.26 129.72 80.27 82.50

type3: sidewalk 15739.79 375.84 1204.72 94.66 92.89

type4: crosswalk 695.20 125.80 55.30 84.68 92.63

type5: greenway 4975.38 463.58 250.33 91.48 95.21

other 11955.27 1958.06 0.00 85.93 100.00

INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE 17

Wuhan was obtained by manual visual inspection of online areal images found on Google

Maps.

The evaluation results in Table 6 illustrate that the methods proposed in this paper

were eective at identifying categories of pedestrian paths at an average precision and

recall of 87.22% and 91.63%. However, based on the results in Table 6, there was about

a 13% chance of incorrectly identifying a pedestrian path category, and about 9% of the

paths were not categorized. In urban areas, it can be a challenging task to correctly

recognize the category of all pedestrian paths because of the accuracy of multi-source

spatial data and the randomness of pedestrians with respect to where they walk, as shown

in Figure 9.

Figure 9 shows two typical examples of incorrect path category identication. In Figure 9(a),

some paths were wrongly identied as indoor paths because of positioning errors of traces

even though they belonged to sidewalks. Moreover, some pedestrians walked close to

buildings, compounding this error. Since the entrance walkway was identied at the same

layer as indoor paths, it was also easy to get an incorrect path classication due to an error in

the detection of indoor paths (see Figure 9(a)). The additional processing needed for improv-

ing the detection accuracy of pedestrian paths is a topic for future research. Beyond that, the

spatial data applied in this study were collected from multi-platforms (e.g. volunteer crowds,

OSM platform, and Gaode Map), which caused datasets to have their own reference systems

which made it dicult to integrate all datasets into a common reference space without some

risk of distortion. Meanwhile, some spatial datasets could be transformed to protect the privacy

of users’ positions that led to local deformation. Figure 9(b) shows a transformation failure for

MVR networks, where a part of MVR roads was wrongly overlaid with buildings. This partial

position error with the MVR networks resulted in sidewalks being wrongly identied as other,

and decreased its recall score. Further analyses and improvements are therefore needed.

Overall, the statistical results shown in the above tables veried that the proposed method

in this paper could be applied for enriching the attribution of pedestrian networks. Enhanced

Figure 10. Incline analysis for pedestrian bridges and pedestrian tunnels.

18 X. YANG ET AL.

pedestrian networks with road inclination and semantic categories could be used to better

assess the walkability of a region, recommending a personalized route for pedestrians, and

assisting in decision making for pedestrian path construction.

4.4. Discussion

In this study, we explored how to automatically extract semantic information includ-

ing incline value and path category from multi-source spatial data. These two kinds

of semantic attribution are fundamental for pedestrian-related applications but are

rarely discussed (for an exception see John et al. (2017)). Building on the earlier work

of John et al. (2017), we rened the computation task by partitioning path segments

into a series of sub-segments that made the detection results for path incline more

granular, which results in more accurate pedestrian routing. We developed an auto-

matic categorization method for acquiring the type of pedestrian paths using land

use data and MVR networks, which signicantly enhanced the eciency of pedes-

trian paths’ category identication when comparing with categories manually identi-

ed. Sun et al. (2019) also investigated factors that aect the walkability of

pedestrian networks using manual digitization results to derive pedestrian paths’

categories from existing topographic maps. Our work goes further though as our

approach also oers a low cost and ecient solution to extract semantic information

relevant for pedestrian paths from public data sources that can expand the set of

data acquisition sources for pedestrian-related applications.

In this study, ve types of pedestrian paths were automatically identied by the

proposed HSSA method. These ve types extend the work of Kasemsuppakorn and

Karimi (2009) who also investigated path types and who dened pedestrian bridges

and tunnels associated with pedestrian paths. Our research did not include pedes-

trian bridges and tunnels as these features usually cross the MVR from above and

below, as shown in Figure 10. The variation of slope both for pedestrian bridges and

tunnels follows the principle of steep rst, then at, and steep again. Since the

positional accuracy of crowdsourced GPS tracking data is limited, it is very dicult

to detect this subtle change in road incline. It was challenging to accurately identify

pedestrian tunnels due to GPS signal loss. This was also an issue for complex MVR

intersections detection such as overpass and cloverleaf intersections (Yang et al.

2018a). Using other sensor data such as Street View (e.g. Google Street View) or

high-denition images, and spatial information for existing physical infrastructures

such as trac lights and subway stations to address this issue could be explored in

future work.

As part of this research, we extracted semantic information about pedestrian paths

from open data that were characterized using low-cost and highly accessible crowd-

sourced data. However, these data also have quality issues that resulted in some

uncertainty with information mining, such as data completeness. Crowdsourced GPS

tracking data are collected by volunteers, and its coverage is mainly dependent on

the number of participants and their movement area. In this study, the crowdsourced

GPS tracking data covered about 80.1% of paths of the experimental area, and the

rest of the paths were neglected. This issue was also discussed by Karimi and

Kasemsuppakorn (2013) who indicated that the quality of pedestrian network

INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE 19

information extraction was heavily dependent on the coverage and accuracy of GPS

tracking data. Similarly, the land use data downloaded from OSM were also acquired

in a collaborative way. The quality of this data from the perspective of completeness

and diversity patterns has been discussed in many studies (see, for example,

Arsanjani 2015, Wang et al. 2020). It is still an open question how to balance the

conict between data acquisition costs and the quality of data sources. Related to

this, the question of what percentage of routes would be accessible is an interesting

topic for the eld of path planning (Cui 2016, Zimmermann et al. 2017), but not

addressed in this study.

5. Conclusion and future work

Semantic attribution of pedestrian networks is essential for a variety of applications,

especially for pedestrian navigation systems and walkability assessments. In the

absence of approaches and techniques for semantic attribution extraction asso-

ciated with pedestrian networks, this study focused on automatically extracting

incline values and categories of paths using multi-source spatial datasets. To

acquire incline values, 3D crowdsourced tracking data was applied. The categories

of pedestrian paths including sidewalk, crosswalk, entrance walkway, indoor path,

and greenway were identied based on a proposed HSSA method, using land use

data, MVR networks, and a pedestrian network base map. Case studies were con-

ducted using three kinds of spatial datasets including GPS tracking data collected

by volunteers in the City of Wuhan, China, land use data acquired from

OpenStreetMap, and MVR networks downloaded from Gaode Map. Based on the

experimental results of road inclination computation, we mapped pedestrian paths

based on the incline values. These pedestrian paths attributed with incline informa-

tion can be used as foundational data for routing tools or walkability analyses. For

path category identication, the evaluation results indicated that the proposed

HSSA method was eective, with an average precision and recall of 87.22% and

91.63% respectively.

In the real world, however, the environment for pedestrian paths and walkways

is complex and their design is widely varying. For instance, some pedestrian paths

are at but with stairs or curbs. In this situation, it can be dicult for some

pedestrians to travel along these paths, e.g. wheelchair users or people with

physical disabilities. Although we extracted the incline values of pedestrian paths,

stairs or curb identication was challenging and could be a topic for future study. It

was also dicult to identify jaywalking paths and marked crosswalks from all

crosswalks; detect whether there is a physical sidewalk adjacent to a road, and

recognize pedestrian bridges and tunnels. For many pedestrian-related applications,

such as assessing the overall public safety of pedestrian networks, accurate pedes-

trian routing especially for accessibility of vulnerable populations (e.g. the elderly,

people with strollers, physical disabilities, etc.), these unidentiable features are

very important and further research is still needed. Future work could address other

limitations including: (1) improving the accuracy of collected elevation data using

built-in sensors such as barometers in mobile devices to assist positioning; (2)

20 X. YANG ET AL.

extending spatial analysis algorithms with other sensor data (e.g. Street View, or

high-denition images), to increase the accuracy of path identication results as

well as the number of categories of pedestrian paths; and (3) further analysis of the

walking environment (e.g. safety, cleanliness, and greenness), and connectivity with

other trac networks (e.g. MVR networks and bicycle networks) that are also

relevant for pedestrian travel.

Notes

1. https://wiki.openstreetmap.org/wiki/OpenTripPlanner

2. https://wiki.openstreetmap.org/wiki/Armchair_mapping

Acknowledgments

The authors would like to sincerely thank the anonymous reviewers for their constructive comments

and valuable suggestions to improve the quality of this article.

Data and codes availability statement

The data and code that support the ndings of this study are available in [gshare.com] with the

identier(s) at the link (https://doi.org/10.6084/m9.gshare.12660467.v2).

Disclosure statement

No potential conict of interest was reported by the author(s).

Funding

This work was funded by the National Natural Science Foundation of China [No. 41901394,

41971405]; Open research fund program of LIESMARS, Wuhan University [No. 19S01].

Notes on contributors

Xue Yang received the Ph.D. degree from Wuhan University, Wuhan, China, in 2018. She is currently

an associate Professor with China University of Geosciences, Wuhan. Her research interests include

intelligent transportation system, spatiotemporal data analysis, and information mining.

Homepage: http://grzy.cug.edu.cn/yangxue1/zh_CN/index.htm

Email: yangxue@cug.edu.cn

Kathleen Stewart is currently a Professor in the Department of Geographical Sciences and Director

of the Center for Geospatial Information Science. She works in the area of geographic information

science with a particular focus on geospatial dynamics. She is interested in mobility and spatial

access, often in a big geospatial data context and using approaches that lie in the expanding eld of

spatial data science. Homepage: https://geog.umd.edu/facultyprole/stewart/kathleen

Email: stewartk@umd.edu

Mengyuan Fang received Bsc degree from Wuhan University, Wuhan, China, 2014. He is currently

a Ph.D candidate at the State Key Laboratory of Information Engineering in Surveying, Mapping and

Remote Sensing, Wuhan University. His research addresses the issue of trac congestion detection

INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE 21

and prediction using big trace data.

Email: myfang@whu.edu.cn

Luliang Tang received the Ph.D. degree from Wuhan University, Wuhan, China, in 2007. He is currently

a Professor with Wuhan University. His research interests include space–time GIS, GIS for transporta-

tion, and change detection. Homepage: http://www.lmars.whu.edu.cn/index.php/js/298.html

Email: tll@whu.edu.cn

References

Arsanjani, J.J., 2015. Quality assessment of the contributed land use information from

OpenStreetMap versus authoritative datasets. In: J.J. Arsanjani, et al., eds. OpenStreetMap in

GIScience. Cham: Springer, 37–58.

Ballester, M., Pérez, M., and Stuiver, H., 2011. Automatic pedestrian network generation. In:

Proceedings of the 14th AGILE international conference on geographic information science, 18–22

April. Utrecht, Netherlands, 1–13.

Ben, J., Trisalyn, N., and Meghan, W., 2016. Mapping ridership using crowdsourced cycling data.

Journal of Transport Geography, 52, 90–97. doi:10.1016/j.jtrangeo.2016.03.006

Chin, G.K.W., et al., 2008. Accessibility and connectivity in physical activity studies: the impact of

missing pedestrian data. Preventive Medicine, 46 (1), 41–45. doi:10.1016/j.ypmed.2007.08.004.

Cui, J.X., 2016. Detecting urban road network accessibility problems using taxi GPS data. Journal of

Transport Geography, 51, 147–157. doi:10.1016/j.jtrangeo.2015.12.007

Ding, Y., et al., 2014. Inferring road type in crowdsourced map services. Pattern Recognition, 8422 (8),

392–406.

Elias, B., 2007. Pedestrian navigation - creating a tailored geodatabase for routing. In: Workshop on

Positioning, Navigation & Communication, 22–22 March. Hannover, Germany, 41–47.

Gao, J., et al., 2020. Understanding urban hospital bypass behaviour based on big trace data. Cities,

103, 1–12. doi:10.1016/j.cities.2020.102739

Guo, Q., et al., 2017. The eect of road network patterns on pedestrian safety: a zone-based Bayesian

spatial modeling approach. Accident Analysis and Prevention, 99, 114–124. doi:10.1016/j.

aap.2016.11.002.

John, S., et al., 2017. Deriving incline values for street networks from voluntarily collected GPS traces.

Cartography and Geographic Information Science, 44 (2), 152–169. doi:10.1080/15230406.2016.1190300.

Karimi, H. and Kasemsuppakorn, P., 2013. Pedestrian network map generation approaches and

recommendation. International Journal of Geographical Information Science, 27 (5), 947–962.

doi:10.1080/13658816.2012.730148.

Kasemsuppakorn, P. and Karimi, H., 2009. Pedestrian network data collection through

location-based social networks. In: International conference on collaborative computing: network-

ing, applications and worksharing, 11–14 November. Washington, DC: IEEE, 1–9.

Kasemsuppakorn, P. and Karimi, H., 2013b. A pedestrian network construction algorithm based on

multiple GPS traces. Transportation Research Part C: Emerging Technologies, 26 (1), 285–300.

doi:10.1016/j.trc.2012.09.007.

Kasemsuppakorn, P. and Karimi, H.A., 2013. Pedestrian network extraction from fused aerial imagery

(Orthoimages) and laser imagery (Lidar). Photogrammetric Engineering and Remote Sensing, 79 (4),

369–379. doi:10.14358/PERS.79.4.369.

Kim, J., Bang, Y., and Yu, K., 2009. Automatic derivation of a pedestrian network based on existing

spatial data sets. In: ASPRS/MAPPS 2009 Fall Conference, 16–19 November. San Antonio, TX, 1–7.

Martin, D. and Rob, K., 2013. Crowdsourced cartography: mapping experience and knowledge.

Environment and Planning, 45 (1), 19–36. doi:10.1068/a44484.

Rade, S., et al., 2018. Road network fusion for incremental map updates. In: 14th International

Conference on Location Based Services, 15–17 January. Zurich, Switzerland, 91–109.

Sardy, S., Tseng, P., and Bruce, A., 2001. Robust wavelet denoising. IEEE Transactions on Signal

Processing, 49 (6), 1146–1152. doi:10.1109/78.923297.

22 X. YANG ET AL.

Sheikh, H.R., Sabir, M.F., and Bovik, A.C., 2006. A statistical evaluation of recent full reference image

quality assessment algorithms. IEEE Transactions on Image Processing, 15 (11), 3440–3451.

doi:10.1109/TIP.2006.881959.

Soleymani, A., et al., 2017. Characterizing change points and continuous transitions in movement

behaviours using wavelet decomposition. Methods in Ecology and Evolution, 8, 1113–1123.

doi:10.1111/2041-210X.12755

Sun, G., Webster, C., and Zhang, X., 2019. Connecting the city: a three-dimensional pedestrian network of

Hong Kong. Urban Analytics and City Science, 48, 60–75. doi:10.1177/2399808319847204

Tal, G. and Handy, S., 2012. Measuring nonmotorized accessibility and connectivity in a robust

pedestrian network. Transportation Research Record: Journal of the Transportation Research Board,

2299, 48–56. doi:10.3141/2299-06

Tang, J., et al., 2019a. An automatic method for detection and update of additive changes in road

network with GPS trajectory data. ISPRS International Journal of Geo-Information, 9 (8), 1–20.

Tang, L., et al., 2016. CLRIC: collecting lane-based road information via crowdsourcing. IEEE Transactions

on Intelligent Transportation Systems, 17 (9), 2552–2562. doi:10.1109/TITS.2016.2521482.

Tang, L., et al., 2019b. Detecting and evaluating urban clusters with spatiotemporal big data. Sensors,

19 (3), 1–15. doi:10.3390/s19030461.

Uduwaragoda, E.R.I.A.C., Perera, A.S., and Dias, S.A.D., 2013. Generating lane level road data from vehicle

trajectories using kernel density estimation. In: Proceedings of the 16th International IEEE conference on

Intelligent Transportation Systems. Hague, Netherlands, 384–391. doi:10.1109/ITSC.2013.6728262.

Wang, S., Zhou, Q., and Tian, Y., 2020. Understanding completeness and diversity patterns of

OSM-based land-use and land-cover dataset in China. ISPRS International Journal of Geo-

Information, 9 (9), 531. doi:10.3390/ijgi9090531.

Wang, Z., et al., 2004. Image quality assessment: from error visibility to structural similarity. IEEE

Transactions on Image Processing, 13 (4), 600–612. doi:10.1109/TIP.2003.819861.

Xie, X. and Ou, G., 2018. Pedestrian network information extraction based on VGI. Geomatica, 72 (3),

85–99. doi:10.1139/geomat-2018-0006.

Yang, X., et al., 2018a. Generating lane-based intersection maps from crowdsourcing big trace

data. Transportation Research Part C: Emerging Technologies, 89, 168–187. doi:10.1016/j.

trc.2018.02.007.

Yang, X., et al., 2018b. Automatic change detection in lane-level road networks using GPS

trajectories. International Journal of Geographical Information Science, 32 (3), 601–621.

doi:10.1080/13658816.2017.1402913.

Yang, X., et al., 2018c. A data cleaning method for big trace data using movement consistency.

Sensors, 18 (3), 824. doi:10.3390/s18030824.

Yang, X., et al., 2020. Pedestrian network generation based on crowdsourced tracking data.

International Journal of Geographical Information Science, 34 (5), 1051–1074. doi:10.1080/

13658816.2019.1702197.

Zhang, H. and Ye, C., 2017. RGB-D camera based walking pattern recognition by support vector

machines for a smart rollator. International Journal of Intelligent Robotics & Applications, 1 (1), 32.

doi:10.1007/s41315-016-0002-6.

Zhang, Y., et al., 2012. Associations between road network connectivity and pedestrian-bicyclist accidents.

In: Transportation Research Board Annual Meeting, 13–17 January. Washington, DC, 1–17.

Zhang, Y., et al., 2020. A hybrid method to incrementally extract road networks using

spatio-temporal trajectory data. ISPRS International Journal of Geo-Informaiton, 9, 186.

doi:10.3390/ijgi9040186.

Zhou, B., et al., 2015. ALIMC: activity landmark-based indoor mapping via crowdsourcing. IEEE

Transactions on Intelligent Transportation System, 16 (5), 1–11. doi:10.1109/TITS.2015.2423326.

Zimmermann, M., Tien, M., and Emma, F., 2017. Bike route choice modeling using GPS data without

choice sets of paths. Transportation Research Part C: Emerging Technologies, 75, 183–196.

doi:10.1016/j.trc.2016.12.009

INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE 23

Hierarchical Segmentation Method for Generating Road Intersections from Crowdsourced Trajectory Data

Article

Full-text available

Oct 2022

Maintaining the data freshness and completeness of road intersection information is the key task of urban road map production and updating. Compared to professional surveying methods, crowdsourced trajectory data provide a low-cost, wide-coverage and real-time data resource for road map construction. However, there may exist the problems of spatio-temporal heterogeneity and uneven density distribution in crowdsourced trajectory data. Hence, in light of road hierarchies, the paper proposes a hierarchical segmentation method to generate road intersections from crowdsourced trajectories. The proposed method firstly implements an adaptive density homogenization processing on raw trajectory data in order to decrease the uneven density discrepancy. Then, a hierarchical segmentation strategy is developed to extract multi-level road intersection elements from coarse scale to fine scale. Finally, the structural models of road intersections are delineated by an iterative piecewise fitting method. Experimental results show that the proposed method can accurately and completely extract road intersections of different shapes and scales, with an accuracy of about 87–90%. Particularly, the precision and recall of road intersection detection are obviously increased by about 7% and 20% by adaptive density homogenization, indicating the advantages of dealing with uneven trajectory data.

TR2RM: an urban road network generation model based on multisource big data

Article

Full-text available

Apr 2024

Road networks are an important part of transportation infrastructure through which people experience a city. The existing methods of vector map data generation mainly depend on a single data source, e.g. images, trajectories, or existing raster maps, which are limited by information fragmentation due to incomplete data. This study proposes an urban road network extraction framework named trajectory and remote-sensing image to RoadMap (TR2RM) based on deep learning technology by combining high-resolution remote sensing images with big trajectory data; this framework is composed of three components. The first component focuses on feature map generation by fusing remote sensing images with trajectories. The second component is composed of a novel neural network architecture denoted as AD-LinkNet, which is used to identify roads from the fused dataset of the first component. The last component is a postprocessing step that aims to generate the vector map accurately. Taking Rome, Beijing, and Wuhan as examples, we conduct extensive experiments to verify the effectiveness of the TR2RM. The results showed that the correctness of both the topology and geometry of the generated road network based on the TR2RM in Rome, Beijing, and Wuhan was 83.86% and 88.27%, 74.72% and 80.36%, and 73.83% and 77.7%, respectively.

"The Pedestrian Network Concept: A Systematic Literature Review "

Article

Full-text available

Dec 2023

The design of urban spaces that foster sustainable practices requires new analytical and structural approaches to spatial planning. An appropriate pedestrian network could significantly contribute to sustainable urban development goals, particularly by promoting sustainable mobility and pedestrian friendliness. With such goals, several attempts have been made to develop suitable models for pedestrian networks. However, something that is missing from the current literature is a framework that incorporates the main findings of the various studies as an integrated concise concept of the pedestrian network. To address this knowledge gap, this paper reviews studies on pedestrian networks and evaluates this concept based on the systematic 3W1H analysis method, which asks where, what, who, and how. In essence, the following questions are thus analyzed: Where is the pedestrian network located, What criteria play a role in the pedestrian network's performance, Who uses the pedestrian network, and How can the pedestrian network be analyzed? In this context, a systematic literature review is carried out by investigating studies conducted during the period 2001 to 2023 that appear in the Scopus database. The paper presents the results of the review of a selection of 67 papers dealing with pedestrian networks. Findings show that different models have been developed based on particular characteristics. Overall, researchers aimed to identify the most suitable network based on specific criteria for optimizing the walking experience in urban areas. By synthesizing the findings reported in these papers, this paper arguably contributes to a more comprehensive understanding of pedestrian networks, provides insights into the prioritization of design phases, facilitates the use of pedestrian network assessment models for future research , and creates a bigger picture for urban planners with a multidimensional view to a new sustainable urban structure.

Interlinking BIM and GIS data for a semantic pedestrian network and applications in high-density cities

Article

Mar 2024

How do crosswalk delays affect pedestrian access in zoning areas? Walking access reduction by signalized crosswalks in Seoul, South Korea

Article

Jul 2023
APPL GEOGR

Connectivity analysis in pedestrian networks: A case study in Wuhan, China

Article

Feb 2023
APPL GEOGR

Intelligent Transportation System: Need, Working, and Tools

Chapter

Full-text available

Nov 2022

Introduction to intelligent transportation systems: need, operation, tools, intelligent transportation system (ITS) emergency vehicle scenario, and convolutional neural network model for intelligent transportation systems are covered in this chapter. In addition, the chapter discusses the need for intelligent transportation systems, how they work, and the critical stages of intelligent transportation systems. Aside from that, the chapter discusses intelligent transportation system patterns. This work discusses critical challenges in implementing an intelligent transportation system. This chapter discusses intelligent user services in an intelligent transportation system, and also discusses required architecture of vehicular networks/ITS. Big data and new technologies that make it easier and cheaper to collect, store, analyze, use, and share data from various sources have made this more accessible and cheaper. Because of the connected environment, new ways to control and manage transportation systems in real time are also emerging. These new methods of controlling and managing transportation systems will aid in the improvement of overall system performance. These systems use real-time data about traffic flow on city roads to assist people in avoiding traffic and maintaining a clean environment. There has been a significant increase in traffic monitoring, putting traditional transportation systems that rely on cloud computing under a lot of strain. In the last few years, the intelligent transportation system (ITS) has seen a lot of changes. Make trips more efficient: Many ITS technologies can assist people in reducing the number of unnecessary trips and increasing the number of trips taken by other modes. They can also help to reduce traffic congestion, reduce the need for foreign oil, and improve air quality.KeywordsIntelligent transportation system (ITS)Connected and autonomous vehicles (C/AV)Industrial Internet of things (IIoT)Machine learning-assisted intelligent traffic monitoring system (ML-ITMS)Machine learning (ML)

Point‐of‐interest detection from Weibo data for map updating

Article

Sep 2022

Points‐of‐interest (POIs) geographic information system data are increasingly important for supporting map generation and navigation services, although updating their semantic and location information still largely depends on manual labor. In this study, we propose a novel method to automatically detect the changes in POIs from Chinese text and check‐in position data provided by the Chinese social media platform, Weibo. The proposed method includes three steps: (1) POI name recognition; (2) location confirmation; (3) and change detection. First, we propose recognizing a POI's name from Weibo text using the improved conditional random field algorithm. Then, we detect the location of each named POI by integrating the text address with the check‐in position. The changes in the detected POIs are recognized by extracting the status words from Weibo text and a three‐level status word database. To verify the effectiveness of the proposed method, we examine Wuhan as a case and detect the changes in the commercial POI using real‐world Weibo data collected from January to September 2020. Based on the validation of three common map platforms, the data provided and the manual field investigation of 55 random samples, the identification accuracies for newly added POIs, the unchanged POIs, and expired POIs are approximately 100, 95.8, and 91.7%, respectively.

Walkable Cities: Using the Smart Pedestrian Net Method for Evaluating a Pedestrian Network in Guimarães, Portugal

Article

Full-text available

Aug 2022

Evidence for the benefits of walking has attracted the attention of researchers and practitioners and encouraged them to develop healthier and more sustainable walkable cities. Many methods and approaches have been developed to measure walkability; namely, by using land use attributes. This paper examines the transferability of the Geographic Information System (GIS) based multi-criteria method developed in the Smart Pedestrian Net (SPN) research project to evaluate the level of walkability in a pedestrian network in Guimarães, Portugal. The method involves the assessment of 19 built environment and streetscape attributes, which were scored by a group of experts following the analytic hierarchy process. The method proved to be efficient in evaluating the pedestrian network and in mapping walkability in the study area. Around 65% of the street lengths scored above 0.60, indicating that the overall pedestrian conditions are favourable, with the best performance criteria being those related to accessibility and street connectivity. The method also allowed for the identification of different levels of walkability within the study area and the lack of a pedestrian network of highly scored streets. According to the results, the SPN method could be replicated in other cities to evaluate walkability and could be a useful planning tool to support policies towards developing more walkable cities.

Polygonization method for automatic generation of indoor and outdoor pedestrian navigation path for smart city

Article

Oct 2021
J TRANSP GEOGR

Smart mobility cannot be supported by smart technologies alone. It has to be supported by geospatial data with navigation lines. Manual methods for creating and updating them are not only costly but also time-consuming. Automatic extraction and updating of geospatial navigation lines from GIS and indoor layout plans are much needed. The navigation path is often stored and conceived as centreline graphs in GIS. However, navigation paths in which the centrelines are located can be considered as a combination of basic navigation polygons. This paper proposed a generic polygonization method that can automatically generate and update navigation centrelines from GIS and BIM. By constructing basic navigation polygons from GIS and decomposing path polygons in street layouts and floor plans from GIS and BIM into basic navigation polygons, indoor and outdoor pedestrian navigation paths and centrelines can be extracted and generated to be used for indoor and outdoor pedestrian navigation.

Understanding Completeness and Diversity Patterns of OSM-Based Land-Use and Land-Cover Dataset in China

Article

Full-text available

Sep 2020
ISPRS

OpenStreetMap (OSM) data are considered essential for land-use and land-cover (LULC) mapping despite their lack of quality. Most relevant studies have employed an LULC reference dataset for quality assessment, but such a reference dataset is not freely available for most countries and regions. Thus, this study conducts an intrinsic quality assessment of the OSM-based LULC dataset (i.e., without using a reference LULC dataset) by examining the patterns of both its completeness and diversity. With China chosen as the study area, an OSM-based LULC dataset of the country was first generated and validated by using various accuracy measures. Both its completeness and diversity patterns were then mapped and analyzed in terms of each prefecture-level division of the country. The results showed the following: (1) While the overall accuracy was as high as 82.2%, most complete regions of China were not mapped well owing to a lack of diverse LULC classes. (2) In terms of socioeconomic factors and the number of contributors, higher correlations were noted for diversity patterns than completeness patterns; thus, the diversity pattern is a better reflection of socioeconomic factors and the spatial patterns of contributors. (3) Both the completeness and the diversity patterns can be combined to better understand an OSM-based LULC dataset. These results indicate that it is useful to consider diversity as a supplement for intrinsically assessing the quality of an OSM-based LULC dataset. This analytical method can also be applied to other countries and regions.

A Hybrid Method to Incrementally Extract Road Networks Using Spatio-Temporal Trajectory Data

Article

Full-text available

Mar 2020
ISPRS

With the rapid development of urban traffic, accurate and up-to-date road maps are in crucial demand for daily human life and urban traffic control. Recently, with the emergence of crowdsourced mapping, a surge in academic attention is being paid to generating road networks from spatio-temporal trajectory data. However, most existing methods do not explore changing road patterns contained in multi-temporal trajectory data, and it is still difficult to satisfy the precision and efficiency demands of road information extraction. Hence, in this paper we propose a hybrid method to incrementally extract urban road networks from spatio-temporal trajectory data. First, raw trajectory data are partitioned into K time slices and are used to initialize K-temporal road networks by a mathematical morphology method. Then, the K-temporal road networks are adjusted according to a gravitation force model so as to amend their geometric inconsistencies. Finally, road networks are geometrically delineated using the k-segment fitting algorithm, and the associated road attributes (e.g., road width and driving rule) are inferred. Several case studies are examined to demonstrate that our method can effectively improve the efficiency and precision of road extraction and can make a significant attempt to mine the incremental change patterns in road networks from spatio-temporal trajectory data to help with road map renewal.

Pedestrian network generation based on crowdsourced tracking data

Article

Full-text available

Dec 2019

Pedestrian networks play an important role in various applications, such as pedestrian navigation services and mobility modeling. This paper presents a novel method to extract pedestrian networks from crowdsourced tracking data based on a two-layer framework. This framework includes a walking pattern classification layer and a pedestrian network generation layer. In the first layer, we propose a multi-scale fractal dimension (MFD) algorithm in order to recognize the two different types of walking patterns: walking with a clear destination (WCD) or walking without a clear destination (WOCD). In the second layer, we generate the pedestrian network by combining the pedestrian regions and pedestrian paths. The pedestrian regions are extracted based on a modified connected component analysis (CCA) algorithm from the WOCD traces. We generate the pedestrian paths using a kernel density estimation (KDE)-based point clustering algorithm from the WCD traces. The pedestrian network generation results using two actual crowdsourced datasets show that the proposed method has good performance in both geometrical correctness and topological correctness.

An Automatic Method for Detection and Update of Additive Changes in Road Network with GPS Trajectory Data

Article

Full-text available

Sep 2019
ISPRS

Ubiquitous trajectory data provides new opportunities for production and update of the road network. A number of methods have been proposed for road network construction and update based on trajectory data. However, existing methods were mainly focused on reconstruction of the existing road network, and the update of newly added roads was not given much attention. Besides, most of existing methods were designed for high sampling rate trajectory data, while the commonly available GPS trajectory data are usually low-quality data with noise, low sampling rates, and uneven spatial distributions. In this paper, we present an automatic method for detection and update of newly added roads based on the common low-quality trajectory data. First, additive changes (i.e., newly added roads) are detected using a point-to-segment matching algorithm. Then, the geometric structures of new roads are constructed based on a newly developed decomposition-combination map generation algorithm. Finally, the detected new roads are refined and combined with the original road network. Seven trajectory data were used to test the proposed method. Experiments show that the proposed method can successfully detect the additive changes and generate a road network which updates efficiently.

Connecting the city: A three-dimensional pedestrian network of Hong Kong (RTPI Research Excellence Commended Award)

Article

Full-text available

Apr 2019

The purpose of the paper is to investigate how a three-dimensional pedestrian network reshapes connectivity and helps to integrate the built environment of high-density cities. Using the case of Hong Kong, first, we elaborate how a continuous three-dimensional network constitutes an entirely different urban morphological spatial hierarchy compared to two-dimensional because of the footbridge system, underground connected with metro stations, and paths connected with mall developments. Second, we construct a three-dimensional pedestrian network model classifying segments into 23 categories with multi-height levels (e.g. sidewalk, footbridge, underground, crosswalk, ramp, paths on the building roof). Then we map the three-dimensional network for Hong Kong territory in a geographic information system, finding that the three-dimensional pedestrian network is 2.4 times in length and 8.5 times in link size greater than the road network. Connectivity comparison through a betweenness measure found striking differences between the two networks and indicated that footbridges and underground links could enhance walkability when they are well connected with the ground-level networks. Since road networks are widely used as a proxy for pedestrian analysis, we suggest that active travel optimisation planning, especially in high-density cities, requires a bespoke three-dimensional pedestrian model. The three-dimensional pedestrian network, enabling multi-level city living in a vertical metropolis, is a fundamental consideration in urban planning and design practices for high-density cities.

Detecting and Evaluating Urban Clusters with Spatiotemporal Big Data

Article

Full-text available

Jan 2019
SENSORS-BASEL

The design of urban clusters has played an important role in urban planning, but realizing the construction of these urban plans is quite a long process. Hence, how the progress is evaluated is significant for urban managers in the process of urban construction. Traditional methods for detecting urban clusters are inaccurate since the raw data is generally collected from small sample questionnaires of resident trips rather than large-scale studies. Spatiotemporal big data provides a new lens for understanding urban clusters in a natural and fine-grained way. In this article, we propose a novel method for Detecting and Evaluating Urban Clusters (DEUC) with taxi trajectories and Sina Weibo check-in data. Firstly, DEUC applies an agglomerative hierarchical clustering method to detect urban clusters based on the similarities in the daily travel space of urban residents. Secondly, DEUC infers resident demands for land-use functions using a naïve Bayes’ theorem, and three indicators are adopted to assess the rationality of land-use functions in the detected clusters—namely, cross-regional travel index, commuting direction index, and fulfilled demand index. Thirdly, DEUC evaluates the progress of urban cluster construction by calculating a proposed conformance indicator. In the case study, we applied our method to detect and analyze urban clusters in Wuhan, China in the years 2009, 2014, and 2015. The results suggest the effectiveness of the proposed method, which can provide a scientific basis for urban construction.

A Data Cleaning Method for Big Trace Data Using Movement Consistency

Article

Full-text available

Mar 2018
SENSORS-BASEL

Given the popularization of GPS technologies, the massive amount of spatiotemporal GPS traces collected by vehicles are becoming a new kind of big data source for urban geographic information extraction. The growing volume of the dataset, however, creates processing and management difficulties, while the low quality generates uncertainties when investigating human activities. Based on the conception of the error distribution law and position accuracy of the GPS data, we propose in this paper a data cleaning method for this kind of spatial big data using movement consistency. First, a trajectory is partitioned into a set of sub-trajectories using the movement characteristic points. In this process, GPS points indicate that the motion status of the vehicle has transformed from one state into another, and are regarded as the movement characteristic points. Then, GPS data are cleaned based on the similarities of GPS points and the movement consistency model of the sub-trajectory. The movement consistency model is built using the random sample consensus algorithm based on the high spatial consistency of high-quality GPS data. The proposed method is evaluated based on extensive experiments, using GPS trajectories generated by a sample of vehicles over a 7-day period in Wuhan city, China. The results show the effectiveness and efficiency of the proposed method.

Understanding urban hospital bypass behaviour based on big trace data

Article

Aug 2020
CITIES

The debates about patients' decisions regarding which hospital to visit have been shown in many health research literature for the past few years. Researchers have developed various methods to understand hospital bypass behaviour of patients; however, previous studies ignore the impact of spatial heterogeneity and awareness of patients about surrounding hospitals such as travel distance and hospital distributions in the process of bypass behaviour definitions and evaluations. To address these limitations, this study puts forward a Hospital Bypass Index (HBI) to understand urban hospital bypass behaviour by using big trace data collected by urban taxis. To evaluate the bypass behaviour, we defines three evaluation indicators for HBI including a potential bypass rate, an overall distance decay parameter, and a diurnal variation in distance decay parameter by mining large-scale patient-hospital trips from a spatiotemporal perspective. Experiments are conducted with 30 general hospitals and 13 specialty hospitals in Wuhan city, China, by using one month of taxi traces. The results of bypass behaviour evaluation and comparisons indicate that the proposed method is effective and feasible, which is promising for health departments to optimize the medical services and rationally allocate the medical facilities.

Pedestrian network information extraction based on VGI

Article

Jan 2019

Pedestrian network information plays an important role in pedestrian location based service (LBS), and its completeness determines the quality of a pedestrian LBS. This study used volunteered data and BaiduMap to research how to extract pedestrian network information on the basis of pedestrian GPS trajectories. The method extracts human road information by three steps: cleaning track data, extracting the road network, and detecting and analysing the recognised pedestrian road facilities. Once the road network information is extracted, the information regarding road facilities can be obtained, e.g., pedestrian crossings, overpasses, and underground passages. This paper describes a new method for incrementally updating electronic maps.

Generating Lane-based Intersection Maps from Crowdsourcing Big Trace Data

Article

Feb 2018
TRANSPORT RES C-EMER

Attributing pedestrian networks with semantic information based on multi-source spatial data

Abstract and Figures

Recommended publications

Pedestrian network generation based on crowdsourced tracking data

A Review of GPS Trajectories Classification Based on Transportation Mode

Pedestrian network map generation approaches and recommendation

Generating Lane-based Intersection Maps from Crowdsourcing Big Trace Data