Locality-Awareness in BitTorrent-Like P2P Applications

April 2009
IEEE Transactions on Multimedia 11(3):361-371

DOI:10.1109/TMM.2009.2012911

Authors:

Yi Cui

Vanderbilt University

Yuan Xue

Vanderbilt University

This paper presents a measurement study of locality-aware P2P solutions over the real-world Internet AS topology. Using our access to nodes of the PlanetLab testbed, we create a detailed AS-level map that includes the end-to-end paths between all nodes as well as the relationships of all involved ASes. Based on this map, we evaluate the performance of a set of locality-aware P2P solutions, including an optimal solution guaranteeing the minimum AS hop count, as well as a modified BitTorrent system with locality-awareness built into its neighbor selection, peer choking/unchoking, and piece selection processes. Our findings suggest that locality-awareness can help existing P2P solutions significantly decrease the load on the Internet and achieve shorter downloading times. By comparing the performance of different kinds of locality-aware and traditional BitTorrent systems, we also point out the need for a tradeoff between optimizing AS-related performance, such as intra-AS traffic, and achieving fairness among peers, such as peer burden fairness.

Bo Liu, Yi Cui, Yansheng Lu, and Yuan Xue

I. INTRODUCTION

Peer-to-peer (P2P) communication has proven to be an extremely powerful paradigm for a diverse family of Internet applications. A few examples include bulk content distribution [1], voice over IP [2], and broadcasting of TV-quality programs [3], [4], all of which have been proven by the commercial deployment of planet-scale systems serving tens of millions of users. Among them, BitTorrent is arguably the biggest constituent of P2P traffic, which dominates today’s Internet.

In P2P systems, every peer not only downloads content from other peers, but also uses its upload bandwidth to serve other peers. Peers are highly diverse in terms of both geographic distance and Internet topology, which introduces a tremendous amount of traffic crossing the boundaries of Internet Service Providers (ISPs). Such traffic often causes great financial loss to ISPs with active P2P users, which motivates them to control P2P traffic by “throttling”, or bandwidth limiting. As a countermeasure, many P2P applications use traffic obfuscation techniques to make themselves indistinguishable from other applications, in an attempt to stop ISPs from regulating P2P traffic. Such an interaction quickly becomes an escalating game of cat and mouse.

This work was supported by NSF award 0643488, a Vanderbilt Discovery grant, and a gift from Microsoft Research. The views and conclusions of this paper are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the funding agencies.

Bo Liu and Yansheng Lu are with the School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, 430074 China (e-mail: {newpoo, lys}@smail.hust.edu.cn).

Yi Cui and Yuan Xue are with the Department of Computer Science, Vanderbilt University, Nashville, TN, 37240 USA (e-mail: {yi.cui, yuan.xue}@vanderbilt.edu).

To alleviate the tension between ISPs and P2P users, actions can be taken on both sides. On one hand, ISPs can install cache nodes to increase data availability and redirect P2P applications to such opportunities within the same ISP. On the other hand, P2P applications can employ a variety of techniques, such as ISP-friendly neighbor selection and locality-aware piece scheduling algorithms, to reduce inter-ISP traffic. So far, most work on this subject has focused on the feasibility of ISP-friendly P2P applications [5], [6], quantitative study of the savings in inter-ISP traffic through trace analysis [7], and simulation study of the tradeoff between downloading performance and enhancing intra-ISP traffic [8]. Given these works, we argue that what is still missing is a comprehensive study of the various ways to embed locality-awareness into P2P applications and their impact on ISPs. In particular, our study takes the following two facts into account.

First, the relationship among ISPs is not as simple as either inter- or intra-ISP. ISPs themselves interconnect into a complex network of autonomous systems (AS)¹. Two ASes, if connected to each other, must have one of the following four kinds of relationships: Provider-Customer, Customer-Provider, Peer-to-Peer², and Sibling. Each customer AS should pay its provider AS for both inbound and outbound traffic. Traffic across peer-to-peer ASes or sibling ASes is usually free. The users of a single P2P application can reside in many ASes which, together with the third-party ASes interconnecting them, easily form a network containing all of the above relationships.

Second, a P2P application usually has sophisticated semantics. Take BitTorrent as an example: it operates at several levels. At the macroscopic level, a peer that intends to download a file first retrieves from the tracker a list of other peers interested in the same file (neighbor selection). At the intermediate level, during the download, each peer dynamically determines the subset of peers on its list to share data with (choking and unchoking). At the microscopic level, a file is divided into many pieces. Among its multiple connections with other peers, each peer chooses which piece to download from which peer (piece picking policy). Readers can find a detailed discussion of this topic in Sec. III-C. This architectural design is inherited by many P2P applications, including P2P streaming systems.

¹ In the remainder of this paper, we use the terms AS and ISP interchangeably.

² We note that this notion should not be confused with the same term used to describe applications such as BitTorrent. To avoid confusion, we use peer-to-peer to refer to the relationship among ASes, and P2P to refer to BitTorrent and similar applications.

Although existing work has studied ISP-friendly neighbor selection [8] (macroscopic level), ISP-friendliness can in fact be built in at all levels.

These two facts call for a systematic field study of various locality-aware P2P solutions and their impact on ISPs. In light of the first fact, we need a detailed and up-to-date AS-level map revealing the connections among ASes and their financial relationships. While mature techniques exist to infer AS relationships, we still need to know the exact end-to-end path over which two peers connect to each other, which often traverses multiple ASes. Such knowledge can only be obtained when one is able to access a large number of peer machines. In light of the second fact, it is extremely challenging to rely on analytical study to model the subtle locality-aware mechanisms deployed at all operating levels of a P2P application such as BitTorrent. Having a reasonably sized testbed and running the P2P application on all of its nodes is a better approach to fully capture the behavior of peers and the impact on the ISPs they belong to.

As such, we conduct our study on the PlanetLab testbed [9]. We obtain access to PlanetLab nodes around the globe and use the information on their all-pair end-to-end paths to construct a detailed AS-level map. Based on this map, we conduct both simulation and real-system studies. First, we devise the optimal locality-aware strategy, which minimizes the AS hop count of the entire P2P distribution structure. Since the optimal structure is a static tree, obtaining its performance via simulation on the AS-level map achieves the same effect as deploying it on PlanetLab. Second, we modify the BitTorrent system at all operating levels and test its performance on PlanetLab. In particular, we build locality-awareness into neighbor selection (macroscopic level), the choking/unchoking mechanism (intermediate level), and the piece picking policy (microscopic level).

Our findings suggest that locality-awareness can help existing P2P solutions significantly decrease the load on the Internet and achieve shorter downloading times. We point out the need for a tradeoff between optimizing AS-related performance, such as intra-AS traffic, and achieving fairness among peers, such as peer burden fairness. We also find that continuous seeding cannot improve the downloading time of standard BitTorrent, but can significantly improve it for the locality policies. It can also reduce the number of connected peers for the choker and piece picker locality policies.

The rest of this paper is organized as follows. We first discuss related work in Sec. II to provide background for our study. Sec. III presents the evaluation methodology. We present the findings of our experiments in Sec. IV and conclude the paper in Sec. V.

II. RELATED WORK

We summarize previous works in four key areas related to

our research: (1) BitTorrent system and studies which analyze

and measure it, (2) P2P streaming solutions and efforts to

adapt BitTorrent to streaming applications, (3) studies on

locality-aware P2P solutions, and (4) studies on inferring

Internet AS relationships.

Many analytical studies [10], [11], [12] have shown that BitTorrent is nearly optimal in terms of user-experienced downloading time. In particular, [12] shows that the optimistic unchoking policy and the rarest-first policy are in fact unnecessary for BitTorrent to achieve asymptotic optimality in terms of user population, where random peer and piece selection would suffice. The near-optimal performance of BitTorrent is also confirmed in many simulation and measurement studies, e.g., [13], [14]. However, all these studies, faithful to the original BitTorrent design, do not consider the issues of locality and ISP friendliness.

An extensive body of work has proposed P2P solutions to the large-scale distribution problem in the context of multimedia streaming. Many of these works were developed under the term overlay multicast, which we consider equivalent to P2P streaming. Narada [15] is the pioneering work promoting the use of peer resources to replace infrastructure support, i.e., IP multicast. Other notable works include Bullet [16] and SplitStream [19]. P2P solutions have been shown to handle a variety of application scenarios, including live streaming such as conferencing, and on-demand streaming such as video-on-demand. In particular, peer-side caching is widely used to address the asynchronous request problem in on-demand streaming. oStream [20] utilizes application-layer multicast and peer buffering to support video-on-demand. A cache-and-relay architecture is proposed in [21]. Advanced coding schemes are also applied to increase system resilience or throughput, such as multiple description coding [22], erasure coding [23], rateless coding [24], and network coding [25]. All these solutions build on a certain distribution structure, e.g., a single tree, multiple trees, or a mesh. They are dynamic enough to withstand peers joining or leaving, but have a clearly defined parent-child relationship between any pair of connected peers. This is in contrast with the BitTorrent design, where peers exchange data within a much less structured swarm.

Interestingly enough, BitTorrent has been highly influential in the design and development of many modern commercial P2P streaming systems [3], [4], [26], which adopt a receiver-driven piece selection approach. Given BitTorrent’s status as the de facto P2P downloading protocol and as high-quality open-source software, there have been several proposals to directly modify it to support the “viewing-while-downloading” feature, such as BASS [27], BiToS [28], and Toast [29]. The basic idea of these works is to restrict piece picking to a moving window that advances with the playback, an idea also adopted by our research. These previous works primarily focus on server load reduction with the aid of P2P downloading, and show via simulation and system deployment that significant savings are achievable. In our work, we also explore other performance-enhancement dimensions such as locality-aware P2P downloading. BitTorrent Inc. also promotes DNA [30], a content distribution solution which claims to support “viewing-while-downloading” as well. However, its technical details remain unknown at the time of writing.

There have been several proposals for locality-aware P2P solutions. Bindal et al. [8] propose a biased neighbor selection mechanism to reduce inter-ISP traffic, which requires no dedicated central servers. Karagiannis et al. [5] show that locality-aware P2P solutions can significantly alleviate the cost induced at the ISP. The work by Ren et al. [6] confirms the benefits gained by peer-relay in VoIP, and then proposes an AS-aware peer-relay protocol for P2P VoIP systems. Huang et al. [7] confirm the benefits of peer-assisted VoD, and show that locality-aware P2P solutions can reduce inter-ISP traffic. The works in [17], [18] propose practical methods to provide Internet locality topology information to peers. However, there is little work evaluating the impact of locality-aware P2P solutions based on the real-world Internet AS topology.

Finally, there are many works on inferring the Internet AS topology, which we simply leverage. The work by Gao [31] is the first comprehensive study on this topic. By analyzing BGP table entries, it finds the valley-free property of Internet AS paths, and further identifies relationships between neighboring AS pairs. Spring et al. [32] present a technique for mapping the router-level topology of an ISP or a focused portion of the Internet, which uses only end-to-end traceroute measurements. Dimitropoulos et al. [33] introduce heuristics to address the problems of inferring peer-to-peer and sibling relationships. They also validate the inferred AS relationships. Although there are many works on inferring Internet AS relationships, the data set of inferred AS relationships is still incomplete, as we will see in Sec. III.

III. EVALUATION METHODOLOGY

In this section, we present our methodology for evaluating locality-aware P2P solutions. We first introduce how we obtain the AS-level map, the topological foundation upon which our study is performed. We then describe the key issues settled when planning the evaluation. Finally, we introduce how we build locality-awareness into the BitTorrent system.

A. Obtaining AS-level Map

Our experiment is conducted over real-world Internet

topology. We construct an AS-level map on the PlanetLab

testbed[9]. Fig. 1 illustrates this process step by step.

start IP Paths AS Paths AS Paths

(with AS

relationship)

1 32

(1) planet-lab

(2) http://www.cymru.com/BGP/asnlookup.html

(3) http://as-rank.caida.org/ and Valley Free

Fig. 1. Step-by-step Process of Obtaining AS-Level Map

First, we run traceroute between each pair of PlanetLab nodes to obtain the IP-level end-to-end path between them. We then assemble these paths into an IP-level map. Some nodes are eliminated from the map because the traceroute program fails to return an IP-level path for them.

Second, we convert each IP path obtained in step one into an AS path. For each IP address that appears in the IP-level map, we find its AS number through a public AS-lookup service such as the one run by Team Cymru [34]. Since an end-to-end path traverses multiple ASes linearly, we aggregate consecutive IP addresses with the same AS number into a single AS node. In this way, we condense each IP-level path into an AS-level path, and thereby transform the IP-level map into an AS-level map. A small number of nodes are further eliminated due to AS lookup failures.
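The aggregation in step two can be sketched as follows. This is a minimal illustration, not the authors' tooling: the function name, the lookup table, and the addresses and AS numbers (taken from ranges reserved for documentation) are all assumptions made for the example.

```python
from itertools import groupby


def ip_path_to_as_path(ip_path, ip_to_asn):
    """Collapse an IP-level path into an AS-level path.

    Returns None when any hop has no AS mapping, mirroring the
    elimination of nodes whose AS lookup fails.
    """
    asns = []
    for ip in ip_path:
        asn = ip_to_asn.get(ip)
        if asn is None:
            return None
        asns.append(asn)
    # groupby merges each run of consecutive, identical AS numbers
    # into a single AS node.
    return [asn for asn, _run in groupby(asns)]


# Hypothetical lookup results (documentation-only addresses and ASNs).
ip_to_asn = {
    "192.0.2.1": 64496,
    "192.0.2.2": 64496,
    "198.51.100.1": 64497,
    "203.0.113.1": 64498,
    "203.0.113.2": 64498,
}
ip_path = ["192.0.2.1", "192.0.2.2", "198.51.100.1", "203.0.113.1", "203.0.113.2"]
as_path = ip_path_to_as_path(ip_path, ip_to_asn)
print(as_path)  # [64496, 64497, 64498]
```

Five IP hops condense into three AS nodes here, in the same way that the eight IP addresses of Fig. 2 condense into three ASes.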

Third, we annotate the AS-level map with the AS relationship data provided by CAIDA [35]. Each adjacent AS pair must have one of the following four kinds of relationships: Provider-Customer, Customer-Provider, Peer-to-Peer, and Sibling. Each customer AS should pay its provider AS for both inbound and outbound traffic. Traffic across peer-to-peer ASes or sibling ASes is usually free. The CAIDA dataset identifies the relationships of 70% of the AS pairs in our AS-level map.

To identify the relationships of the remaining AS pairs, we apply the “valley-free” property proposed in [31]. In brief, suppose we represent our AS-level map as a hierarchical structure in which every AS is positioned lower than its providers, higher than its customers, and at the same level as its siblings and peers. Then no AS-level path should form a valley: the path should start with zero or more customer-provider pairs, followed by zero or more peer-to-peer pairs, and end with zero or more provider-customer pairs, while sibling pairs can appear anywhere in the path. Assuming all AS-level paths follow this property, we raise the percentage of identified AS pairs to 90%. Finally, we eliminate the unidentified AS pairs from the map.
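The valley-free rule as stated above can be checked with a small state machine. The sketch below is illustrative only: the text does not spell out how the unidentified pairs are labeled, so `consistent_labels` shows one plausible use of the property (keeping only the labels that leave a path valley-free), and all names are assumptions.

```python
C2P = "customer-provider"
P2P = "peer-to-peer"
P2C = "provider-customer"
SIB = "sibling"


def is_valley_free(relationships):
    """Check the link relationships of one AS path, in path order."""
    phase = 0  # 0 = climbing (C2P), 1 = peering (P2P), 2 = descending (P2C)
    for rel in relationships:
        if rel == SIB:
            continue  # sibling links may appear anywhere
        if rel == C2P:
            if phase != 0:
                return False  # climbing again after peering or descending
        elif rel == P2P:
            if phase > 1:
                return False  # peering after the descent has started
            phase = 1
        elif rel == P2C:
            phase = 2
        else:
            raise ValueError(f"unknown relationship: {rel!r}")
    return True


def consistent_labels(relationships, unknown_index):
    """Labels for one unidentified link that keep the path valley-free."""
    return [
        rel
        for rel in (C2P, P2P, P2C, SIB)
        if is_valley_free(
            relationships[:unknown_index] + [rel] + relationships[unknown_index + 1:]
        )
    ]


# A path that goes up to a shared provider and then down is valley-free.
print(is_valley_free([C2P, P2C]))           # True
print(is_valley_free([P2C, C2P]))           # False: this is a valley
print(consistent_labels([P2C, None], 1))    # ['provider-customer', 'sibling']
```

When several paths share the same unidentified link, intersecting the label sets obtained from each path narrows the candidates further.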

[Figure: IP path and AS path between planetlab1.millennium.berkeley.edu and planet2.scs.stanford.edu. IP addresses labeled in the figure: 169.229.50.3, 169.229.50.1, 171.66.3.182, 171.64.1.138, 137.164.27.158, 137.164.27.129, 128.32.0.38, 169.229.51.230. ASes on the AS path: AS 25 (UC Berkeley), AS 2153 (California State University Network), AS 32 (Stanford University). AS 25 and AS 32 are each labeled “Customer AS of AS 2153”; AS 2153 is labeled “Provider AS of both AS 25 and AS 32”.]

Fig. 2. A Processing Example

In Fig. 2, we illustrate the above steps using a sample PlanetLab pair, “planetlab1.millennium.berkeley.edu” and “planet2.scs.stanford.edu”. In this example, the IP path consists of eight IP addresses, which are located in three consecutive ASes: UC Berkeley (AS 25), California State University Network (AS 2153), and Stanford University (AS 32). AS 2153 is the provider of both AS 25 and AS 32. Therefore, for traffic from AS 25 to AS 32 or vice versa, both of them are charged by AS 2153.

B. Evaluation Setup

We evaluate various locality-aware P2P solutions on the AS-level map we have obtained, either through simulation or through system deployment on PlanetLab. In what follows, we discuss a few key issues we settled when planning the evaluation. We start with the application scenarios our evaluation covers, followed by the choice of primary and secondary performance metrics, and finally the optimal strategy we derive based on our choice of the primary performance metric.

1) Downloading vs. On-demand Streaming: The basic setup in each experiment of our evaluation is to have a group of peers download a video file from a seed (a peer possessing a complete copy of the file). Under this basic setting, we mainly evaluate two common scenarios, downloading and on-demand streaming.

In the downloading scenario, we assume that all peers become interested in the file at the same time. Besides mimicking a flash crowd, having all peers join the P2P network simultaneously avoids the temporal dependency problem, where a new peer has to download from an earlier-joined peer. Instead, under the current setting, all peers start downloading under the same condition, where only a single copy is available at the seed, and race to finish. From the end users’ perspective, the most important metric is downloading time, i.e., the time from the start of downloading until the file is fully downloaded.

In the on-demand streaming scenario, we simulate a video-on-demand application, where all peers are interested in viewing one video file and start viewing it at different times. In addition, this scenario supports the “viewing-while-downloading” feature, where the video file must be downloaded in an approximately sequential fashion. In this scenario, temporal dependency plays a much more important role: a peer is more likely to download from an earlier-joined peer, unless a later-joined peer downloads so much faster that it overtakes its predecessors. From the end users’ perspective, the viewing experience becomes the most important factor, i.e., video viewing should be continuous. To achieve this, downloading should be no slower than the video playback speed; otherwise, viewing is interrupted. In Sec. IV, we will introduce “interruption time”, a metric that describes this phenomenon unique to streaming.

2) Performance Metrics: Besides user-oriented metrics such as downloading time or interruptions, we must choose metrics matching the theme of this work: locality-awareness. In this regard, our primary choice is AS hop count, the number of ASes a data piece traverses from its sender to its receiver. This generalizes the intra-ISP traffic metric used in previous works. If a piece only travels within a single ISP, its AS hop count is 0. Otherwise, its value is a positive integer representing the number of ASes it has traveled. From this basic definition, we can also derive weighted AS hop count, which is the average AS hop count over all pieces downloaded by a peer.
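As a small worked example of these two metrics, the sketch below counts hops as the number of AS boundaries crossed, which is one reading of the definition that gives 0 for a piece staying inside a single AS. That reading, the function names, and the sample paths are assumptions for illustration.

```python
def as_hop_count(as_path):
    """AS hop count of one piece, given the AS path it traveled.

    Assumption: a hop is one AS boundary crossed, so a path that stays
    inside a single AS has hop count 0.
    """
    if not as_path:
        raise ValueError("as_path must contain at least one AS")
    return len(as_path) - 1


def weighted_as_hop_count(piece_paths):
    """Average AS hop count over all pieces a peer has downloaded."""
    if not piece_paths:
        raise ValueError("no pieces downloaded")
    return sum(as_hop_count(path) for path in piece_paths) / len(piece_paths)


# A peer in AS 25 receives two pieces from inside its own AS and two
# pieces from AS 32 via AS 2153 (the ASes of the Fig. 2 example).
piece_paths = [[25], [32, 2153, 25], [32, 2153, 25], [25]]
print(weighted_as_hop_count(piece_paths))  # 1.0
```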

We also consider redundancy, proposed in [8], which measures the number of times a piece has to enter an ISP until all peers in the ISP finish their downloading or streaming. The lowest value is 1, which means that the piece only needs to enter the ISP once, and the peers within the ISP are able to distribute it among themselves without asking for additional help from outside. Conversely, the highest value is N, the number of peers within the ISP. We also propose normalized redundancy, which is the redundancy normalized by the number of peers. We use this metric to measure the relative effectiveness of a P2P solution at keeping traffic within a single ISP. Finally, we note that these redundancy metrics only apply to ISPs that contain peers, not to ISPs that only carry transit traffic.
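The redundancy metrics can be computed from a log of piece transfers. The sketch below is an assumption-laden illustration: the log format is hypothetical, and averaging the per-piece entry counts over all pieces of the file is our choice for turning the per-piece definition into a single value per ISP.

```python
def redundancy(transfers, isp, num_pieces):
    """Average number of times a piece enters `isp` from outside.

    transfers: iterable of (piece_id, sender_isp, receiver_isp) tuples
    (a hypothetical log format).
    """
    entries = sum(
        1 for _piece, src, dst in transfers if dst == isp and src != isp
    )
    return entries / num_pieces


def normalized_redundancy(transfers, isp, num_pieces, num_peers):
    """Redundancy divided by the number of peers inside the ISP."""
    return redundancy(transfers, isp, num_pieces) / num_peers


# ISP "A" hosts 3 peers; the file has 2 pieces.
transfers = [
    (0, "B", "A"), (0, "A", "A"), (0, "A", "A"),  # piece 0 enters A once
    (1, "B", "A"), (1, "B", "A"), (1, "C", "A"),  # piece 1 enters A three times
]
r = redundancy(transfers, "A", num_pieces=2)
nr = normalized_redundancy(transfers, "A", num_pieces=2, num_peers=3)
print(r)  # 2.0
```

Piece 0 reaches the lowest value of 1 and piece 1 the highest value of N = 3, so the ISP-level redundancy in this example is 2.0.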

To evaluate the economic impact different P2P solutions have on ISPs, we also introduce gain/cost, which is the financial gain or loss of an ISP. An ISP gains by carrying incoming/outgoing traffic for its customer ISPs; likewise, a customer ISP loses by asking its provider ISP to relay its incoming/outgoing traffic. The traffic between a pair of peering or sibling ISPs is not counted. Since the financial charges agreed upon by ISPs are unknown, we assume all provider ISPs charge at the same rate and use the number of bytes to represent the gain or cost.
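The bookkeeping behind gain/cost can be sketched as a per-AS ledger in bytes. The function and the ledger layout below are illustrative assumptions; the example reuses the AS 25, AS 2153, AS 32 path of Fig. 2.

```python
from collections import defaultdict


def account_transfer(ledger, as_path, relationships, num_bytes):
    """Update per-AS gain (+) and cost (-) for one transfer.

    relationships[i] describes the link as_path[i] -> as_path[i + 1].
    Peer-to-peer and sibling links are not counted.
    """
    links = zip(as_path, as_path[1:])
    for (a, b), rel in zip(links, relationships):
        if rel == "customer-provider":    # a is the customer of b
            ledger[a] -= num_bytes
            ledger[b] += num_bytes
        elif rel == "provider-customer":  # a is the provider of b
            ledger[a] += num_bytes
            ledger[b] -= num_bytes


ledger = defaultdict(int)
account_transfer(
    ledger,
    as_path=[25, 2153, 32],
    relationships=["customer-provider", "provider-customer"],
    num_bytes=1000,
)
print(dict(ledger))  # {25: -1000, 2153: 2000, 32: -1000}
```

Both customer ASes are charged for the same 1000 bytes, so the provider's gain is twice the transferred volume.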

3) Minimum-AS-hop Strategy: With AS hop count as the primary performance metric, we are able to derive the optimal P2P strategy, which minimizes the total number of AS hops. We are able to find the optimal strategy for both the downloading and on-demand streaming scenarios.

For the downloading scenario, the optimal strategy first constructs a complete graph, where each node represents a peer and each edge weight represents the AS hop count between the pair of peers at its two ends. It then finds the minimum spanning tree of this graph, which is the P2P distribution structure that minimizes the AS hop count. We also note that this structure minimizes the redundancy value for each ISP, since all peers within the same ISP, except one, choose an edge whose weight is 0. Therefore, the algorithm can be accelerated by aggregating all peers within a single ISP into a cluster node and finding the minimum spanning tree among these cluster nodes.
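A minimal sketch of this construction is given below, using Prim's algorithm as one standard way to compute a minimum spanning tree; the text does not name a specific algorithm, and the function name and the hop-count matrix are assumptions for illustration.

```python
def minimum_as_hop_tree(hops, seed=0):
    """Prim's algorithm on a complete graph of peers.

    hops[u][v] is the AS hop count between peers u and v (symmetric).
    Returns {child: parent}, rooted at the seed.
    """
    n = len(hops)
    in_tree = [seed]
    parent = {}
    while len(in_tree) < n:
        # Pick the cheapest edge from the tree to a peer outside it.
        u, v = min(
            ((u, v) for u in in_tree for v in range(n) if v not in in_tree),
            key=lambda edge: hops[edge[0]][edge[1]],
        )
        parent[v] = u
        in_tree.append(v)
    return parent


# Peers 0 (seed) and 1 share one AS; peers 2 and 3 share another AS
# that is 2 AS hops away.
hops = [
    [0, 0, 2, 2],
    [0, 0, 2, 2],
    [2, 2, 0, 0],
    [2, 2, 0, 0],
]
parent = minimum_as_hop_tree(hops)
total_hops = sum(hops[p][c] for c, p in parent.items())
print(total_hops)  # 2
```

Only one tree edge crosses the AS boundary, so each piece enters the second AS exactly once, which matches the redundancy argument above. This simple version scans all candidate edges in every round; clustering peers by ISP first, as suggested in the text, shrinks the graph it has to scan.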

For the on-demand streaming scenario, the minimum spanning tree algorithm still applies with a minor modification: each peer must choose, only from earlier-joined peers, the one with the minimum AS hop count. In other words, the complete graph described in the previous paragraph becomes directed, where each edge points from the earlier-joined peer to the later-joined peer. Because this graph is acyclic, the above simple procedure is optimal: no loop-removing step is needed, unlike in the minimum spanning tree algorithm for general directed graphs. Under this strategy, the redundancy value is also minimized for each ISP.
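The streaming variant reduces to a greedy parent choice in join order. The sketch below is illustrative; the peer names, the AS assignment, and the `hop_count` helper are hypothetical.

```python
def streaming_parents(join_order, hop_count):
    """Each peer picks, among earlier-joined peers, the closest one.

    join_order: peer ids in order of arrival, with the seed first.
    hop_count(a, b): AS hop count between two peers.
    """
    parent = {}
    for i in range(1, len(join_order)):
        peer = join_order[i]
        earlier = join_order[:i]
        parent[peer] = min(earlier, key=lambda p: hop_count(p, peer))
    return parent


# Hypothetical swarm: two ASes, "A" and "B", 2 AS hops apart.
peer_as = {"seed": "A", "p1": "B", "p2": "A", "p3": "B"}


def hop_count(a, b):
    return 0 if peer_as[a] == peer_as[b] else 2


parents = streaming_parents(["seed", "p1", "p2", "p3"], hop_count)
print(parents)  # {'p1': 'seed', 'p2': 'seed', 'p3': 'p1'}
```

Peer p1 has no choice but the remote seed, while p3 can stay inside its own AS by attaching to p1.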

The optimal distribution structure for both scenarios is a single tree, where a peer downloads the entire content from its only parent, the one with the minimum AS hop count. This structure suffers from all the drawbacks of a tree solution, such as the lowest level of resilience. Also, no degree constraint is enforced, which means a peer might be required to upload to an unlimited number of children. Nevertheless, the minimum-AS-hop strategy constitutes the theoretical baseline against which other, realistic P2P solutions can be measured in terms of AS hop count. Given the static nature of these structures, we can easily obtain their performance through simulation on the AS-level map, instead of implementing and deploying them on PlanetLab.

C. Locality-Aware BitTorrent

As mentioned in Sec. I, a BitTorrent peer operates at three levels: neighbor selection at the macroscopic level, peer choking/unchoking at the intermediate level, and piece selection at the microscopic level. In what follows, we first review the design of the original BitTorrent in these aspects, then propose our modifications to bring locality-awareness into each of them. Finally, we discuss how we adapt BitTorrent to serve video streaming applications.

1) Overview: During the neighbor selection process, a peer learns from the tracker about other peers in the same swarm, which is the group of peers interested in the same file. The tracker returns a peer list to the inquiring peer, which contains the IP addresses and port numbers of at most a constant number (default value 50) of peers. If the population of the swarm exceeds this number, the selection is entirely random. Upon receiving the peer list, a peer connects to most of them, up to a maximum of 35. It then sends its neighbors bitfield messages, which advertise the pieces it already owns. Based on the have messages collected from its neighbors, each peer maintains an array of interest values, where each entry is the number of neighbors owning the corresponding piece. If a neighbor has pieces that the peer needs, the peer informs that neighbor with an interested message.

Peer choking/unchoking is the decision process by which a peer determines which of its “interested” neighbors it should send data to. It first sends a choke message to most of its neighbors, which means it refuses to send them data. It then sends an unchoke message to a small number (default value 4) of neighbors which have sent data to it at the highest rates, a criterion best known as tit-for-tat. Once a peer becomes a seed (a peer that has the complete file), it unchokes the 4 neighbors with the highest downloading rates from itself, in order to speed up downloading across the entire swarm. Finally, a peer performs optimistic unchoking by sending an unchoke message to a random peer, which is crucial for bootstrapping brand new peers with nothing to share yet.
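The selection described above can be sketched as a single function. This is a simplified illustration rather than the code of any BitTorrent client: the names and the rate table are hypothetical, and the caller is assumed to pass the rate at which each neighbor uploads to the local peer (for a downloading peer) or downloads from it (for a seed).

```python
import random


def choose_unchoked(interested, rate, slots=4, rng=random):
    """Return (regular unchokes, optimistic unchoke).

    interested: neighbors that have sent an interested message.
    rate: neighbor -> observed transfer rate (see the lead-in).
    """
    regular = sorted(interested, key=lambda n: rate.get(n, 0.0), reverse=True)[:slots]
    rest = [n for n in interested if n not in regular]
    # Optimistic unchoke: one random neighbor outside the regular set,
    # which gives brand new peers a chance to obtain their first pieces.
    optimistic = rng.choice(rest) if rest else None
    return regular, optimistic


interested = ["a", "b", "c", "d", "e", "f"]
rate = {"a": 90.0, "b": 80.0, "c": 70.0, "d": 60.0, "e": 5.0, "f": 0.0}
regular, optimistic = choose_unchoked(interested, rate, rng=random.Random(0))
print(regular)  # ['a', 'b', 'c', 'd']
```

Every neighbor outside the returned sets stays choked until the next round.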

The piece picking policy is executed by each peer that is unchoked. BitTorrent employs the rarest-first policy, in which the piece with the minimum interest value is chosen. If multiple pieces have the same minimum interest value, such as 1, the tie is broken by choosing one at random. An example illustrating this policy is given in Fig. 3 (a). The greatest benefit of this policy is that it promotes piece diversity across the entire swarm by helping to distribute the rare pieces. A diverse swarm enables concurrent downloading over multiple connections, thus increasing the aggregate downloading speed.
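A minimal sketch of rarest-first picking follows. The data layout (sets of piece indices per neighbor) and the function names are assumptions made for the example, not the structures of an actual client.

```python
import random


def interest_values(neighbor_pieces, num_pieces):
    """interest[i] = number of neighbors that own piece i."""
    return [
        sum(1 for owned in neighbor_pieces.values() if i in owned)
        for i in range(num_pieces)
    ]


def pick_rarest_first(have, sender_pieces, interest, rng=random):
    """Pick the piece to request from one unchoking neighbor."""
    candidates = [p for p in sender_pieces if p not in have]
    if not candidates:
        return None
    rarest = min(interest[p] for p in candidates)
    # Break ties among equally rare pieces at random.
    return rng.choice([p for p in candidates if interest[p] == rarest])


neighbor_pieces = {"n1": {0, 1, 2}, "n2": {1, 2}, "n3": {2, 3}}
interest = interest_values(neighbor_pieces, num_pieces=4)
print(interest)  # [1, 2, 3, 1]

# The local peer already has piece 0 and has been unchoked by n3.
choice = pick_rarest_first({0}, neighbor_pieces["n3"], interest)
print(choice)  # 3
```

Neighbor n3 offers pieces 2 and 3; piece 3 is owned by only one neighbor, so it is requested first.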

All of the above processes are repeated at different time granularities. Neighbor selection is renewed if the current active neighbor count drops below a certain threshold (default value 20) due to peers leaving. The choking/unchoking process is repeated every 10 seconds, and optimistic unchoking every 30 seconds. Piece picking is executed every time an unchoked connection becomes available again to download a new piece.

[Figure: two rows of pieces annotated with numeric values. Panels: (a) Rarest-First Policy; (b) Locality-First Policy. Labels: Playback Window, Downloaded Piece, Undownloaded Piece, Piece Chosen by the Policy, Interest Value, Distance Value.]

Fig. 3. BitTorrent Piece Picking Policies

2) Tracker Locality: To promote locality awareness, we replace the random peer selection with a selection that minimizes the AS hop count metric. Upon a peer’s request, the tracker sorts all other peers in the swarm in ascending order of their distance to the requesting peer in terms of AS hop count. The tracker sends the prefix of this sorted list (e.g., the first 50 peers) to the requesting peer. We note that this solution bears great similarity to the one proposed in [8], where 35 peers within the same ISP (AS hop count 0) are returned together with 15 other random peers.

The tracker not only returns the peer list in the aforementioned fashion, but also attaches the distance values to the list in an add-on field, for the following reason. The distance between two peers, if measured by AS hop count, does not change unless a peer changes its ISP or the AS-level map is updated. Therefore, an existing peer can quickly accumulate such knowledge as it contacts more peers. However, a newly joined peer needs to learn its distance to other peers as early as possible in order for the other locality-aware policies, which we present next, to function. While a peer can learn the distance to other peers on its own by probing them, the tracker is an ideal entity to collect and disseminate such information. By the same token, the AS-level map is best maintained by the tracker. The distance values can be learned through any commercial package able to map an IP address to the ISP it belongs to [36].
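The tracker-side behavior can be sketched as follows. The function name, the peer-to-AS table, the AS distances, and the representation of the add-on field as a (peer, distance) pair are all assumptions made for illustration.

```python
def locality_peer_list(requester, swarm, hop_count, limit=50):
    """Return the closest peers, each with its AS hop count attached."""
    others = [p for p in swarm if p != requester]
    # sorted() is stable, so equally distant peers keep their swarm order.
    ranked = sorted(others, key=lambda p: hop_count(requester, p))
    return [(p, hop_count(requester, p)) for p in ranked[:limit]]


# Hypothetical swarm and AS-level distances kept by the tracker.
peer_as = {"r": "A", "x": "B", "y": "A", "z": "C"}
as_hops = {
    frozenset(["A", "B"]): 2,
    frozenset(["A", "C"]): 1,
    frozenset(["B", "C"]): 3,
}


def hop_count(p, q):
    a, b = peer_as[p], peer_as[q]
    return 0 if a == b else as_hops[frozenset([a, b])]


peer_list = locality_peer_list("r", list(peer_as), hop_count, limit=2)
print(peer_list)  # [('y', 0), ('z', 1)]
```

Because the distances travel with the list, a newly joined peer can apply distance-based policies immediately, without probing its neighbors first.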

3) Choker Locality: We redefine the peer choking/unchoking policy by making a peer unchoke the 4 neighbors that are closest to itself in terms of AS hop count. Although the distance between two peers seldom changes, this policy will not result in the same selection of peers again and again. The primary reason is that the choking/unchoking decision is only made among interested neighbors, i.e., the peers whose piece collections differ from that of the local peer. Since the piece collection of each peer grows over time, the set of interested neighbors constantly changes too. As such, this policy enables a peer to exchange data with its close-by neighbors in a rotating fashion. The radius of this neighbor set is primarily determined by its population: the more peers are concentrated in a compact neighborhood, say a single ISP, the smaller the radius of the neighbor set needs to be.
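As a rough illustration of this policy, the sketch below picks the unchoked set from the currently interested neighbors only. The names `pick_unchoked` and `UNCHOKE_SLOTS` are hypothetical, and breaking ties by peer id is an assumption added to keep the example deterministic.

```python
from typing import Dict, Iterable, List

UNCHOKE_SLOTS = 4  # number of closest neighbors to unchoke, as stated in the text


def pick_unchoked(interested: Iterable[str],
                  distance: Dict[str, int],
                  slots: int = UNCHOKE_SLOTS) -> List[str]:
    """Unchoke the `slots` interested neighbors with the smallest AS hop count."""
    # Tie-break on peer id (assumption) so equal-distance peers sort predictably.
    return sorted(interested, key=lambda p: (distance[p], p))[:slots]


# Hypothetical neighbor distances; they never change during the example.
distance = {"p1": 0, "p2": 3, "p3": 1, "p4": 2, "p5": 1, "p6": 4}
round1 = pick_unchoked(["p1", "p2", "p3", "p4", "p5", "p6"], distance)
# p1 drops out of the interested set once the piece collections match,
# so the selection rotates although no distance has changed.
round2 = pick_unchoked(["p2", "p3", "p4", "p5", "p6"], distance)
print(round1, round2)
```

The rotation comes entirely from the changing interested set, which is why a static distance metric does not freeze the choice of unchoked peers.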

The same unchoking policy applies to the seed. In this phase, the selection of unchoked peers is much more stable. A seed keeps unchoking the 4 closest neighbors that have not yet finished downloading. Only when a neighbor finishes downloading all pieces does the seed choose the next closest neighbor to unchoke. This is in sharp contrast with the original BitTorrent design, which gives priority to uploading speed. By giving priority to distance, our policy enables a seed to send pieces to its closest neighbors first. If all peers in the swarm have homogeneous uploading speeds, then the peers closest to the seed are highly likely to be the first to finish downloading and become seeds too. As a result, seeding propagates outward in a growing radius centered on the original seed.

Finally, we keep the original BitTorrent optimistic unchoking policy intact, for the same purpose of helping bootstrap brand-new peers.

4) Piece Picker Locality: We propose a locality-first policy to encourage a peer to download the pieces closest to itself. As shown in Fig. 3 (b), we introduce a distance value for each piece, which is the mean of the distances of all peers possessing this piece. For example, if a piece is owned by three peers (i.e., its interest value is 3), and the AS hop counts from these peers to the downloading peer are 1, 2, and 3, respectively, then the distance value associated with this piece is 2. The locality-first policy chooses the piece with the smallest distance value. While the rarest-first policy promotes piece diversity within the swarm regardless of the distance a piece travels, the locality-first policy encourages a piece to be distributed by gradually enlarging its radius centered around the seed.
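The locality-first rule can be written down in a few lines. This is a sketch under stated assumptions: `piece_distance` and `locality_first` are hypothetical names, pieces that no neighbor owns are skipped, and ties go to the lowest piece index, none of which is specified in the text.

```python
from statistics import mean
from typing import Dict, Optional, Set


def piece_distance(owners: Set[str], distance: Dict[str, int]) -> float:
    """Mean AS hop count from the downloading peer to all owners of a piece."""
    return mean(distance[p] for p in owners)


def locality_first(wanted: Dict[int, Set[str]],
                   distance: Dict[str, int]) -> Optional[int]:
    """Pick the wanted piece whose owners are closest on average."""
    available = {i: o for i, o in wanted.items() if o}  # skip pieces nobody owns
    if not available:
        return None
    # Lowest piece index wins ties (assumption).
    return min(available,
               key=lambda i: (piece_distance(available[i], distance), i))


# The example from the text: three owners at 1, 2, and 3 AS hops.
distance = {"a": 1, "b": 2, "c": 3}
print(piece_distance({"a", "b", "c"}, distance))
print(locality_first({0: {"c"}, 1: {"a", "b"}, 2: set()}, distance))
```

In the second call, piece 1 is chosen because its owners average 1.5 hops, against 3 hops for piece 0.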

We note that the distance value can be easily calculated by a downloading peer, as long as it remembers its distances to its neighbors, which are passed by the tracker during the neighbor selection process. Upon receiving a have message from a neighbor regarding a particular piece, the peer increments the interest value of this piece and recalculates the piece's average distance value, taking into account the distance of the neighbor that just announced the have message.
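One way to maintain these per-piece values without rescanning all owners is a running mean updated on each have message. The sketch below is illustrative only: `PieceStats`, `handle_have`, and the module-level `stats` table are hypothetical names, and the running-mean formula is one possible way to do the recalculation described above.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class PieceStats:
    interest: int = 0       # number of neighbors known to hold the piece
    distance: float = 0.0   # running mean AS hop count to those neighbors

    def on_have(self, peer_distance: int) -> None:
        """Fold one more owner into the running mean."""
        self.interest += 1
        self.distance += (peer_distance - self.distance) / self.interest


stats: Dict[int, PieceStats] = {}


def handle_have(piece_index: int, peer: str, distance: Dict[str, int]) -> None:
    """Update the piece's interest and distance when `peer` announces it."""
    stats.setdefault(piece_index, PieceStats()).on_have(distance[peer])


# Distances as delivered by the tracker (hypothetical values).
tracker_distances = {"a": 1, "b": 2, "c": 3}
for peer in ("a", "b", "c"):
    handle_have(7, peer, tracker_distances)
print(stats[7])
```

After the three announcements the piece has interest 3 and a mean distance of 2, matching the worked example for the locality-first policy.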

5) Adaptation to Streaming Applications: To adapt BitTorrent to the streaming scenario, we must make it support the “viewing-while-downloading” feature. The primary reason for the incompatibility between BitTorrent and the streaming scenario is its piece picking policy. Both the rarest-first and the locality-first policies ignore the position of a piece in the video playback. In the worst case, the first piece might not be downloaded until all other pieces are, which makes the video unable to play until the whole file is fetched. Our solution is in accordance with existing proposals [27], [28], [29], which restrict the piece picking action to a window marching with the video playback. This window-based solution applies to any type of piece picking policy. In other words, we can deem the original rarest-first and locality-first policies an extreme case of the window-based solution, where the window size equals the entire file.

In order to keep up with the playback, the window must advance. In our solution, the window is automatically pushed forward whenever its leftmost piece is downloaded. Therefore, the window might advance ahead of the playback if the BitTorrent downloading speed is faster than the playback rate, or fall behind the playback otherwise. We note that this design, like all pure P2P-based solutions, is too primitive to guarantee smooth playback. To address this issue, [29] proposes a hybrid solution integrating P2P and client-server downloading. Here, a stream-watcher process monitors the downloading progress. If the download falls behind the playback, the stream-watcher pushes the window by downloading the leftmost pieces from a video-on-demand server until the download catches up with the playback again. In addition, the stream-watcher needs external information, such as the streaming rate of the video file, to push the window at a proper speed.
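The window mechanism itself (without the stream-watcher of [29]) can be sketched as a thin wrapper around any piece picking policy. The class name `WindowPicker` and its methods are hypothetical; the policy is passed in as a plain function over the candidate indices, which is a simplification of a real picker.

```python
from typing import Callable, List, Optional, Set


class WindowPicker:
    """Restrict any piece-picking policy to a sliding window of piece indices."""

    def __init__(self, num_pieces: int, window_size: int):
        self.num_pieces = num_pieces
        self.window_size = window_size
        self.start = 0                # index of the leftmost piece in the window
        self.have: Set[int] = set()   # pieces already downloaded

    def candidates(self) -> List[int]:
        """Missing pieces that currently fall inside the window."""
        end = min(self.start + self.window_size, self.num_pieces)
        return [i for i in range(self.start, end) if i not in self.have]

    def pick(self, policy: Callable[[List[int]], int]) -> Optional[int]:
        """Apply `policy` (e.g. rarest-first or locality-first) within the window."""
        options = self.candidates()
        return policy(options) if options else None

    def on_downloaded(self, index: int) -> None:
        self.have.add(index)
        # Push the window forward while its leftmost piece is present.
        while self.start < self.num_pieces and self.start in self.have:
            self.start += 1


picker = WindowPicker(num_pieces=10, window_size=3)
picker.on_downloaded(1)        # leftmost piece 0 still missing: window stays put
picker.on_downloaded(0)        # window slides past pieces 0 and 1
print(picker.start, picker.candidates())
```

Setting `window_size` equal to `num_pieces` makes every missing piece a candidate, which corresponds to the unrestricted rarest-first and locality-first policies.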

IV. FINDINGS

We start by introducing the experiment setup, then present the performance results related to user experience, followed by the locality-related results. Finally, we examine the results related to peer workload.

A. Experiment Setup

We apply our changes to the BitTorrent source code, version 3.9.1, and deploy it on PlanetLab nodes. We use two PCs at Vanderbilt University as the original seed and the BitTorrent tracker. Both machines run RedHat Linux, one with an Intel Xeon(TM) 2.80GHz CPU (seed) and the other with an Intel Pentium 4 2.80GHz CPU (tracker server). We experiment with the locality-awareness features, namely tracker locality, choker locality, and picker locality, each turned on individually. In all runs of our experiment, we configure each peer, upon finishing downloading, either to leave after 10 minutes or to stay as a seed.

The test file is a flash video file downloaded from a video website, which lasts 28 minutes 28 seconds and has a size of 61889761 bytes. For simplicity, we take its streaming rate to be the ratio of its size to its playback length. On September 5, 2007, this file was requested a total of 53165 times between 18:18:33 and 22:31:59. These statistics were obtained by parsing the webpage hosting the video. Since we are not able to study its request pattern at a finer scale, we set the average request rate of our experiment to the average rate this file experienced during this period, which is around 1.5 requests per second.
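For reference, the streaming rate implied by the stated file size and playback length works out as follows. The constant names are hypothetical, and the conversion to kbit/s uses 1 kbit = 1000 bits.

```python
FILE_SIZE_BYTES = 61_889_761          # size of the test file, from the text
PLAYBACK_SECONDS = 28 * 60 + 28       # 28 minutes 28 seconds

# Streaming rate defined as file size divided by playback length.
rate_bytes_per_s = FILE_SIZE_BYTES / PLAYBACK_SECONDS
rate_kbit_per_s = rate_bytes_per_s * 8 / 1000

print(f"{rate_bytes_per_s:.0f} B/s, about {rate_kbit_per_s:.0f} kbit/s")
```

The result is roughly 36 KB/s, or about 290 kbit/s.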

In the downloading scenario, we schedule all PlanetLab nodes to request this file at the same time. In the on-demand streaming scenario, we schedule the PlanetLab nodes to initiate their requests at this rate. All runs in the streaming scenario follow the same request sequence.
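A reproducible request sequence of this kind could be generated as in the sketch below. The text does not state the arrival process, so the exponential inter-arrival gaps, the function name `request_schedule`, the seed value, and the node count of 150 are all assumptions made for illustration.

```python
import random
from typing import List


def request_schedule(num_nodes: int, rate_per_s: float, seed: int = 0) -> List[float]:
    """Start times in seconds, one per node, with mean spacing 1 / rate_per_s."""
    rng = random.Random(seed)   # private generator: same seed, same sequence
    now = 0.0
    times = []
    for _ in range(num_nodes):
        # Assumed exponential inter-arrival gap (Poisson arrivals).
        now += rng.expovariate(rate_per_s)
        times.append(now)
    return times


# Two runs with the same seed replay the identical request sequence.
run_a = request_schedule(150, 1.5)
run_b = request_schedule(150, 1.5)
print(run_a == run_b, round(run_a[-1], 1))
```

Fixing the seed is what lets different policies be compared against the same sequence of arrivals.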

Due to various reasons such as machine failure or testbed administration, only a subset of the PlanetLab nodes successfully participated in all runs of our experiment. Therefore, we only analyze and present the results obtained from these nodes. Correspondingly, we measure the performance of the minimum-AS-hop strategy by running it only on these nodes in our simulation.

B. User-Perceived Performance Results

1) Downloading Time in Downloading Scenario: Fig. 4 shows the downloading time in the downloading scenario. Fig. 4 (a) shows the case with a seeding time of 10 minutes. In this case, the downloading time is very uneven under the tracker locality (neighbor selection) policy: some peers need less than 200 seconds to finish downloading, while others need nearly 1200 seconds. There are two possible causes. One is that the tracker locality policy is very likely to partition the peers into localized sets; the other is that some peers might have limited network bandwidth. The choker locality and picker locality policies yield more even downloading times. All of these policies download faster than standard BitTorrent. Fig. 4 (b) shows the case of unlimited seeding time. In this case, the choker locality policy downloads faster than the other policies. Again, the tracker locality policy makes the downloading time very uneven among peers. In both the 10-minute and the unlimited seeding time cases, the choker locality policy achieves the shortest and most even downloading time. From these two figures, we find that continuing to seed beyond 10 minutes cannot improve the downloading time of standard BitTorrent, but can significantly improve it for the choker and picker locality policies.

[Fig. 4 plots: Download Time (seconds) versus Peer Index for Standard BT, Choker Locality, Picker Locality, and Tracker Locality; panel (a) Maximum Seeding Time = 10 minutes, panel (b) Unlimited Seeding Time.]

Fig. 4. Downloading Time (Downloading Scenario)

2) Interruptions in Streaming Scenario: Fig. 5 shows the interruption time in the streaming scenario. For each peer, we monitor the pieces received against the playback time during its streaming. If any piece is missing when the playback reaches it, the peer enters the “interruption” stage, in which the playback is starved; it exits this stage when the missing pieces are received. The aggregated interruption time is the sum of the lengths of all interruptions experienced by the peer. This monitoring procedure starts from the point where the first piece is received.
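The monitoring procedure can be approximated by replaying piece arrival times against a playback clock. The sketch below assumes that pieces are played strictly in order and that every piece covers the same playback duration; the function name `aggregate_interruption` and its parameters are hypothetical.

```python
from typing import Sequence


def aggregate_interruption(arrival_times: Sequence[float],
                           piece_duration: float) -> float:
    """Total stall time, given per-piece arrival times in playback order.

    arrival_times[i] is when piece i was received (seconds); each piece
    is assumed to cover `piece_duration` seconds of playback.
    """
    if not arrival_times:
        return 0.0
    clock = arrival_times[0]   # monitoring starts when the first piece arrives
    stalled = 0.0
    for arrival in arrival_times:
        if arrival > clock:            # piece missing when playback reaches it
            stalled += arrival - clock
            clock = arrival            # playback resumes once the piece is in
        clock += piece_duration        # play the piece
    return stalled


# Piece 2 arrives one second after playback needs it; the others are on time.
print(aggregate_interruption([0.0, 1.0, 5.0, 6.0], piece_duration=2.0))
```

In this example the only stall is the one second spent waiting for piece 2, so the aggregated interruption time is 1 second.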

The two panels of Fig. 5 show that, in both the 10-minute and the unlimited seeding time cases, the tracker locality policy causes the largest number of peers to suffer interruptions. The other three policies result in similar interruption experience. Under all of these policies, more than 80% of the peers experience no interruption, and 90% of the peers experience less than 100 seconds of interruption during the whole streaming process. We also find that continuing to seed beyond 10 minutes cannot further improve the interruption time. The overall amounts of data uploaded by the server are 1969MB, 2776MB, 1962MB, and 3107MB for the standard BitTorrent, choker locality, picker locality, and tracker locality policies, respectively. From these numbers, we find that the picker locality policy puts a similar load on the server as standard BitTorrent, whereas the choker locality and tracker locality policies put about 41% and 58% more load on the server, respectively.
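The relative server load can be recomputed directly from the four totals reported above. The dictionary keys are shorthand labels chosen for this example.

```python
# Server upload totals in MB, as reported for the streaming scenario.
server_upload_mb = {"standard": 1969, "choker": 2776, "picker": 1962, "tracker": 3107}

baseline = server_upload_mb["standard"]
# Percentage change in server load relative to standard BitTorrent.
extra = {name: (mb - baseline) / baseline * 100
         for name, mb in server_upload_mb.items()}

for name, pct in extra.items():
    print(f"{name:8s} {pct:+.1f}%")
```

Picker locality stays within half a percent of the baseline, while choker locality and tracker locality come out at roughly 41% and 58% above it.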

[Fig. 5 plots: Aggregate Interruption Time (seconds) versus Peer Index for Standard BT, Choker Locality, Picker Locality, and Tracker Locality; panel (a) Maximum Seeding Time = 10 minutes, panel (b) Unlimited Seeding Time.]

Fig. 5. Interruption (Streaming Scenario)

C. Locality-Related Performance Results

1) AS Hop Count: Fig. 6 shows the weighted average hop count across all downloading paths of a peer in the downloading scenario. The tracker locality policy achieves the lowest AS hop count apart from the theoretical optimum given by the minimum-AS-hop strategy. The other three policies achieve similar AS hop counts. All of these policies achieve a slightly lower hop count in the unlimited seeding time case than in the 10-minute seeding time case. This may be because peers are less likely to download from close-by peers in the 10-minute seeding case than in the unlimited seeding case.
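A weighted average hop count for one peer can be computed as sketched below. This passage does not restate how the paths are weighted, so weighting each path by the number of bytes downloaded over it is an assumption, as are the function name `weighted_hop_count` and the sample figures.

```python
from typing import Iterable, Tuple


def weighted_hop_count(paths: Iterable[Tuple[int, int]]) -> float:
    """Average AS hop count over a peer's download paths.

    `paths` holds (as_hops, bytes_downloaded) pairs, one per source peer;
    each path is weighted by the bytes it carried (assumed weighting).
    """
    paths = list(paths)
    total_bytes = sum(nbytes for _, nbytes in paths)
    if total_bytes == 0:
        raise ValueError("peer downloaded nothing")
    return sum(hops * nbytes for hops, nbytes in paths) / total_bytes


# Hypothetical peer: most data comes from inside its own AS (0 hops).
print(weighted_hop_count([(0, 600), (2, 300), (4, 100)]))
```

With this weighting, a peer that fetches most of its data from nearby sources scores low even if it keeps a few distant connections.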

[Fig. 6 plots: Weighted Hop Count versus Peer Index for Standard BT, Choker Locality, Picker Locality, Tracker Locality, and Optimal Locality; panel (a) Maximum Seeding Time = 10 minutes, panel (b) Unlimited Seeding Time.]

Fig. 6. AS Hop Count (Downloading Scenario)

Fig. 7 shows the weighted average hop count across all downloading paths of a peer in the streaming scenario. Again, the tracker locality policy achieves the lowest AS hop count apart from the theoretical optimum. Among the other three policies, the choker locality policy achieves a smaller AS hop count. As before, the hop counts of all of these policies are about 1 hop larger in the 10-minute seeding time case than in the unlimited seeding time case.

2) Redundancy: Figs. 8 and 9 show the normalized redundancy achieved in the fewer than 80 ISPs that host the PlanetLab peers in our experiment. The majority of these ISPs achieve the minimum value under all solutions, because each of them hosts only one peer. For ISPs hosting multiple peers, all solutions perform within 2 to 3 times the optimal value achieved by the minimum-AS-hop strategy.

[Fig. 7 plots: Weighted Hop Count versus Peer Index for Standard BT, Choker Locality, Picker Locality, Tracker Locality, and Optimal Locality; panel (a) Maximum Seeding Time = 10 minutes, panel (b) Unlimited Seeding Time.]

Fig. 7. AS Hop Count (Streaming Scenario)

[Fig. 8 plots: Redundancy versus AS Index for Standard BT, Choker Locality, Picker Locality, Tracker Locality, and Optimal Locality; panel (a) Maximum Seeding Time = 10 minutes, panel (b) Unlimited Seeding Time.]

Fig. 8. Normalized Redundancy (Downloading Scenario)

[Fig. 9 plots: Redundancy versus AS Index for Standard BT, Choker Locality, Picker Locality, Tracker Locality, and Optimal Locality; panel (a) Maximum Seeding Time = 10 minutes, panel (b) Unlimited Seeding Time.]

Fig. 9. Normalized Redundancy (Streaming Scenario)

D. Financial Impact on ISPs

The minimum-AS-hop strategy involves fewer than 150 ISPs, while the other solutions involve between 200 and 250 ISPs. Most of these ISPs merely pass traffic through, so their overall financial gain/cost is zero. We also notice that the number of ISPs paying money is larger than the number of ISPs gaining money. The distribution of financial gain/cost is very uneven. Some ISPs pay a lot, such as AS 680 (DFN-IP service G-WiN), which pays for more than 1.7 GB of data. Others gain a lot, such as AS 20965 (The GEANT IP Service), which gains more than 1.3 GB of data, and AS 11537 (ABILENE), which gains more than 1.8 GB of data.

E. Peer Contributions

1) Traffic Uploaded per Peer: Fig. 10 shows the amount of traffic uploaded by each peer. Under the minimum-AS-hop strategy, only a small number of peers (fewer than 70) contribute. Among the other policies, standard BitTorrent yields the most even traffic distribution, while the tracker locality policy yields the most uneven one, which may again come from the fact that tracker locality separates peers into localized sets. In summary, locality policies localize the traffic at the price of an uneven traffic distribution.

2) Number of Downloading Neighbors per Peer: Under the optimal policy, every peer always downloads from a single neighbor. Fig. 11 shows the number of peers from which each peer downloads data under the other policies. Some peers download all data from one peer, while others download from up to 80 peers. Most peers download data from about 20 to 30 peers. Peers running standard BitTorrent download from more peers than those running the other policies. We can see that the more neighbors a peer downloads from, the more even the traffic distribution.

[Fig. 10 plots: Data Contributed by One Peer (bytes) versus Peer Index for Standard BT, Choker Locality, Picker Locality, Tracker Locality, and Optimal Locality; panel (a) Maximum Seeding Time = 10 minutes, panel (b) Unlimited Seeding Time.]

Fig. 10. Traffic Uploaded per Peer (Downloading Scenario)

[Fig. 11 plots: Number of Nodes Downloading Data From versus Peer Index for Standard BT, Choker Locality, Picker Locality, and Tracker Locality; panel (a) Maximum Seeding Time = 10 minutes, panel (b) Unlimited Seeding Time.]

Fig. 11. Number of Downloading Neighbors per Peer (Downloading Scenario)

F. Findings

In the downloading scenario, the choker and picker locality policies can significantly reduce the downloading time, while the tracker locality policy achieves a downloading time similar to that of standard BitTorrent. The tracker locality policy achieves the lowest AS hop count, but suffers the most unbalanced peer load in terms of the number of downloading neighbors and the traffic uploaded per peer. So in the downloading scenario, if a shorter downloading time is of high priority, we should use the choker or picker locality policy. If less inter-AS traffic is of high priority, we should use the tracker locality policy.

In the streaming scenario, standard BitTorrent achieves disruption similar to that of the choker and picker locality policies and less disruption than the tracker locality policy, but it comes with a much larger startup delay. As in the downloading scenario, the tracker locality policy achieves the lowest AS hop count. So in the streaming scenario, if less playback disruption is of high priority, we should choose the choker or picker locality policy. If less inter-AS traffic is of high priority, we should use the tracker locality policy.

V. CONCLUSION

In this paper, we propose a set of locality-aware P2P solutions. In particular, we propose an optimal solution which returns a distribution structure with the minimum AS hop count. We also modify the BitTorrent system to embed locality-awareness into its neighbor selection, peer choking/unchoking, and piece selection processes. We evaluate the performance of these solutions, as well as their impact on ISPs, on a real-world Internet AS topology derived from the PlanetLab testbed. While the results clearly show the advantage of locality-aware solutions in reducing inter-AS traffic and achieving shorter downloading time, they also reveal that these solutions are deficient at distributing peer workload evenly, something the traditional random strategy employed by BitTorrent does well. We also find that continuing to seed beyond 10 minutes cannot improve the downloading time of standard BitTorrent, but can significantly improve it for the locality policies. It can also reduce the number of connected peers for the choker and piece picker locality policies. As such, our study suggests the necessity of considering, in the design of future P2P downloading and streaming solutions, the tradeoff between the goals of optimizing AS-related performance and achieving fairness among peers, such as intra-AS traffic and peer burden fairness.

REFERENCES

[1] “BitTorrent,” http://bittorrent.com.

[2] “Skype,” http://skype.com.

[3] “PPLive,” http://pplive.com.

[4] “UUSEE,” http://uusee.com.

[5] T. Karagiannis, P. Rodriguez, and K. Papagiannaki, “Should Internet service providers fear peer-assisted content distribution,” in Internet Measurement Conference, 2005.

[6] S. Ren, L. Guo, and X. Zhang, “ASAP: an AS-Aware Peer-Relay Protocol for High Quality VoIP,” in ICDCS, 2006.

[7] C. Huang, J. Li, and K. W. Ross, “Can internet video-on-demand be profitable?,” in ACM SIGCOMM, 2007.

[8] R. Bindal, P. Cao, W. Chan, J. Medved, G. Suwala, T. Bates, and A. Zhang, “Improving Traffic Locality in BitTorrent via Biased Neighbor Selection,” in ICDCS, 2006.

[9] “Planetlab,” http://www.planet-lab.org.

[10] D. Qiu and R. Srikant, “Modeling and performance analysis of bittorrent-like peer-to-peer networks,” in SIGCOMM ’04: Proceedings of the 2004 conference on Applications, technologies, architectures, and protocols for computer communications, New York, NY, USA, 2004, pp. 367–378, ACM Press.

[11] X. Yang and G. Veciana, “Service capacity of peer to peer networks,” in Proc. of INFOCOM, 2004.

[12] L. Massoulié and M. Vojnovic, “Coupon replication systems,” in SIGMETRICS ’05: Proceedings of the 2005 ACM SIGMETRICS international conference on Measurement and modeling of computer systems, New York, NY, USA, June 2005, vol. 33, pp. 2–13, ACM Press.

[13] A. Bharambe, C. Herley, and V. Padmanabhan, “Some Observations on BitTorrent Performance,” 2005.

[14] L. Guo, S. Chen, Z. Xiao, E. Tan, X. Ding, and X. Zhang, “Measurements, Analysis, and Modeling of BitTorrent-like Systems,” 2005.

[15] Y. Chu, R. Rao, and H. Zhang, “A case for end system multicast,” in Proc. of ACM SIGMETRICS, 2000.

[16] D. Kostic, A. Rodriguez, J. Albrecht, and A. Vahdat, “Bullet: High bandwidth data dissemination using an overlay mesh,” in Proc. of ACM Symposium on Operating Systems Principles (SOSP), 2003.

[17] H. Xie, Y. R. Yang, A. Krishnamurthy, Y. Liu, and A. Silberschatz, “P4P: Provider Portal for Applications,” in Proc. of SIGCOMM, 2008.

[18] D. R. Choffnes and F. E. Bustamante, “Taming the Torrent: A Practical Approach to Reducing Cross-ISP Traffic in P2P Systems,” in Proc. of SIGCOMM, 2008.

[19] M. Castro, P. Druschel, A. M. Kermarrec, A. Nandi, A. Rowstron, and A. Singh, “Splitstream: High-bandwidth multicast in cooperative environments,” in Proc. of ACM Symposium on Operating Systems Principles (SOSP), October 2003.

[20] Y. Cui, B. Li, and K. Nahrstedt, “oStream: Asynchronous Streaming Multicast in Application-Layer Overlay Networks,” IEEE Journal on Selected Areas of Communications, Special Issue on Recent Advances in Service Overlay Networks, vol. 22, 2004.

[21] S. Jin and A. Bestavros, “Cache-and-Relay Streaming Media Delivery for Asynchronous Clients,” in Proc. of International Workshop on Networked Group Communication (NGC), 2002.

[22] V. N. Padmanabhan, H. J. Wang, P. A. Chou, and K. Sripanidkulchai, “Distributing streaming media content using cooperative networking,” in NOSSDAV ’02: Proceedings of the 12th international workshop on Network and operating systems support for digital audio and video, New York, NY, USA, 2002, pp. 177–186, ACM Press.

[23] J. Li, Y. Cui, and B. Chang, “PeerStreaming: Design and Implementation of an On-Demand Distributed Streaming System with DRM Capabilities,” ACM/Springer Multimedia Systems Journal, 2007.

[24] C. Wu and B. Li, “rStream: Resilient and Optimal Peer-to-Peer Streaming with Rateless Codes,” IEEE Transactions on Parallel and Distributed Systems, 2007.

[25] C. Gkantsidis, J. Miller, and P. Rodriguez, “Anatomy of a P2P Content Distribution System with Network Coding,” in IPTPS, 2006.

[26] X. Zhang, J. Liu, B. Li, and Y. S. P. Yum, “Coolstreaming/donet: a data-driven overlay network for peer-to-peer live media streaming,” in INFOCOM 2005: 24th Annual Joint Conference of the IEEE Computer and Communications Societies, 2005, vol. 3, pp. 2102–2111.

[27] C. Dana, D. Li, D. Harrison, and C. N. Chuah, “BASS: BitTorrent Assisted Streaming System for Video-on-Demand,” in IEEE MMSP, 2005.

[28] A. Vlavianos, M. Iliofotou, and M. Faloutsos, “BiToS: Enhancing BitTorrent for Supporting Streaming Applications,” in IEEE INFOCOM, 2006.

[29] Y. Choe, D. Schuff, J. Dyaberi, and V. Pai, “Improving VoD Server Efficiency with BitTorrent,” in ACM Multimedia, 2007.

[30] “BitTorrent DNA,” http://www.bittorrent.com/dna/.

[31] L. Gao, “On inferring autonomous system relationships in the internet,” IEEE/ACM Transactions on Networking, 2001.

[32] N. Spring, R. Mahajan, D. Wetherall, and T. Anderson, “Measuring ISP topologies with rocketfuel,” IEEE/ACM Transactions on Networking, 2004.

[33] X. Dimitropoulos, D. Krioukov, M. Fomenkov, B. Huffaker, Y. Hyun, k. claffy, and G. Riley, “AS relationships: inference and validation,” in ACM SIGCOMM, 2007.

[34] “Cymru,” http://www.cymru.com.

[35] “Caida,” http://www.caida.org.

[36] “IP2Location,” http://www.ip2location.com.

Peer-to-Peer File Sharing Architecture for Software-defined TWDM-PON

Article

Full-text available

Jan 2020

Peer-to-peer (P2P) file sharing application is considered to be a global bandwidth consumer. By localizing the traffic, the internet service provider (ISP) can reduce the bandwidth burden in the feeder fiber, saving time and bandwidth. The passive optical network (PON) is regarded as one of the best future access network technologies that can provide a better environment for P2P applications. In this paper, we propose a P2P intra-traffic file sharing architecture in TWDM-PON with software-defined network (SDN) system to reduce inter- and intra- traffic in PON and ISPs, and improve the quality of service (QoS). In addition, the proposed scheme employs colorless optical network units (ONUs) by dynamically assigning wavelength for transmission to simplify the network operation, reduce installation cost, and enable easier maintenance. Moreover, we implement an integrated SDN with OpenFlow protocol to separate the control plane and data plane, enabling flexible and centralized control of the P2P intra-traffic by the ISP. Simulation results demonstrate that our proposed P2P-DWBA can realize improvements up to 21% in the QoS in terms of packet delay, 23.9% in the jitter, 13% in the throughput, and reduce traffic dropping up to 58% in scenario 6 (5:40:44:11) for the 1.5 ms cycle time.

BitTorrent locality-awareness application with colorless ONUs in an enhanced EPON system

Article

Full-text available

Jul 2019

BitTorrent (BT) file sharing is the most popular peer-to-peer (P2P) application and it accounts for a large proportion of the bandwidth utilized worldwide. However, BT ignores the traffic costs for Internet service providers (ISPs) and a large amount of cross-ISP traffic is generated. Thus, ISPs often throttle BT traffic to control costs. Ethernet passive optical network (EPON) is an important primitive technology for the future-oriented next generation access network which aims to provide a suitable environment for P2P services. In this study, we designed an optical line terminal (OLT) with a tracking-server in order to implement a fully functioning BT system that can handle the local BT traffic. We propose the use of a passive splitter redirection capability (PSR) between the OLT and colorless optical network units (ONUs) to support intra-traffic redirection communication in order to achieve traffic localization among ONUs in the enhanced EPON. Our simulation results showed that the proposed architecture can improve the quality of service (QoS) in terms of the mean packet delay, system throughput, and packet loss.

Performance Analysis of P2P Community Models using PoPs

Article

Full-text available

Jul 2018

Narayanan Madeshan

Peer-to-peer (P2P) computing or networking is a distributed application architecture that partitions tasks or workloads between peers. Peers are equally privileged, equipotent participants in the application. In this paper, we analyze the performance for peer-to-peer (P2P) community in the traffic models. Commonly Peer-to-Peer systems choose and pick their own nodes in some confident Point Of Presence (PoP) with lesser quantity of moment or time it requires to traverse a message to a particular system. , a "friction factor" has been initiated in the particular model (form). Furthermore, work includes studies of an extensive series of Peer-to-Peer society. Then the Peer-to-Peer community traffic models can search for more improvements and later the outcome of this will be in highly accurate.

QoS-awareness peer coordination control for topology-converging P2P live streaming

Article

Full-text available

Nov 2017
MULTIMED TOOLS APPL

For Peer to Peer (P2P) live streaming media, the peer coordination control mechanism becomes complex under dynamic traffic fluctuation, different node distance and capacity as well as user dynamics. Due to the lack of the adjusting ability, the existing peer coordination control mechanisms perform worse, and have large overhead. In order to overcome these problems, a QoS-awareness peer coordination control mechanism is proposed. Specifically, firstly, a ring buffer mechanism is introduced to implement the cyclic coordination, and the algorithms of both cycle length and cycle time are suggested to implement the constant startup delay and smooth play. Secondly, the selection algorithms of both the content-likeness candidate and the shortest-distance peer are proposed to implement the QoS-awareness streaming synchronization. Thirdly, the peer task assignment algorithm is proposed to implement the variable trunk and similar Cycle Task Fulfillment Time (CTFT). Next, a self-adaptive adjusting mechanism is introduced to converge P2P topology and fight against the delivery abnormality. Finally, simulation is conducted to corroborate our studies and the results show that its performance outperforms the existing mechanisms.

Socially-aware distributed hash tables for decentralized online social networks

Conference Paper

Full-text available

Sep 2015

An efficient hybrid push-pull methodology for peer-to-peer video live streaming system on mobile broadcasting social media

Article

Full-text available

Jan 2017
MULTIMED TOOLS APPL

With the rapid growth of wireless communication technology, the availability of highly flexible and video-friendly mobile terminal platforms (such as smartphones and tablets), the emergence of major video content providers (like YouTube, Ustream, and PPTV, which provide a large catalog of attractive contents), Peer-to-Peer (P2P) live video streaming over the wireless and Internet is becoming more and more attractive to users. One of the main challenges is to provide a good quality of service though the dynamic behavior of the network. Traditionally, tree-based model uses a push method, that broadcaster transfers data to other users. This model has low start-up delay. However, there are two main problems in this method: if the bandwidth of an internal node is low, children nodes may lose data and when an internal node failure, other nodes can’t receive data until completing the recovery of the tree. On the other hand, mesh-based model uses a pull method, has low bandwidth of a neighbor node by pulling necessary data from a number of neighbor nodes. However, mesh-based model requires large buffers to support pull data from neighbor peers and there is an adjustment between minimum delay by sending pull request and overhead of whole system. So, both models have their own strengths and weaknesses. This paper proposes a new hybrid push-pull live P2P video streaming protocol called MobileCast that combines the benefits of pull and push mechanisms for live video delivery. We present new topology for P2P network with more stable and provide better video streaming quality. Our main goal is to minimize the network end-to-end delay, startup time, overhead, packet loss compared to the pure mesh networks, pure tree networks and provide a good quality of service though the dynamic behavior of the network.

Dynamic Bandwidth Adjustment for Instant Replay of Live Streams on BitTorrent Networks

Conference Paper

Sep 2019

In general, people watching a live streaming program sit and wait for the broadcast from the very beginning. However, some of them cannot do so and inevitably join in the middle of the live program because of unexpected events. This situation results in an incomplete viewing experience: viewers do not enjoy the full entertainment value of the live program and are therefore dissatisfied with the quality of service. In this paper, we propose Dynamic Bandwidth Adjustment for Instant Replay of live streams on BitTorrent networks (DBAIR), in which, in addition to downloading the live content, a peer allocates its surplus bandwidth to pre-downloading broadcast highlights, giving viewers the ability to watch instant replays of missed live content. In such a system, the length or popularity of a highlight can be used to determine how to allocate the surplus bandwidth needed to download highlights. Once a user decides to watch a certain highlight, if the allocated bandwidth and the pre-downloaded video chunks do not guarantee smooth playback, bandwidth used to download other highlights can be sequentially borrowed. The experimental results show that with the aid of DBAIR, the time needed to completely download a highlight can be significantly shortened, resulting in better quality of experience.

Improved unchoking policy for BitTorrent

Article

Nov 2014

Peer-to-peer (P2P) communication has become very popular. This popularity has led to an enormous increase in P2P traffic, which has created many Internet traffic management problems for ISPs. Cross-network traffic is a major concern for ISPs; it arises from P2P file-sharing systems because they are blind to the physical topology. Various solutions have been suggested so far, but all are implemented on the client peer side. This causes a load-balancing problem in the overlay, where some peers receive a huge number of requests while others receive few. We therefore suggest a peer selection algorithm that is implemented on the serving peers and takes both upload rate and hop distance into account when deciding which peer to unchoke. Our dynamic peer selection algorithm tries to maintain low free riding and low cross-network traffic by unchoking local peers with high upload rate capability, and it also acts as a suitable solution to the load-balancing problem.

Alleviating the Topology Mismatch Problem in Distributed Overlay Networks: A Survey

Article

Dec 2015
J SYST SOFTWARE

Peer-to-peer (P2P) systems have enjoyed immense attention and have been widely deployed on the Internet for well over a decade. They are often implemented via an overlay network abstraction atop the Internet’s best-effort IP infrastructure. P2P systems support a plethora of desirable features to distributed applications including anonymity, high availability, robustness, load balancing, quality of service and scalability to name just a few. Unfortunately, inherent weaknesses of early deployments of P2P systems, prevented applications from leveraging the full potential of the paradigm. One major weakness, identified early on, is the topology mismatch problem between the overlay network and the underlying IP topology. This mismatch can impose an extraordinary amount of unnecessary stress on network resources and can adversely affect both the scalability and efficiency of the operating applications. In this paper, we survey over a decade’s worth of research efforts aimed at alleviating the topology mismatch problem in both structured and unstructured P2P systems. We provide a fine-grained categorization of the suggested solutions by discussing their novelty, advantages and weaknesses. Finally, we offer an analysis as well as pictorial comparisons of the reviewed approaches since we aim to offer a comprehensive reference for developers, system architects and researchers in the field.

Congestion Distance Based BitTorrent-Like P2P Traffic Optimization

Chapter

Nov 2013

The large volume of P2P traffic has a great impact on the bearer network, so how to utilize network bandwidth efficiently has become a common problem for Internet Service Providers (ISPs) and end users. A P2P traffic optimization model is derived from the aims of ISPs and users. Minimizing the congestion distance is chosen as the main objective, which not only represents the requirements of both sides but can also be easily implemented in a distributed manner. According to the optimization model, a BitTorrent-like P2P system is improved. Distance-aware peer selection and secondary-sorting choking/unchoking algorithms are proposed. The experimental results show that this optimization can reduce inter-autonomous-system (AS) traffic effectively and respond to network congestion automatically.

Anatomy of a P2P Content Distribution System with Network Coding

Article

Jan 2006

In this paper we present our experiences with a P2P content distribution system that uses Network Coding. Using results from live trials, we are able to present a detailed performance analysis of such P2P system from a variety of novel perspectives. We show that Network Coding incurs little overhead, both in terms of CPU processing and I/O activity, and it results in smooth and fast downloads. To ensure secure transfers, we describe a novel scheme that verifies encoded blocks on-the-fly and analyze its performance. We also study the effect of peers behind NATs and firewalls and show the robustness of the system to large number of unreachable peers.

AS Relationships: Inference and Validation

Article

May 2006
COMPUT COMMUN REV

Research on performance, robustness, and evolution of the global Internet is fundamentally handicapped without accurate and thorough knowledge of the nature and structure of the contractual relationships between Autonomous Systems (ASs). In this work we introduce novel heuristics for inferring AS relationships. Our heuristics improve upon previous works in several technical aspects, which we outline in detail and demonstrate with several examples. Seeking to increase the value and reliability of our inference results, we then focus on validation of inferred AS relationships. We perform a survey with ASs' network administrators to collect information on the actual connectivity and policies of the surveyed ASs. Based on the survey results, we find that our new AS relationship inference techniques achieve high levels of accuracy: we correctly infer 96.5% customer to provider (c2p), 82.8% peer to peer (p2p), and 90.3% sibling to sibling (s2s) relationships. We then cross-compare the reported AS connectivity with the AS connectivity data contained in BGP tables. We find that BGP tables miss up to 86.2% of the true adjacencies of the surveyed ASs. The majority of the missing links are of the p2p type, which highlights the limitations of present measuring techniques to capture links of this type. Finally, to make our results easily accessible and practically useful for the community, we open an AS relationship repository where we archive, on a weekly basis, and make publicly available the complete Internet AS-level topology annotated with AS relationship information for every pair of AS neighbors.

BiToS: Enhancing BitTorrent for Supporting Streaming Applications

Conference Paper

May 2006

In recent years, BitTorrent (BT) has been one of the most effective mechanisms for P2P content distribution. Although BT was created for distribution of time-insensitive content, in this work we try to identify the minimal changes needed in BT's mechanisms in order to support streaming. The importance of this capability is that the peer will now have the ability to start enjoying the video before the complete download of the video file. This ability is particularly important in highly polluted environments, since the peer can evaluate the quality of the video content early and thus preserve its valuable resources. In a nutshell, our approach gives higher download priority to pieces that are close to being played back by the player. This contrasts with the original BT protocol, where pieces are downloaded in an out-of-order manner based solely on their rareness. In particular, our approach tries to strike a balance between downloading pieces in: (a) playing order, enabling smooth playback, and (b) rarest-first order, enabling parallel downloading of pieces. In this work, we introduce three different Piece Selection mechanisms and we evaluate them through simulations based on how well they deliver streaming services to the peers.

Bullet: high bandwidth data dissemination using an overlay mesh

Conference Paper

Jan 2003

BiToS: Enhancing BitTorrent for supporting streaming applications

Article

Jan 2006

BASS: BitTorrent assisted streaming system for video-on-demand

Article

Jan 2005

Bullet

Article

Dec 2003

In recent years, overlay networks have become an effective alternative to IP multicast for efficient point to multipoint communication across the Internet. Typically, nodes self-organize with the goal of forming an efficient overlay tree, one that meets performance targets without placing undue burden on the underlying network. In this paper, we target high-bandwidth data distribution from a single source to a large number of receivers. Applications include large-file transfers and real-time multimedia streaming. For these applications, we argue that an overlay mesh, rather than a tree, can deliver fundamentally higher bandwidth and reliability relative to typical tree structures. This paper presents Bullet, a scalable and distributed algorithm that enables nodes spread across the Internet to self-organize into a high bandwidth overlay mesh. We construct Bullet around the insight that data should be distributed in a disjoint manner to strategic points in the network. Individual Bullet receivers are then responsible for locating and retrieving the data from multiple points in parallel. Key contributions of this work include: i) an algorithm that sends data to different points in the overlay such that any data object is equally likely to appear at any node, ii) a scalable and decentralized algorithm that allows nodes to locate and recover missing data items, and iii) a complete implementation and evaluation of Bullet running across the Internet and in a large-scale emulation environment reveals up to a factor two bandwidth improvements under a variety of circumstances. In addition, we find that, relative to tree-based solutions, Bullet reduces the need to perform expensive bandwidth probing. In a tree, it is critical that a node's parent delivers a high rate of application data to each child. In Bullet however, nodes simultaneously receive data from multiple sources in parallel, making it less important to locate any single source capable of sustaining a high transmission rate.

PeerStreaming: Design and Implementation of an On-Demand Distributed Streaming System with DRM Capabilities

Article

Jan 2007

A Case for End System Multicast

Article

Jan 1999

The conventional wisdom has been that IP is the natural protocol layer for implementing multicast related functionality. However, ten years after its initial proposal, IP Multicast is still plagued with concerns pertaining to scalability, network management, deployment and support for higher layer functionality such as error, flow and congestion control. In this paper, we explore an alternative architecture for small and sparse groups, where end systems implement all multicast related functionality including membership management and packet replication. We call such a scheme End System Multicast. This shifting of multicast support from routers to end systems has the potential to address most problems associated with IP Multicast. However, the key concern is the performance penalty associated with such a model. In particular, End System Multicast introduces duplicate packets on physical links and incurs larger end-to-end delay than IP Multicast. In this paper, we study this question in the context of the Narada protocol. In Narada, end systems self-organize into an overlay structure using a fully distributed protocol. In addition, Narada attempts to optimize the efficiency of the overlay based on end-to-end measurements. We present details of Narada and evaluate it using both simulation and Internet experiments. Preliminary results are encouraging. In most simulations and Internet experiments, the delay and bandwidth penalty are low. We believe the potential benefits of repartitioning multicast functionality between end systems and routers significantly outweigh the performance penalty incurred.

CoolStreaming/DONet: A data-driven overlay network for live media streaming

Article

Jan 2005
