Network Journal 1
Network Journal 1
Network Journal 1
ISSN 1796-2056
Volume 6, Number 11, November 2011
Contents
REGULAR PAPERS
Mobility Impact on Session Survivability under the SHIM6 Protocol and Enhancement of its
Rehoming Procedure
Amine Dhraief and Abdelfettah Belghith
Development of Anonymous Networks Based on Symmetric Key Encryptions
Hazim Haddad, Shinsuke Tamura, Shuji Taniguchi, and Tatsuro Yanase
Development of a Ubiquitous Industrial Data Acquisition System for Rotogravure Printing Press
Yuhuang Zheng
An Application of the Modification of Slow Start Algorithm in Campus Network
Guo-hong Gao, Wen-xian Xiao, Zhen Liu, and Wen-long Wan
Cryptanalysis and Improvement of Selvi et al.s Identity-Based Threshold Signcryption Scheme
Wei Yuan, Liang Hu, Hongtu Li, Jianfeng Chu, and Yuyu Sun
An Independent Set Packet Classification Algorithm Using Priority Sorting
Hui-Gui Rong and Hao Chen
Enabling Awareness Driven Differentiated Data Service in IOT
Haoming Guo, Shilong Ma, and Feng Liang
Enhancement of an Authenticated 3-round Identity-Based Group Key Agreement Protocol
Wei Yuan, Liang Hu, Hongtu Li, Jianfeng Chu, and Yuyu Sun
A Task Scheduling Strategy in Heterogeneous Multi-sinks Wireless Sensor Networks
Liang Dai, Hongke Xu, and Ting Chen
Visual Important-Driven Interactive Rendering of 3D Geometry Model over Lossy WLAN
Bailin Yang, Zhiyong Zhang, and Xun Wang
Secure Identity-based Threshold Broadcast Encryption in the Standard Model
Leyou Zhang, Qing Wu, and Yupu Hu
A Power Allocation Algorithm Based on Cooperative Game Theory in Multi-cell OFDM Systems
Ping Wang, Jing Han, Fuqiang Liu, Yang Liu, and Jing Xu
Expectation Value Calculation of Grid QoS Parameters Based on Algorithm Prim
Kaijian Liang, Linfeng Bai, and Xilong Qu
Web Page Classification using an Ensemble of Support Vector Machine Classifiers
Shaobo Zhong and Dongsheng Zou
1521
1533
1543
1549
1557
1565
1572
1578
1586
1594
1602
1610
1618
1625
Integration of Unascertained Method with Neural Networks and Its Application
Huawang Shi
Researches on Grid Security Authentication Algorithm in Cloud Computing
Keshou Wu, Lizhao Liu, Jian Liu, Weifeng Li, Gang Xie, Xiaona Tong, and Yun Lin
Non-line-of-sight Error Mitigation in Wireless Communication Systems
Chien-Sheng Chen, Yi-Jen Chiu, Ho-Nien Shou, and Ching-Lung Chi
1631
1639
1647
Mobility impact on session survivability under
the SHIM6 protocol and enhancement of its
rehoming procedure
Amine Dhraief Abdelfettah Belghith
HANA Research Group, Manouba University, Tunisia
Email: [email protected], [email protected]
AbstractMultihoming is a solution that enables a fault-
tolerant access to the Internet by conguring on each
network entity several IP addresses associated with distinct
ISPs. IPv6 natively allows end-hosts and end-sites to be
multihomed where nodes and routers can have multiple IP
addresses. However, a specic support is required to take full
advantage of multihoming. The SHIM6 protocol provides
such a support.
We study in this paper to what extent the mobility
impacts the SHIM6 protocol component in general and
more specically the context establishment as it is a sine
qua none condition for session survivability. We focus on
possible consequences of mobility before, during, and after
the context establishment. We nd that in some mobility
scenarios, the SHIM6 context is never established and the
session survivability cannot be ensured.
Index TermsMultihoming, Mobility, SHIM6, Testbed
I. INTRODUCTION
Providing a redundant and reliable access to the net-
work is a major concern for protocol designers [1]. A
solution that enables a fault-tolerant access to the Internet
consists in conguring on each network entity several
IP addresses associated with distinct Internet Service
Providers (ISP). A study conducted by Agrawal et al. [2]
revealed that at least 60% of Internet stub autonomous
systems (AS) are multihomed to two or more ISP.
Multihoming protocols provide an indispensable sup-
port to take full advantage of multihoming and a frame-
work for multiple addresses management [3]. With the
forthcoming version of the IP network, IPv6, lots of
efforts have been made to enable multihoming benets,
such as reliability, session survivability and load sharing.
During the last few years, more than 40 solutions have
been proposed for IPv6 multihoming [4]. The majority of
these solutions have their own mechanisms to preserve
established sessions after a failure.
On the other hand mobility protocols, a family of
protocols which provides a support for host mobility, also
aim at preserving nodes sessions while moving. Mobility
and multihoming are usually studied separately. Mobility
protocols do not consider the case of multihomed mobile
node, while multihomed protocols do not take into ac-
count mobility. Nonetheless, in the Internet, nodes are at
the same time mobile and multihomed. Nodes are mobile
as they are able to change the access network while having
running session; they are multihomed as they are equipped
with several interfaces (such as Wi-Fi or Wimax)
In this paper, we study the impact of the mobility on
multihoming protocols. For this purpose, we focus on a
particular multihoming protocol - the SHIM6 protocol.
One of the most important aspects of the SHIM6 protocol
is its context establishment, as it is a sine qua none
condition for session survivability. Without an established
SHIM6 context, communicating peers cannot rehome
their communications in case of failures. Hence, we focus
more precisely on possible consequences of mobility
before, during and after the context establishment. We
nd that in some mobility scenarios, the SHIM6 context
is never established and the session survivability cannot
be ensured. Furthermore, the rehoming procedure is a
key feature of any multihoming protocol. Rehoming a
communication implies a change in the used IP address
and may result in the change of the upstream ISP. As
SHIM6 is designed to be deployed on static nodes, its
default rehoming decision strategy does not meet the
requirement of mobility. We present in this paper some
optimizations in order to improve the SHIM6 rehoming
latency in a mobile environment. We demonstrate, through
measurements on a real testbed, that these optimizations
improves signicantly the rehoming latency.
The remainder of this paper is structured as follows.
Section II highlights multihoming motivations, function-
alities and constraints. Section III starts with an overview
of the IPv6 multihoming approaches and then focuses on
the SHIM6 protocol. Section IV analyzes the impact of
node mobility on the SHIM6 context and evaluates it in
an experimental testbed. Section V concludes this paper.
II. BACKGROUND
A. Denitions
A node identier refers to a constant that uniquely
identies a node in a given network [5]. A locator is the
topological reference of an interface in a network. The
later is used by routing protocols to locate any entity in
a network.
Mobility is dened as a change in the node locator.
In a mobile environment, a node that changes its current
locator usually preforms successively two steps: a layer
2 (L2) handover then a layer 3 (L3) handover. The
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1521
2011 ACADEMY PUBLISHER
doi:10.4304/jnw.6.11.1521-1532
L2 handover consists in a change from the link-layer
connectivity; whereas, the L3 handover refers to a change
from the access network and the acquisition of a new
locator [6].
A node is said multihomed when it simultaneously has
several available locators. A node may acquire several
locators in various congurations: it can be connected to
a link where multiple prexes are advertised or it can
be equipped with several interfaces attached to different
access networks.
B. Multihoming motivations
Multihoming presents several motivations such as sus-
tained redundancy against failure, improved performance,
allowing load sharing and permitting policing [7].
1) Redundancy: The most important motivation of
getting attached to several upstream providers is to protect
end-sites as well as end-hosts from failures. Abley et al.
[8] detailed the potential causes of failures in networks.
The most common and important ones are : physical
failures, routing protocol failures and ISP failures. Phys-
ical failures refer to outages that may affect network
components (e.g., routers) or network connections (e.g.,
ber cuts). Routing protocol failures are due to misbe-
having routing protocols (e.g., withdraw valid routes or
announce unstable routes ). ISP failures are outages that
affect the Internet providers leading to the interruption of
the Internet connectivity. By providing redundant paths,
multihoming alleviates these failures. If an outage affects
one of the available paths, multihoming protocols detect
this failure and rehome running sessions onto another
working path.
2) Performance: Akella et al. [9] quantied to what ex-
tent multihoming improves network performance in terms
of delay, available bandwidth and reliability. They showed
that a multihomed site connected to two ISPs acquires a
25% improvement in its average performance, and that
getting connected to more than 4 providers yields a little
further improvement. In [10], Launois et al. showed that
multihoming increases the number of concurrent available
paths and that lower delays are found among new paths.
Hence, multihoming may improve network performance
in terms of delay, available bandwidth and resilience
against failure.
3) Load sharing: Load balancing refers to the situation
where a site spreads its trafc among its available links
towards a given destination; whereas, load sharing refers
to the case where no destination is specied. Load balanc-
ing is then a particular case of load sharing. Load sharing
allows end-sites to increase their aggregate throughput
and thus improve their performance. For example, as the
broadband access price is constantly dropping, small and
mid-sized corporate can emulate a T1 link by subscribing
to several broadband accesses and distribute their trafc
among the different connections. Hence, multihomed sites
can use their available connections simultaneously by dis-
tributing both incoming and outgoing trafcs among their
available paths and thus performing load sharing [11].
4) Policy: By being multihomed, a site would like to
distribute its trafc according to some policies. Policies
are the rules that dene the trafc to be forwarded to a
given provider. For example, a corporate may subscribe
to two providers, one for its e-commerce transactions and
the other for its personal Internet usage.
C. Multihoming functionalities
In order to satisfy the aforementioned incentives and
motivations, some fundamental functionalities should be
provided by multihoming protocols.
1) Decoupling node identication from its localization:
TCP/IP has been formally designed to allocate a single
IP address per device. The role of an IP address was two-
fold: it locates an end-host in the network and it identies
the end-host running sessions [12], [13]. In the current
Internet, nodes tend to be mobile and multihomed, most of
the time they are equipped with multiple interfaces. They
have several addresses so they require a more exible
interaction with their address sets. From a session point
of view, their identity needs to be independent from their
physical administrative domain [14]. Multihomed nodes
have several IP addresses and consequently are located
in several networks; whereas, they should have a single
identity. Therefore, the multihoming paradigm require the
decoupling of node identity from its location.
2) Maintaining the set of addresses up-to-date: The
major motivation of multihoming is to have redundant
accesses to the Internet in order survive failures. Failures
that might affect Internet paths and ISP renumbering op-
erations are events that alter the multihomed node address
sets. Multihoming solutions must provide mechanisms for
failure detection and failure recovery.
Abley et al. [8] detailed the potential causes of failures
in networks. The most common and important ones are
: physical failures, routing protocol failures and ISP fail-
ures. Physical failures refer to outages that may affect net-
work components (e.g., routers) or network connections
(e.g., ber cuts). Routing protocol failures are failures due
to misbehaving routing protocols (e.g., withdrawing valid
routes or announcing unstable routes) [15]. ISP failures
are outages that affect Internet providers leading to the
interruption of the Internet connectivity. Similarly to the
movement for mobile nodes, failure detection can be used
for multihomed nodes as a clue to verify the reachability
of the currently used locator. Hence, multihomed nodes
should detect such events in order to maintain their
address set up-to-date.
3) Trafc engineering: On one hand, multihoming
entities (end-sites or end-hosts) are connected to the
Internet through several paths characterized by a set of
quality of service (QoS) parameters (e.g., delay, band-
width, jitter). On the other hand, multihoming entities aim
at performing load sharing according to some policies.
Hence, multihoming entities need to efciently select the
appropriate path satisfying their performance or policy
requirement [16]. Trafc Engineering (TE) functionalities
allow multihomed entities to optimize the use of available
1522 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
paths by adapting the route selection mechanism to some
requirements [17]. TE functionalities are necessary to
achieve suitable performances, load sharing and the policy
requirements presented in section II-B.
D. Multihoming constraints
The deployment of multihoming in the Internet faces,
however, several constraints. These constraints are thor-
oughly discussed in [8].
1) Scalability: The scalability issue is a major con-
cern in deploying multihoming. In fact, a multihoming
solution that maintains states in the inter provider routing
systems is inherently not scalable. For example, one of
the most used techniques to achieve multihoming relies
on the BGP. BGP is an inter-domain routing protocol,
i.e. BGP is responsible of routes announcements in the
Internet. The Internet contains today more than 25000
IPv4 autonomous Systems (AS) and the BGP Routing
Information Base (RIB) contains approximatively 300000
entries [18]. In addition, at least 60% of stub domains are
multihomed [2]. The growth of the BGP table impacts
the packet forwarding speed and requires a large memory
space.
2) Compatibility with IP routing: A multihoming so-
lution should be compatible with the IP routing [19]. The
locators used by such protocol should comply with the
Internet topology. The Rekhter Law [20] stipulates that:
Addressing can follow topology or topology can follow
addressing; choose one.. Multihoming protocols should
follow this law in order to prevent routes disaggregation
and allow the routing system to scale. Thus, as end host
routes can not be propagated in the whole Internet, the
used locator should be topologically correct [21].
3) Compatibility with existing sockets: The POSIX
standard [22] denes a generic socket API which must
be used by any protocol for interoperability purpose.
This socket API uses IP addresses for identication pur-
poses Consequently, a protocol that supports multihoming
should be compatible with the existing socket API.
4) Independence: The Independence refers to the ab-
sence of cooperation between a multihomed entity and its
upstream providers, and among the upstream providers
themselves. In order to ensure independence between
the multihomied entity and its upstream providers, a
multihoming solution should not be dependent on specic
congurations enabled on the provider side. This means
that an ISP should not provide a specic support to
an end-site because this end-site is multihomed so that
small corporate can also benet from multihoming. The
independence between providers means that ISPs are not
supposed to cooperate between each other because an end
site is multihomed with them.
III. THE SHIM6 PROTOCOL
We present in this section an overview of the IPv6 mul-
tihoming approaches and then we focuss on a host-centric
multihoming protocol, namely the SHIM6 protocol.
We distinguish three categories of multihoming proto-
cols: the routing approach, the edge-centric approach and
the host-centric approach. The routing approach is based
on path diversity inferring and re-routing algorithms in
order to provide multihoming functionalities. The edge-
centric approach enables the multihoming support at the
edge of an IPv6 network. Host-centric approach enables
the multihoming support at the end hosts.
A. Multihoming host-centric approaches
In such approaches, end hosts are more aware of their
networks and available connections and this requires extra
complexity in network stacks. There are mainly two ways
for enabling the multihoming in a network stack. The rst
one consists of modifying an existing layer; whereas the
second way consists of adding a new thin sub-layer to
handle the multihoming.
Most of the host-centric solutions which are based on
the rst way of enabling multihoming support modify
the transport layer since multihoming aims at providing
transport layer survivability. The most used transport
protocols (TCP and UDP) use IP addresses to identify
communication. If a failure occurs in the used address,
the transport session is automatically broken. A possible
solution to this problem is to use multiple addresses
per end-host in the transport layer to switch from one
address to another in case of a failure. Several proposals
in the literature enabled multihoming in the transport layer
such as Multihomed TCP, TCP-MH, SCTP and DCCP.
Multihomed TCP [23] and TCP-MH [24] modify the
TCP protocol while SCTP and DCCP are new transport
protocols. Multihomed TCP uses a context identier
instead of IP addresses and ports to identify a connec-
tion. TCP-MH modies the SYN segment to contain all
the available addresses and implements primitives (MH-
Add,MH-Delete) to modify the address currently in use.
The SCTP [25] provides a native multihoming support
by associating one session with multiple IP addresses
and thus with multiple paths. One path is considered
as primary and the others are backups. The Datagram
Congestion Control Protocol (DCCP) did not originally
support multihoming. In fact, multihoming is added as an
extension [26] that provides an extension adds primitive
to transfer the established connection from one address to
another.
Host-centric solutions based on the second way of
enabling multihoming at the end-host add a new shim
layer in the network stack which decouples a nodes
identication from its localization. In fact, TCP/IP has
been formerly designed to allocate a single IP address
per device. The role of the IP address was two-fold: First
locate the end-hosts in the network, second identify end-
hosts running sessions [12], [13]. Multihomed end-hosts
have several addresses and thus they require a more ex-
ible interaction with their address sets. From a sessions
point of view, identity needs to be independent from their
physical administrative domain [14]. Therefore, a solution
that manages multiple IP addresses per node and thus
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1523
2011 ACADEMY PUBLISHER
handles multihoming, consists of adding a shim layer
which decouples locators from identiers: the application
handles identiers while IP routing layer handles locators.
We can decouple node identity from its localization by
creating new namespaces either for node identication or
node localization. We can also achieve this by choosing
one address as a permanent node identier and consider
the remaining addresses as potential locators. In this case,
we should perform an IP address rewriting in order to
translate the node identier into a locator and vice versa.
B. SHIM6
The SHIM6 protocol is an IPv6 multihoming protocol
[27], [28]. It introduces a new shim sublayer within the
IP layer.
In order to preserve session survivability, SHIM6 uses
one of the available addresses as a permanent identier.
This address -called upper layer identier (ULID)- is a
location independent identier. The remaining addresses
are considered as locators. The shim layer performs an
address rewriting from ULID to locator and vice versa.
Each SHIM6 node stores the information related to its
locators, ULID and its correspondent peer addresses in a
structure called the SHIM6 context. The SHIM6 context
is established after a four-way handshake and can be mod-
ied while having an ongoing communication through
specic SHIM6 update messages (see Fig. 1).
Fig. 1. SHIM6 context establishment
C. Establishing a SHIM6 context
SHIM6 maintains a context per ULID pair which holds
information about the established session between these
two ULID. A SHIM6 context is identied by two context
tags which are 47-bit numbers randomly allocated by each
one of the communicating peer. The tag is included in
each SHIM6 control message in order to prevent possible
attacker to spoof SHIM6 control messages. In order to
discover SHIM6 context tags, attackers need to be along
the path to sniff the context tag.
The SHIM6 context is established after a four-way
handshake control messages I1,R1,I2,R2. We detail in the
following each of these messages.
1) I1 Message: The I1 message is the rst SHIM6
control message in the context establishment handshake.
When one of the communicating peer decides to set up
a SHIM6 context, it starts by creating a context state. It
allocates a tag to the SHIM6 context, sends an I1 message
to the other node and sets the state of its context to
I1_SENT. The I1 message embeds the context tag allo-
cated by the initiator and a nonce. The nonce is a 32-bit
randomly generated number by the initiator which must
be included in the R1 message (the response message to
I1) to be used to identify the response. The initiator starts
a timer (I1_TIMEOUT) upon sending the I1 message, and
if it does not receive an R1 message or an I2 message
(in case of a simultaneous context establishment) after
the expiration of the timer, it retransmits the I1 message.
The retransmission of the I1 message is controlled by an
exponential back-off timer. The maximum number of al-
lowed retransmissions is MAX_I1_RETRANSMISSION
upon which it is inferred thateither the corespondent
peer may have not implemented SHIM6 or a rewall is
blocking the I1 message. If the initiator peer receives an
ICMP error "Unrecognized Next Header" in response to
its I1 message, it is a more reliable indication that the
correspondent peer does not implement SHIM6.
2) R1 Message: R1 message is a response message
to I1. When a host needs to send an R1 message (the
Peer in Fig. 1), it copies the initiator nonce eld from the
I1 message into the R1 message, generates a responder
nonce and a hash of the information contained in the I1
message (context tag, ULID pair, initiator nonce, and a
secret S maintained by the peer) called responder val-
idator. Both responder validator and the responder nonce
are used by the correspondent peer in order to verify that
an I2 message is sent in response to its R1 message. At
this stage, the correspondent peer does not allocate any
SHIM6 state, it stays in the idle state.
3) I2 Message: When a host receives an R1 message
(the Initiator in Fig. 1), it rst checks whether it has
allocated a context corresponding to the nonce included in
the R1 message. If no context is found, the host discards
the R1 message, otherwise it sends an I2 message. In
this latter case, the host copies the Responder Validator
eld, and the responder nonce from the R1 message and
includes them in the I2 message in addition to its locator
list. Finally, the hosts starts an I2_TIMEOUT timer and
sends the I2 message changing its state from I1_SENT
to I2_SENT. If the host does not receive an R2 message
in response to the I2 message before I2_TIMEOUT, it
may retransmit the I2 message according to a procedure
similar to the one used with I1 messages.
4) R2 Message: Upon receiving an I2 message, the
host extracts the Responder Validator value and veries
whether this value correspond to the value that it would
have computed. If this verication fails, it discards the I2
message, otherwise the host extracts the locator list from
the I2 message. It creates then a context and generates a
context tag. It nally sends an R2 message and changes its
state from IDLE to ESTABLISHED. In the R2 message,
1524 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
the host includes its context tag, its responder nonce and
its locator list. When the peer receives the R2 message, it
veries whether there is context that matches the nonce
included in the R2 message. It extracts then the locator
list, the context tag and records these information in its
context. Finally it changes its state from I2_SENT to
ESTABLISHED.
D. Updating a SHIM6 Context
The set of the available locators of a node supporting
SHIM6 may change in time. A locator may become un-
available after an outage or after a renumbering operation
performed by the corresponding upstream provider. This
node can also acquire new locator(s) when a new router
boots on the links it is attached to. As this node shares
with its correspondent peers its list of locators (recorded
in their SHIM6 contexts), it should inform them about any
change that may affect its list of locators. For this purpose,
SHIM6 uses a control message, called Update Request
(UR), which is used by SHIM6 nodes to inform their
correspondent peers about any change that affects their
locator set. The UR message should be acknowledged
by an Update Acknowledgment UA message. If after
ending an UR message the node does not receive any
UA message before UPDATE_TIMEOUT time, then it
retransmits the UR message. The retransmission of the
UR is controlled by a back-off timer and the maximum
number of retransmission is MAX_UPDATE_TIMEOUT.
After reaching this limit, the node discards its SHIM6
context. The UR message includes a request nonce, the
destination context tag and the node new locator(s) list.
The UA message includes the destination context tag and
the request nonce copied from the UR message.
E. SHIM6 context recovery
When a node receives a payload message containing a
SHIM6 extension header or a SHIM6 control message
but it has no SHIM6 context already established with
the sender, it assumes that it has discarded this context
while the sender has not. In such situation, the receiver
starts a SHIM6 context recovery procedure. It replies
with an R1bis SHIM6 message in order to fast-reestablish
the lost SHIM6 context.The R1bis message includes the
context tag copied from the packet which has triggered the
sending of the R1bis, a responder nonce and a responder
validator. The responder validator is a hash of the context
tag, the pair of the locator, the responder nonce and
a secret maintained by the sender of the R1bis. The
responder validator together with the responder nonce
are used to identify the I2bis message received as a
response to the R1bis message. If a node receives an
R1bis message, it rst extracts the context tag and the
source and destination addresses of the message. In order
to conclude that the sender of the R1bis message lost
its SHIM6 context, the node must verify two conditions.
The rst condition is that the context tag included in the
R1bis message is bound to local SHIM6 Context in the
ESTABLISHED state. The second condition is that the
source and destination addresses of the R1bis message
match respectively the local preferred locator and the peer
preferred locator of this context. If the two conditions are
fullled, the receiver of the R1bis message, replies with an
I2bis message. It includes in this message the responder
validator and the responder nonce copied from the R1bis
message and an initiator nonce in addition to its locator
list. Finally it changes the state of its SHIM6 context from
ESTABLISHED to I2BIS_SENT. Upon receiving an I2bis
message, the host veries that the responder validator
is the equal to a responder validator that it would has
computed. Then it allocates a SHIM6 context, changes
its state to ESTABLISHED and sends an R2 message.
As the multihomed node has several addresses, SHIM6
uses a combination of Hash Based Addresses (HBAs)
[29] and Cryptographically Generated Addresses (CGAs)
[30] to bind a set of addresses with a multihomed node
and to verify whether a claimed address belongs to a
node [16], [31]. SHIM6 uses the REAchability Protocol
(REAP) in order to detect possible failures and recover
from them [32], [33]. REAP allows SHIM6 to detect
failures either through the absence of keepalives sent by
the corresponding peer or through information provided
by the upper layer protocol. The recovery mechanism is
based on the exploration of the available addresse set.
The goal of this exploration process is to nd a working
address pair.
IV. SHIM6 IN A MOBILE ENVIRONMENT
In this section, we study the behavior of the SHIM6
protocol in a mobile environment. As the SHIM6 context
is a key feature of the SHIM6 protocol, we study the
mobility impact before, during and after the context
establishment. In the following we assume that a node
called Initiator initiates a SHIM6 context with a Peer
in the Internet. We assume that any of them can move
at anytime. In order to study the impact of movement
on the context establishment, we divide the context es-
tablishment handshake into three phases: the rst phase
lasts from the sending of the rst I1 message until the
reception of an R1 message. The second phase lasts from
the reception of an R1 message until the sending of the
rst I2. The third phase lasts from the sending of the rst
I2 message until the reception of an R2 message (see
Fig. 1).
A. Mobility before the context establishment
If an Initiator executes an L3 handover before estab-
lishing a SHIM6 context with its Peer, its ongoing session
will be broken. Before establishing a SHIM6 context, the
Peer does not know the possible locators of the Initiator.
Therefore, if the currently used locator of the Initiator
becomes unreachable, the whole session is broken. Con-
sequently, the establishment of a SHIM6 context is a sine
qua non condition for session survivability as it holds
information necessary for rehoming.
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1525
2011 ACADEMY PUBLISHER
B. Mobility during the context establishment
1) Preliminary study: In the following, we aim to show
that node mobility impacts the context establishment and
leads to the loss of SHIM6 control messages and their
retransmission. For this purpose, we set up the testbed
presented by Fig. 2.
Fig. 2. Testbed 1: impact of nodes mobility on the SHIM6 context
establishment latency
Our testbed involves an Initiator that moves between
two access points (APs): AP1 and AP2. The APs are
connected to the NetEM Node which uses a special
feature of the Linux kernel named the network emulator
(NetEM) module. This module is used to emulate a large
network where we can vary the end-to-end delay and
congure the packet loss rate. We congure a one way
delay equal to 50 ms +/- 5 ms. The NetEM node is also
connected to the SHIM6 nodes. Two other nodes are used
to monitor and capture trafc in the experiment. In this
experiment, we assume that the Initiator executes an L2
handover. The L2 handover is randomly triggered while
the SHIM6 context is being established. In our testbed,
the Initiator implements the SHIM6 protocol developed
by the UCL University [34].
Table I shows the retransmission frequency of the
SHIM6 control messages I1 and I2. We note that in all
the cases, either I1 or I2 is retransmitted. A retransmission
of a SHIM6 control message indicates that this message
was lost during node movement. We can empirically see
that node movement, while a SHIM6 context is being
established, leads to the loss of a SHIM6 control message
and a retransmission either of the I1 or the I2 message.
2) Theoretical study: Node mobility while a SHIM6
context is being established may defer the set up of
the context or makes it impossible. The consequences of
mobility during the establishment of the SHIM6 context
depend on which entity is moving (Initiator, Peer), the
handover type (L2, L3) and the phase in which the
movement occurs (I1-R1, R1-I2, I2-R2). We use the
following notation to capture these parameters: (moving
entity, handover type, context establishment phase). In the
following, the symbol (*) denotes any possible eventual-
I1 I2
Retransmission frequency 51.2% 48.8%
TABLE I
RETRANSMISSION FREQUENCY OF I1 AND I2 MESSAGES
ity. For example, an Initiator executing an L3 handover
after sending an I2 message is noted (Initiator, L3, I2-R2).
Fig. 3 describes all the possible cases of the mobility
during the context establishment. In the following, we
assume that all the retransmission timeouts last 4s as
suggested by the SHIM6 specications [28].
We rst study the consequences of an L2 handover
during the context establishment then we focus on the
L3 handover case.
(*, L2, I1-R1): if the Initiator or the Peer changes
AP during the I1-R1 phase, the SHIM6 protocol
retransmits the I1 message after a timeout as it has
not received an R1 message in response to its I1
(case 1 and 7).
(*, L2, I2-R2): if the Initiator or the Peer changes
AP during the I2-R2 phase, the SHIM6 protocol
retransmits the I2 message after a timeout as it has
not received an R2 message in response to its I2
(case 3 and 9).
(Peer, L2, R1-I2): if the Peer moves during the R1-I2
phase, it may not receive the I2 sent by the Initiator.
Thus, the Initiator retransmits the I2 message after a
timeout (case 8).
(Initiator, L2, R1-I2): if the Initiator moves in the
R1-I2 phase, it will send the I2 message as soon as
it nishes the L2 handover and therefore, there will
be no retransmissions (case 2).
As a conclusion, if an L2 handover occurs during the
context establishment, the context will be delayed, but
always established.
In the following, we examine the L3 handover conse-
quences on the context establishment.
(Initiator, L3, I1-R1): if the Initiator moves during
the I1-R1 phase and acquires a new address, it will
not receive the R1 message - as it was sent to its
old address. The Initiator will conclude that its I1
message was lost and therefore, it will send it again
from the new address after a timeout and the context
will be established (case 4).
(Initiator, L3, R1-I2): The Initiator moves after re-
ceiving the R1 message and before sending I2, it will
send the I2 message with the newly acquired address.
Before the Initiator movement, the Peer sends an R1
message to the Initiator with a responder validator
eld calculated using the ULID address pair present
in the I1 message. After executing an L3 handover,
the Initiator copies this responder validator from the
R1 message in its I2 message. Upon receiving the I2
message, the Peer nds that the responder validator
does not match the one that it would have computed
with the previous address. Thus, the Peer silently
1526 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
Fig. 3. Possible cases of mobility during the SHIM6 context establishment
discards the I2 even if it retransmitted and the context
is not established (case 5).
(Initiator, L3, I2-R2): if the Initiator moves after
sending the I2 message and before receiving R2, it
will not receive the R2 message. Before executing
an L3 handover, the Initiator sends an I2 message
to the Peer containing its old address. The Peer
sends the R2 message to the Initiator s old address
and sets its SHIM6 context state to established. The
Initiator executes an L3 handover and acquires a
new address. Meanwhile, it does not receive R2.
After a timeout, it sends the I2 message again with
the newly acquired address. Upon reception of the
I2 message, the Peer veries that it has already
an established context with the Initiator having the
same context tag and overlapping locator sets. Thus,
the Peer concludes that the Initiator has lost the
original context (which is wrong). It discards the old
context and sends an R2 message again. Finally the
context is successfully established (case 6). This a
typical context confusion situation predicted by the
SHIM6 protocol specications. The peer detecting
such a situation must not keep two contexts in an
established state having the same context tag and
having overlapping locator sets.
(Peer, L3,*): if the Peer executes an L3 handover
during a context establishment, the SHIM6 context
will never be established (case 10, 11, 12). In fact,
if the Peer moves during the I1-R1 phase, it will
not receive the I1 message as the Initiator does not
know its new address. Similarly, if the Peer moves
during the R1-I2 phase or the I2-R2 phase, it will
not receive the I2 message.
As a conclusion, if the Peer executes an L3 handover
during the context establishment, the context cannot be
established. Moreover, if the Initiator executes an L3
handover during the R1-I2 phase the context is not
established. In all the other cases, the SHIM6 context can
be established with additional delays. In the following, we
aim to reduce the SHIM6 establishment delays by using
movement detection triggers.
3) Movement detection optimization: The study in the
previous section shows that in the majority of the cases,
mobility during context establishment leads to retransmit-
ting SHIM6 control messages (cases 1, 3, 4, 6, 7, 8 and
9 of Fig. 3).
The retransmission mechanism in SHIM6 is controlled
by a backoff timer having an initial value of 4s. As the
SHIM6 messages are lost during the execution of the L2
handover, we propose to improve the SHIM6 retransmis-
sion timer by coupling SHIM6 with a movement detection
trigger. We use link-layer hints in order to retransmit lost
messages quickly. When a new link-layer association is
established, we stop the retransmission timer and send
again the last SHIM6 control message. If the retransmitted
SHIM6 control message was really lost then our proposal
signicantly reduces the context establishment latency.
Otherwise, if the retransmitted SHIM6 message was not
lost, then we are in a case of a received duplicated SHIM6
message. In such a case, the receiver will silently discard
it as it has an old nonce.
The movement detection optimization will undoubtedly
improve the context establishment time in a mobile envi-
ronment. In the next section, we quantify this improve-
ment through experiments.
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1527
2011 ACADEMY PUBLISHER
C. Evaluation
In this section, we quantify analytically and through
measurements on a real testbed the impact of node mobil-
ity on the SHIM6 context latency. Our evaluation covers
both the L2 handover execution case and a L3 handover
execution case during the SHIM6 context establishment.
We aim to prove that our proposed movement detection
optimization, presented in section IV-B3, signicantly re-
duces the SHIM6 context latency during node movement.
We use in our evaluation the same testbed presented
in Fig. 2. We integrated to the SHIM6 implementation
developed by the UCL University [34] a movement detec-
tion trigger which helps us to quickly retransmit SHIM6
control messages when needed.
1) L2 handover case: As explained in section IV-B,
if the Initiator executes an L2 handover during the I1-R1
(case 1 in Fig. 3, (Initiator, L2 ,I1-R1)), it will retransmit
the I1 message; whereas, if the Initiator executes an L2
handover during the I2-R2 (case 3 in Fig. 3, (Initiator, L2,
I2-R2)), it will retransmit the I2 message. If the movement
occurred during the R1-I2 phase, no retransmission is
needed (case 2 in Fig. 3, (Initiator, L2, I2-R2)). Hence,
in the following, we focus on case 1 and case 3.
We dene the context establishment latency as the
elapsed time from the sending of the rst I1 message to
the reception of the R2 message. Let
ce
be the context
establishment latency.
In a rst case, we assume that the Initiator waits for a
timeout set to 4s before retransmitting its pending SHIM6
control message. Therefore, after a timeout expiration and
the exchange of 4 messages,
ce
will be equal to :
ce
= T
Timeout
+4 T
OneWayDelay
(1)
In this rst case (where we use a timeout),
ce
is
theoretically equal to 4.2s (see Eq. 1), where T
Timeout
is equal to 4s and T
OneWayDelay
is equal to 50ms. In
Fig. 4, we plot
ce
for the case presented by Eq. 1.
In our experiment, if the L2 handover occurred during
the I1-R1,
ce
is equal to 4,209s; whereas if the L2
handover occurred during I2-R2
ce
is equal to 4,189s.
We observed by experimentation approximately the same
result as we found by theory.
In a second case, we assume that the Initiator imple-
ments a movement detection trigger to quickly retransmit
the pending SHIM6 message. When the Initiator detects
that a new link-layer association is established, it stops its
retransmission timer and sends again the last SHIM6 con-
trol message. Thus, if we take into account the movement
detection optimization,
ce
will be equal to :
ce
= T
StartL2handover
+T
L2handover
+4 T
OneWayDelay
(2)
The term T
StartL2handover
refers to the time between
the sending of a SHIM6 message and the execution of
the L2 handover.
In this second case (where we use a movement de-
tection trigger),
ce
is theoretically between 0.5s and
0.6s (see Eq. 2). T
StartL2handover
is between 0 and
2*T
OneWayDelay
as the L2 handover is executed either just
after sending a SHIM6 message or just before receiving
a response. The L2 handover latency is evaluated at 0.3s
and T
OneWayDelay
is equal to 50ms .
In Fig. 4, we plot
ce
for the case presented by Eq. 2.
We observed by experimentation approximately the same
result as we found by theory (0.583s for an L2 handover
during the I1-R1 phase and 0.565s for an L2 handover
during the I2-R2 phase).
Fig. 4. Context establishment latency in case of Initiator movement
2) L3 handover case: When the mobile node executes
an L3 handover during the context establishment, the
context is successfully established only in cases 4 and
6 (see Fig. 3). The only difference between case 4 and 6
is the retransmitted SHIM6 message (I1 or I2). Thus, in
both cases we obtain the same SHIM6 context latency.
Assuming we use our movement detection optimiza-
tion,
ce
is equal to:
ce
= T
StartL2handover
+T
L2handover
+
T
Discovery
+T
DAD
+4 T
OneWayDelay
(3)
T
DAD
refers to the time of execution of the Duplicated
Address Detection algorithm which veries the unique-
ness of the new address [35]. In [36], we evaluate T
DAD
at 1s. T
Discovery
refers to the required time to discover a
new network, this value is correlated to the delay between
two successive Router Advertisement (RA) messages .
We consider in the following that the RA are ran-
domly sent between 30ms and 70ms. T
StartL2handover
+
T
L2handover
is equal to 0.583s , T
Discovery
is equal to
0.16s and T
DAD
is equal to 1s. Therefore,
ce
is equal
to 1.743s.
3) Conclusion: We note that without a movement
detection mechanism, moving while a SHIM6 context
is being established results into a consequent delay of
the context establishment (4.2s in the case of the L2
handover). While the context is being established, on-
going communications are not protected against possible
failures. Thus, it is important to reduce
ce
when one of
the communicating SHIM6 nodes moves.
1528 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
D. Mobility after context establishment
Fig. 5. Neighbor unreachability protocol state machine
We study in this section the mobility consequences on
an already established SHIM6 context. We only focus on
L3 handover as it affects the set of available locators and
thus it might affect the established SHIM6 context.
The L3 handover can be divided into three steps: the
rst step is the discovery of a new network; the second
step concerns the conguration of a new address and the
verication of its uniqueness and nally the third step
deals with the verication of the reachability of the old
address and the update of the neighbor cache entry [37].
In the following we clarify the protocols involved in each
step of the L3 handover execution.
A mobile node discovers a new network through the
reception of an RA message containing a new IPv6
prex. After the conguration of a new IPv6 address,
the mobile node starts the Duplicated Address Detection
(DAD) protocol by sending Neighbor Solicitation (NS)
messages [35].
After acquiring a new address, the mobile node should
rst check whether its old access router (AR) is still
reachable. If the old AR is still reachable, the mobile
node can keep using it as a default AR, otherwise it
must select a new AR. In IPv6, nodes use the Neighbor
Unreachability Detection algorithm (NUD) to conrm the
unreachability of their old AR after conguring a new
address [35]. The node maintains a list of its neighbors
in a neighbor cache with an associated state (see Fig. 5).
If the entry associated to a neighbor reaches the PROBE
state, an active reachability conrmation is launched.
The node sends three NS messages to conrm or inrm
neighbor reachability. If the neighbor is reachable, it
responds with Neighbor Advertisement (NA) message to
the received NS message. NS and NA are dened in the
Neighbor Discovery Protocol [35].
After performing NUD, the mobile node should inform
its correspondent node that its current address/location
has changed and the trafc needs to be re-routed to its
new location. In SHIM6, the mobile node sends to the
correspondent node an Update Request message. The cor-
respondent node replies with an Update Acknowledgment
if the source address of the Update Request message
belongs to its context. If a correspondent node receives
an Update Request message from an unknown address
but having a context tag bound to an already established
context, it concludes that it has a stale context and re-
initiates it by sending an R1Bis message. In this case,
the mobile node responds with an I2Bis message. Update
Request, Update Acknowledgment, R1Bis and I2Bis are
SHIM6 control messages dened in [28].
As after executing an L3 handover, the mobile node
preferred address may become unreachable, the mobile
node must then rehome its communication to another
working address pair. The default rehoming decision
strategy based on the unreachability detection mechanism
of the NUD protocol does not meet the requirement of the
mobile environment, as the unreachability of the current
AR is detected at a later time. In fact, The DAD phase
lasts 1s, and the NUD probing phase lasts 3s. Hence, the
whole rehoming procedure lasts at least 4s.
E. Optimizations
The SHIM6 rehoming latency is an obstacle to using
SHIM6 with an application that has real time require-
ments in a mobile environment because data packets are
lost this time.
Either during the L2 handover, the network discovery
and the IPv6 address conguration, or during the ex-
change of the Update Request/Update Acknowledgment,
data packets are lost. Indeed, before the complete update
of the Peer SHIM6 context, data packets cannot reach the
Initiator new location.
Data packets lost during the update of the SHIM6
context may have an impact on the operation of TCP
which may degrade the quality of the application per-
ceived by the user. Packet losses are interpreted by TCP
as a congestion indication leading TCP to reduce its
congestion window and retransmit the lost segments [33].
We propose in the following some optimizations in
order to improve the rehoming latency in a mobile en-
vironment.
1) Fast Router Discovery: Upon the execution of an
L2 handover, the Initiator waits for the reception of an RA
message which is periodically sent by the AR. The time
between the end of the L2 handover and the reception of
the new prex corresponds to the discovery phase. We
can improve the latency of this phase by using an L2
trigger to rapidly discover new routers [38]. As soon as
the Initiator gets attached to a new AR, it immediately
sends a Router Solicitation message (RS). The reception
of the RS message by the AR triggers the sending of the
RA.
2) Optimized DAD: In IPv6, the address conguration
mechanism is controlled by an algorithm called Duplicate
Address Detection and lasts 1s [35]. While this latency is
tolerable for a node which has just booted and congured
its addresses, it does not meet the requirements of real-
time applications embedded on an Initiator executing an
L3 handover. Before conguring its own address, the node
veries its uniqueness by sending NS message. As the
probability that two nodes congure the same address is
low, the Optimized DAD procedure suggests immediately
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1529
2011 ACADEMY PUBLISHER
using the newly congured address and concurrently
veries its uniqueness [39].
3) Fast NUD: As explained in section IV-D, the NUD
protocol is controlled by a state machine. The verication
of the reachability of an address is executed if its entry
in the neighbor cache is in the PROBE state. In order to
improve the NUD execution, we add an L2 trigger to indi-
cate whether an Initiator has just nished an L2 handover.
After attaching a new AR, the Initiator changes the state
associated with its old AR to PROBE. Moreover, in the
original specication of NUD, the probing is performed
through the sending of a NS message each second. After
three unsuccessful trials, the entry is deleted. Hence, the
probing phase last 3s when the neighbor is unreachable.
We propose to reduce the time between two successive NS
to 0.2s, which is a more realistic period for the mobile
environment.
F. Evaluation
In this section, we empirically evaluate the conse-
quences of the SHIM6 node mobility on their established
SHIM6 context. We estimate the required time to update
a SHIM6 context and to rehome a session in a mobile
environment. The measurements are conducted on a real
testbed presented in Fig. 6.
Our testbed involves three ARs. We use the same access
technologies in the testbed (802.11). Nonetheless, this
does not prevent to have distinct access technologies, for
example one is provided by AR3, the other is provided by
both AR1 and AR2. Our testbed also involves an Initiator
having two wireless interfaces Eth1 and Eth2. Eth1 is
always connected to AR3 while Eth2 changes its point of
attachment from AR1 to AR2.
The mobile SHIM6 nodes involved in our testbed
embed a modied Linux IPv6 network stack. In order
to update the SHIM6 context, we add to the SHIM6
implementation developed by the UCL University [34]
the following control messages and their interactions with
the context: Update Request, Update Acknowledgment,
R1Bis and I2Bis. We modify the Linux kernel IPv6 stack
(2.6.17.11 version) with a new implementation of the
NUD protocol, we add the Optimistic DAD support to
the address conguration, and nally we implement the
link-layer trigger to ensure a fast network discovery and
a fast retransmission of the pending messages.
1) Metric: We measure in this scenario the context
update latency. Let
cu
be the context update latency. In
our scenario,
cu
is equal to:
cu
= T
L2handover
+T
DiscoveryStart
+
T
Discovery
+T
URUA
(4)
The T
DiscoveryStart
latency refers to the time between
the end of the L2 handover execution and the sending of
the RS message. The T
Discovery
latency corresponds to the
exchange of RS and RA messages. Finally the T
URUA
latency corresponds to the update of the SHIM6 context
of the Peer with the newly acquired address.
Fig. 6. Testbed 2: Moving after the SHIM6 context establishment
Fig. 7. SHIM6 context update latency vs. RA delay
Fig. 7 presents an evaluation of
cu
for three RA delay
intervals. We observe that while the RA delay increases,
cu
increases (
cu
is respectively equal to 0.7s, 0.74s
and 1.16s). The increase of the
cu
is due the increase
of the latency of the discovery phase (T
Discovery
).
Upon receiving an RS message, a router computes a
random time between 0 and MAX_RA_DELAY_TIME
to generate an RA in response to the solicita-
tion. If this computed value corresponds to a time
later than the next scheduled (multicast) RA time,
then the RA is sent at its scheduled time (and
the random value is ignored). If an RA was sent
within the last MIN_DELAY_BETWEEN_RAS, then
the next RA should be sent at a time equal
MIN_DELAY_BETWEEN_RAS plus the random value
between 0 and MAX_RA_DELAY_TIME, in order
to rate limit the RAs. In Neighbor Discovery for
IPv6 [35], MAX_RA_DELAY_TIME is set to 0.5s and
MIN_DELAY_BETWEEN_RAS is set to 3s; whereas,
in Mobility Support in IPv6 [40], if the lower limit of
the RAs interval (MinRtrAdvInterval) is less than 3s,
MIN_DELAY_BETWEEN_RAS is set to this limit and
1530 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
its minimum value is 30ms.
Therefore, whenever we increase MinRtrAdvInterval,
the time to send the scheduled RA in response to the RS
increases and thus, T
Discovery
increases.
We obtain almost the same value of
cu
in the two
rst experiments because the decrease of the T
L2handover
almost compensates the increase of the T
Discovery
. How-
ever, for the third result, T
L2handover
does not vary
while T
Discovery
decreases. The L2 handover latency
(T
L2handover
) increases when we send RAs at the highest
possible rate because of the saturation of the wireless
medium induced by the sending of several RA messages.
When RAs are sent at a high rate, the wireless medium is
saturated and therefore, the L2 handover execution lasts
longer (400ms instead of 300ms). Finally, we notice that
T
DiscoveryStart
latency does not appear in Fig. 7 as it lasts
between 2 and 3ms.
This experiment shows that in a mobile environment the
context update latency amounts to 0.7s in the best case.
The currently obtained time is an obstacle to use SHIM6
with applications that have real time requirements.
The major contributors to the context update latency are
the L2 handover and the network discovery latency. If we
consider the case where RAs are sent between 30ms and
70ms, the L2 handover latency represents 57%, and the
network discovery represents 28% of the whole context
update latency.
Several works (such as [41]) proposed to optimize the
L2 handover which can signicantly reduce this latency.
Depending on the wireless card vendor, the reduction
can reach 90% of the overall L2 handover latency (from
300ms to approximately 20ms).
V. CONCLUSION
We have studied in this paper the impact of node
mobility on the SHIM6 multihoming protocol which
was originally intended to handle multihoming for static
nodes. we focused on the mobility impact on the SHIM6
context establishment as it constitutes its key compo-
nent. We showed that performing either an L2 or an
L3 handover while establishing the SHIM6 context may
delay the context establishment. We exacerbated the cases
where performing an L3 handover may lead to a context
establishment failure. We evaluated the mobility impact
during the context establishment on developed testbeds
and measured the context establishment latency.
Furthermore, we investigated mobility impacts on an
already established SHIM6 context. We showed that the
default rehoming decision strategy of the SHIM6 protocol
does not meet the mobility requirement. Consequently, we
proposed some viable optimizations to enhance the default
rehoming decision strategy. The rehoming latency of this
enhanced rehoming decision strategy is then evaluated on
a developed testbed.
We showed through extensive experiments that SHIM6
can indeed manage mobility by itself. However, the
measured rehoming latency stays an obstacle to using
SHIM6 for real-time applications. In the quest to reduce
the rehoming latency, we are investigating the possibil-
ity of executing the NUD in parallel to the ongoing
communication in order to proactively detect possible
unreachability of the current address. Furthermore, we
are currently studying the use of the link identication
in the RA message (proposed by the DNA IETF working
group) in order to enhance the detection of node mobility.
Moreover, as SHIM6 does not take into account a registrar
entity in its architecture, it cannot solve the double
jump problem (i.e., the two communicating SHIM6 nodes
simultaneously move). We are also targeting to propose
ways to solve the double-jump problem.
REFERENCES
[1] J. Day, Patterns in Network Architecture: A Return to Fundamen-
tals. Prentice Hall, 2007.
[2] S. Agarwal, C.-N. Chuah, and R. H. Katz, OPCA: robust inter-
domain policy routing and trafc control, in Open Architectures
and Network Programming, 2003 IEEE Conference on, Apr. 2003,
pp. 5564.
[3] C. Launois and M. Bagnulo, The paths towards IPv6 multihom-
ing, in IEEE Communications Surveys and Tutorials, vol. 8, 2006,
pp. 3851.
[4] M. Bagnulo, A. G. Martinez, A. Azcorra, and C. de Launois, An
incremental approach to ipv6 multihoming, Computer Communi-
cations, vol. 29, no. 5, pp. 582592, 2006, networks of Excellence.
[5] J. F. Shoch, Inter-network naming, addressing, and routing.
Washington, D.C.: IEEE, Sep. 1978, pp. 7279.
[6] D. Le, X. Fu, and D. Hogrefe, A review of mobility support
paradigms for the internet, IEEE Communications Surveys &
Tutorials, vol. 8, no. 1, pp. 3851, / 2006.
[7] P. Savola and T. Chown, A survey of IPv6 site multihoming pro-
posals, in Telecommunications, 2005. ConTEL 2005. Proceedings
of the 8th International Conference on, vol. 1, Jun. 2005, pp. 41
48.
[8] J. Abley, B. Black, and V. Gill, Goals for IPv6 Site-
Multihoming Architectures, RFC 3582 (Informational), Internet
Engineering Task Force, Aug. 2003. [Online]. Available:
http://www.ietf.org/rfc/rfc3582.txt
[9] A. Akella, B. Maggs, S. Seshan, A. Shaikh, and R. Sitaraman, A
measurement-based analysis of multihoming, in SIGCOMM 03:
Proceedings of the 2003 conference on Applications, technologies,
architectures, and protocols for computer communications. New
York, NY, USA: ACM, 2003, pp. 353364.
[10] C. de Launois, B. Quoitin, and O. Bonaventure, Leveraging net-
work performance with IPv6 multihoming and multiple provider-
dependent aggregatable prexes, Comput. Netw., vol. 50, no. 8,
pp. 11451157, 2006.
[11] D. K. Goldenberg, L. Qiuy, H. Xie, Y. R. Yang, and Y. Zhang,
Optimizing cost and performance for multihoming, in SIG-
COMM 04: Proceedings of the 2004 conference on Applications,
technologies, architectures, and protocols for computer communi-
cations. New York, NY, USA: ACM, 2004, pp. 7992.
[12] J. Saltzer, On the Naming and Binding of Network Destinations,
RFC 1498 (Informational), Internet Engineering Task Force, Aug.
1993. [Online]. Available: http://www.ietf.org/rfc/rfc1498.txt
[13] G. Huston, Multi-homing and identity in ipv6, Internet Society
Publications, June 2004.
[14] S. Herborn, R. Boreli, and A. Seneviratne, Identity location
decoupling in pervasive computing networks, in Advanced In-
formation Networking and Applications, 2005. AINA 2005. 19th
International Conference on, vol. 2, Mar. 2005, pp. 610615.
[15] C. Labovitz, G. R. Malan, and F. Jahanian, Internet routing
instability, IEEE/ACM Transactions on Networking, vol. 6, no. 5,
pp. 515528, Oct. 1998.
[16] M. Bagnulo, A. Garcia-Martinez, and A. Azcorra, Efcient Se-
curity for IPv6 Multihoming, ACM Computer Communications
Review, vol. 35, no. 2, pp. 6168, April 2005.
[17] S. Uhlig and O. Bonaventure, Designing bgp-based outbound
trafc engineering techniques for stub ases, SIGCOMM Comput.
Commun. Rev., vol. 34, no. 5, pp. 89106, 2004.
[18] G. Huston, Bgp reports, 2009, http://bgp.potaroo.net/.
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1531
2011 ACADEMY PUBLISHER
[19] T. R. Henderson, Host mobility for IP networks: a comparison,
IEEE Network, vol. 17, no. 6, pp. 1826, Nov./Dec. 2003.
[20] Y. Rekhter and T. Li, An Architecture for IP Address
Allocation with CIDR, RFC 1518 (Historic), Internet
Engineering Task Force, Sep. 1993. [Online]. Available:
http://www.ietf.org/rfc/rfc1518.txt
[21] C. Vogt, Six/one router: a scalable and backwards compatible
solution for provider-independent addressing, in MobiArch 08:
Proceedings of the 3rd international workshop on Mobility in the
evolving internet architecture. New York, NY, USA: ACM, 2008,
pp. 1318.
[22] Standard for information technology - portable operating system
interface (posix). shell and utilities, 2004. [Online]. Available:
http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=1309816
[23] C. Huitema, Multi-homed tcp, draft-huitema-multi-homed-
0 (work in progress), May 1995. [Online]. Available:
http://tools.ietf.org/html/draft-huitema-multi-homed-01
[24] A. Matsumoto, M. Kozuka, and K. Fujikawa, Tcp multi-home
options, draft-arifumi-tcp-mh-00 (work in progress), October
2003. [Online]. Available: http://tools.ietf.org/html/draft-arifumi-
tcp-mh-00
[25] R. Stewart, Stream Control Transmission Protocol, RFC 4960
(Proposed Standard), Internet Engineering Task Force, Sep. 2007.
[Online]. Available: http://www.ietf.org/rfc/rfc4960.txt
[26] E. Kohler, Datagram congestion control protocol mobility and
multihoming, draft-kohler-dccp-mobility-01 (work in progress),
January 2006. [Online]. Available: http://tools.ietf.org/html/draft-
kohler-dccp-mobility-01
[27] P. Savola, Site Multihoming: A Microscopic Analysis of Finnish
Networks, in Networking, International Conference on Systems
and International Conference on Mobile Communications and
Learning Technologies, 2006. ICN/ICONS/MCL 2006. Interna-
tional Conference on, Apr. 2006, pp. 2525.
[28] E. Nordmark and M. Bagnulo, Shim6: Level 3 Multihoming
Shim Protocol for IPv6, RFC 5533 (Proposed Standard),
Internet Engineering Task Force, 2009. [Online]. Available:
http://www.ietf.org/rfc/rfc5533.txt
[29] M. Bagnulo, Hash-Based Addresses (HBA), RFC 5535
(Proposed Standard), Internet Engineering Task Force, Jun. 2009.
[Online]. Available: http://www.ietf.org/rfc/rfc5535.txt
[30] T. Aura, Cryptographically Generated Addresses (CGA), RFC
3972 (Proposed Standard), Internet Engineering Task Force,
Mar. 2005, updated by RFCs 4581, 4982. [Online]. Available:
http://www.ietf.org/rfc/rfc3972.txt
[31] M. Bagnulo, A. Garca-Martnez, and A. Azcorra, Fault tolerant
scalable support for network portability and trafc engineering,
in WWIC 07: Proceedings of the 5th international conference
on Wired/Wireless Internet Communications. Berlin, Heidelberg:
Springer-Verlag, 2007, pp. 129140.
[32] J. Arkko and I. van Beijnum, Failure Detection and Locator
Pair Exploration Protocol for IPv6 Multihoming, RFC 5534
(Proposed Standard), Internet Engineering Task Force, Jun. 2009.
[Online]. Available: http://www.ietf.org/rfc/rfc5534.txt
[33] A. de la Oliva, M. Bagnulo, A. Garcia-Martinez, and I. Soto,
Performance Analysis of the REAchability Protocol for IPv6
Multihoming, in Next Generation Teletrafc and Wired/Wireless
Advanced Networking 7th International Conference, NEW2AN
2007, September 2007, pp. 443454.
[34] S. Barr, Linshim6 - implementation of the shim6 protocol,
Universit catholique de Louvain, Tech. Rep., Feb 2008.
[35] T. Narten, E. Nordmark, W. Simpson, and H. Soliman, Neighbor
Discovery for IP version 6 (IPv6), RFC 4861 (Draft Standard),
Internet Engineering Task Force, Sep. 2007, updated by RFC
5942. [Online]. Available: http://www.ietf.org/rfc/rfc4861.txt
[36] A. Dhraief and N. Montavont, Toward Mobility and Multihoming
Unication- The SHIM6 Protocol: A Case Study, in Wireless
Communications and Networking Conference, 2008. WCNC 2008.
IEEE, Las Vegas, Nevada/USA, Mar./Apr. 2008, pp. 28402845.
[37] N. Montavont and T. Noel, Handover management for mo-
bile nodes in IPv6 networks, IEEE Communications Magazine,
vol. 40, no. 8, pp. 3843, Aug. 2002.
[38] G. Daley, B. Pentland, and R. Nelson, Movement detection
optimizations in mobile IPv6, in Networks, 2003. ICON2003. The
11th IEEE International Conference on, Sep. 28Oct.1, 2003, pp.
687692.
[39] N. Moore, Optimistic Duplicate Address Detection (DAD)
for IPv6, RFC 4429 (Proposed Standard), Internet
Engineering Task Force, Apr. 2006. [Online]. Available:
http://www.ietf.org/rfc/rfc4429.txt
[40] D. Johnson, C. Perkins, and J. Arkko, Mobility Support in IPv6,
RFC 3775 (Proposed Standard), Internet Engineering Task Force,
Jun. 2004. [Online]. Available: http://www.ietf.org/rfc/rfc3775.txt
[41] M. Shin, A. Mishra, and W. A. Arbaugh, Improving the latency
of 802.11 hand-offs using neighbor graphs, in MobiSys 04: Pro-
ceedings of the 2nd international conference on Mobile systems,
applications, and services. New York, NY, USA: ACM, 2004,
pp. 7083.
Dr. Amine Dhraief received his Ph.D. degree
in computer science from Telecom Bretagne
University of Rennes I in 2009, his Master
of Science degree and his Engineering de-
gree both computer science from the National
School of Computer Sciences (ENSI), Uni-
versity of Manouba respectively in 2006 and
2005. He is currently an Assistant Professor
of Computer Science at ISIGK University of
Kairouan, Tunisia. He is also member of the
HANA research group at the National School
of Computer Sciences, University of Manouba, Tunisia. His current
research interests include pervasive and ubiquitous computing and
general-purpose computation on graphics processing units.
Dr. Abdelfettah Belghith received his Master
of Science and his PhD degrees in computer
science from the University of California at
Los Angeles (UCLA) respectively in 1982
and 1987. He is since 1992 a full Professor
at the National School of Computer Sciences
(ENSI), University of Manouba, Tunisia. His
research interests include computer networks,
wireless networks, multimedia Internet, mobile
computing, distributed algorithms, simulation
and performance evaluation. He runs several
projects in cooperation with other universities, research laboratories
and research institutions. He is currently the Director of the Doctoral
School (Graduate School) STICODE of the University of Manouba,
the responsible of the Network and Distributed Systems Master degree
and the head of the HANA Research Group at the National School of
Computer Sciences.
1532 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
Development of Anonymous Networks Based on
Symmetric Key Encryptions
Hazim Haddad
University of Fukui, Fukui, Japan
Email: [email protected]
Shinsuke Tamura, Shuji Taniguchi and Tatsuro Yanase
University of Fukui, Fukui, Japan
Email: {tamura, stamigut, yanase}@u-fukui.ac.jp
AbstractAnonymous networks enable entities to send
messages without disclosing their identities. Many
anonymous networks had been proposed already, such as
Mixnet, DC-net, Crowds, etc., however, they still have
serious drawbacks. Namely, they require tremendous
computation overheads to transmit messages over networks.
That is because asymmetric key encryption algorithms are
used. This paper proposes ESEBM (Enhanced Symmetric
Key Encryption based Mixnet), a new mechanism for
anonymous communication that removes drawbacks of
existing anonymous networks while exploiting symmetric
key encryption algorithms. According to experimentations,
throughput of ESEBM is about 1/4.4 of usual
non-anonymous networks, and it achieves more than 36
times higher throughput compared with Mixnet. In addition,
different from existing anonymous networks, ESEBM can
handle reply messages without any additional mechanism,
and it can protect itself from various threats, e.g. DOS
attacks and message forgeries.
I ndex Termsanonymous communication, mixnet, privacy
protection, symmetric key encryption algorithm
I. INTRODUCTION
Identities of message senders are sometimes as
sensitive as messages themselves. For example, a
company may acquire highly confidential information
about its rival companies from identities of their
customers and suppliers. Therefore, the importance of
anonymous communication is increasing as more people
are being involved in network based communication.
Anonymous networks are ones that enable message
senders to send their messages without disclosing their
identities, and various anonymous networks had been
proposed already, e.g. Mix net [1, 5, 9], DC-net [2],
Crowds [4], etc., to protect secrets of entities that
communicate through networks. However, they still have
serious drawbacks. For example, although Mix net is one
of the most promising mechanisms, it requires the
tremendous amount of computations to encrypt/decrypt
messages that are forwarded from senders to their
receivers. That is because asymmetric key
encryption/decryption functions are adopted. In this paper,
a new anonymous network ESEBM (Enhanced
Symmetric Key Encryption based Mix net) is proposed
that removes drawbacks of existing anonymous networks
by using symmetric key encryption functions.
ESEBM consists of two parts, they are the CP
generator (offline) and the anonymous channel (online)
each of which is configured as a sequence of servers, and
senders obtain secret keys of individual servers in the
anonymous channel for encrypting their messages from
the CP generator as off-line processes. Then, once
encryption keys are shared between senders and servers,
servers in the anonymous channel can efficiently transfer
messages of senders to their receivers while exploiting
symmetric key encryption functions.
According to experimentations, the capacity of
ESEBM is more than 36 times higher than that of
decryption type Mix net. Different from asymmetric key
encryption functions, symmetric key encryption functions
also enable message receivers to send reply messages to
the anonymous senders in totally the same way as the
senders send original messages, and consequently,
anyone except the receivers cannot identify even whether
messages are replies or not. Also, the CP generator
configuration disables unauthorized entities to send
messages because only authorized entities that had
obtained secret keys from the CP generator can send
messages. Therefore, ESEBM is secure against various
kinds of attacks including DOS attacks and message
forgeries (or modifications) that are difficult to prevent in
existing anonymous networks.
II. REQUIREMENTS FOR ANONYMOUS NETWORKS
Anonymous networks should satisfy the following
requirements, i.e.,
1. no one except senders of messages can know
identities of the senders,
2. message senders can confirm their message
arrivals at their receivers without disclosing their
identities,
3. receivers can send reply messages back to the
senders without knowing the senders identities,
* Graduate School of Engineering, University of Fukui
3-9-1, Bunkyo, Fukui 910-8507, Japan
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1533
2011 ACADEMY PUBLISHER
doi:10.4304/jnw.6.11.1533-1542
4. anonymous networks must be able to protect
themselves from accesses from unauthorized
entities, and
5. anonymous networks must maintain their
performances as same as usual ones.
The 1st requirement is the most important one, and
senders of messages must be concealed not only from the
receivers but also from network managers, eavesdroppers
and any other entities. The 2nd and the 3rd requirements
are also important, and especially the 3rd one is essential
because information exchanges between entities in many
kinds of applications are carried out as conversations
between them. To satisfy the 2nd requirement is not so
difficult, e. g. senders can confirm deliveries of their
messages without disclosing their identities when the
receivers put receive signals in public bulletin boards.
However, development of practical mechanisms that
satisfy the 3rd requirement is not easy as it looks. For
example, a receiver, which sends reply message M
R
, can
identify the sender of the original message by
eavesdropping on the communication channel to find out
the entity that receives M
R
, because it knows M
R
. About
the 4th requirement, because of anonymity, entities can
behave dishonestly much easier than in usual
communication systems, therefore, anonymous
communication mechanisms must be endowed with the
ability to protect them from dishonest events. The
important thing here is that dishonest events must be
prevented while maintaining anonymities of honest
entities. Finally, to use anonymous networks in large
scale applications where large volumes of messages are
exchanged frequently, they must be efficient enough as
usual non-anonymous networks.
III. RELATED WORKS
This section summarizes currently available
anonymous networks. Although many various kinds of
anonymous networks had been proposed already, still
they cannot satisfy the requirements in the previous
section effectively. Mixnet is an example. It consists of a
sequence of mixservers T
1
, T
2
, ---, T
N
, that relay
messages from senders to their receivers. Where, senders
send their messages while encrypting them repeatedly by
public keys of multiple mixservers T
1
, T
2
, ---, T
N
in the
sequence. Then, individual mixservers relay their
receiving messages to their neighboring servers while
decrypting them by their secret decryption keys finally to
be sent to their receivers. Namely, sender S encrypts its
message M to E(k
N
, E(k
N-1
, ---, E(k
1
, M) ---)) and each T
j
that recieves E(k
j
, E(k
j-1
, ---, E(k
1
, M)---)) from T
j+1
decrypts it to E(k
j-1
, ---, E(k
1
, M)---) by its secret
decryption key k
j
-1
to forward it to T
j-1
, where E(k
j
, M) is
the encrypted form of M. In this message relaying process,
each mixserver stores its incoming messages until
pre-defined number of message arrivals, and shuffles
decrypted messages before forwarding them to its
neighbor. Therefore, each mixserver cannot identify the
links between incoming and outgoing messages of other
mixservers, and as a consequence, no one except the
senders themselves can identify the senders of messages
unless all mixservers conspire.
However, Mixnet uses asymmetric key encryption
functions, such as RSA or ElGamal, and does not work
efficiently in large scale systems where number of
senders send large volume of messages. A lot of
computation overheads are needed to encrypt and decrypt
messages. Asymmetric key encryption functions also
make Mixnet require additional mechanisms for sending
reply messages to senders of the original messages,
therefore, servers can know whether the messages are
replies or not [1, 7]. Although Mixnet can protect itself
from traffic analysis and replay attacks that are discussed
in Sec. VI. A, it cannot prevent DOS attacks or message
forgeries (or modifications). Encryption keys are publicly
disclosed and servers cannot identify spam or forged
messages because they receive messages in their
encrypted forms, therefore, anyone can send spam and
forged messages.
Crowds [4] also consists of multiple relay servers as
same as Mixnet, however, senders send their messages
without encrypting them. Instead of encrypting messages,
servers randomly decide whether to relay their receiving
messages to their receivers or to the other servers in the
network. Namely, when a server receives a message from
a sender, it forwards it to other server with probability 1-p,
and with probability p it sends it to the receiver. Then, it
becomes difficult for entities other than the sender to
identify the sender, and because no encryption or
decryption process is included, Crowds can transfer
messages efficiently. However, apparently it cannot
disable entities to identify senders by tracing messages
from their receivers to their senders. Namely, Crowds
cannot satisfy the most important requirement of
anonymous networks.
Onion routing [3, 8] uses the same principle as Mixnet,
i.e. messages travel from senders to receivers through
sequences of servers (onion routers) while being
encrypted by public keys of multiple onion routers. The
difference from Mixnet is that senders in onion routing
encrypt not only their messages but also their routes, i.e.
servers in onion routing reroute their receiving messages
in unpredictable ways. Therefore, onion routers need not
wait for large number of messages to shuffle them and
can reduce message travelling times. However, onion
routing uses asymmetric key encryption functions and
shares the same drawbacks with Mixnet. An additional
problem of onion routing is that it is vulnerable to timing
attacks, i.e. an adversary can embed messages to know
the flow times of different paths. Then, while using these
message flow times, entities can know senders of
messages by observing message sending and receiving
times of individual senders and receivers.
Other anonymous networks such as Tor [8], buses for
anonymous message delivery [6], Peer to Peer
anonymous mechanisms [12], etc. have the same
drawbacks as Mixnet or Onion routing.
In DC-net [2], sender S
q
constitutes a group {S
1
, S
2
, ---,
S
Q
} that includes itself, and entities in the group generate
their secret numbers {N
1
, N
2
, ---, N
Q
} so that the sum of
1534 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
them becomes 0 in advance. While using its generating
secret number, S
q
encrypts its message M to M + N
q
to
send it to its receiver R. At the same time, each S
j
in the
group also sends its secret number N
j
to R. Therefore, R
can extract M from messages of {S
1
, S
2
, ---, S
Q
}, i.e. N
1
+N
2
+ --- + (M + N
q
) + N
q+1
+ --- + N
Q
= M + 0 = M.
However, no one except S
q
can know the sender of M,
because each S
j
does not know secret numbers of other
senders.
As shown above, DC-net provides almost perfect
anonymity, however it has fatal drawbacks about its
performance, i.e. multiple senders must behave
synchronously. Multiple senders must agree with each
other about random numbers to encrypt messages, also
only one sender can send a message at a time. Therefore,
it is applicable only to small and closed networks. Here, it
must be noted that each S
j
must change random secret
number N
j
at every message sending. If every S
j
uses
same random secret number for different messages sent
from senders in the group, an entity X that eavesdrops on
the communication can easily identify senders of the
messages. Namely, when S
j
sends same number N
j
as its
1st and 2nd messages, X can know that S
j
s random
secret number is N
j
. Also, when S
j
sends (M
j
+ N
j
) and N
j
as its 1st and 2nd messages, it is easy for X to extract M
j
and to identify the sender.
To decrease computation volumes of encryptions and
decryptions, SEBM
[13]
exploits symmetric key
encryption functions. SEBM consists of 2 parts, the
encryption part and the decryption part, and messages are
forwarded to their receivers while being encrypted by
servers in the encryption part and decrypted by servers in
the decryption part. Here different from other anonymous
networks, senders themselves are included as relay
servers in both parts to enable the use of symmetric key
encryption functions. Therefore, although SEBM can
satisfactory reduce the computation overheads caused by
asymmetric key encryptions, senders included in the
encryption and decryption parts reduce the stability of the
communication. For example, when senders, i.e.
volunteer servers, stop operations, messages cannot be
forwarded. As another drawback, because messages in
SEBM must be encrypted and decrypted by servers both
in the encryption and the decryption parts, their travelling
times increase. Also, it cannot efficiently handle reply
messages or prevent accesses from unauthorized entities
either.
IV. ESEBM (ENHANCED SYMMETRIC KEY ENCRYPTION
BASED MIXNET)
This section proposes ESEBM, a scheme for
anonymous networks that efficiently satisfies all the
requirements listed in the previous section. ESEBM
removes most drawbacks that exist in other anonymous
networks, i.e. it can transfer messages without large
overheads, it does not require any additional mechanism
for forwarding reply messages, and it can protect itself
from various attacks.
A. ESEBM Configuration
ESEBM can be considered as a kind of decryption type
Mixnet, in which asymmetric key encryption functions
are replaced by symmetric ones, where the encryption
keys used for sending messages are distributed to senders
in advance. At the same time, it is considered as SEBM in
which volunteer servers are replaced by permanent ones
in order to make the network stable enough [15].
As shown in Fig. 1, ESEBM consists of 2 parts, i.e. the
anonymous channel and the concealing pattern generator
(CP generator). The anonymous channel is configured as
a sequence of N servers as same as Mixnet, and the CP
generator consists of Z-groups, where the g-th group is
configured by N
g
servers, and each server in the
anonymous channel is corresponded to a single server in
the CP generator and vice versa, therefore N = N
1
+ N
2
+
--- + N
Z
. In the remainder, notation T
g
(k) that represents
the k-th server in the g-th group of the CP generator is
used also for representing the p-th server T
p
in the
anonymous channel that corresponds to T
g
(k), and vice
versa.
ESEBM adopts onetime pad as the base algorithm to
encrypt and decrypt messages, and sender S of message
M
S
requests servers in the CP generator to issue a bit
string called concealing pattern (CP), a pad for encrypting
M
S
, in advance as an off-line process.
Provided that servers generate their h-th CP at the
request of S, each server T
j
in the CP generator generates
its h-th CP constructor x
j
(h), and the h-th concealing
pattern X(h) is constructed as XOR of them, i.e. X(h) =
x
1
(h)x
2
(h)---x
N
(h). Then, S sends M
S
to the first
server T
1
in the anonymous channel while encrypting it to
M
S
X(h). Therefore, the length of CPs and CP
constructors are defined as L
M
, which is the length of
messages. When S sends a long message M
S
, M
S
is
divided into multiple frames of length L
M
. Here, S uses
different CPs for encrypting different messages including
different frames of the same message. Also, although
notations X(h) and x
j
(h) are accompanied by h they do not
include any information about h.
Figure 1. ESEBM configuration
requests
CP
obtains
CP
sender S
sends encrypted
message
anonymous channel
T1 T2
receiver R
delivers
decrypted
message
TN
CP generator
2nd group
T2(1) T2(2) T2(N2)
Z-th group
TZ(1) TZ(2) TZ(NZ)
1st group
T1(1) T1(2) T1(N1)
Tg(1) Tg(2) Tg(Ng)
g-th group
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1535
2011 ACADEMY PUBLISHER
Figure 2. Message structure
As same as usual Mixnet, each server in the anonymous
channel stores its receiving messages until it receives the
predefined number of messages, and decrypts, shuffles
and forwards them to its neighboring server finally to be
sent to their receivers. Here, each T
j
decrypts its receiving
encrypted M
S
by simply XORing it by its CP constructor
x
j
(h) that constitutes X(h), the CP that S had used to
encrypt M
S
, then, it is apparent that M
S
X(h) is
transformed to M
S
when all servers decrypt it. On the
other hand, because each server knows only its CP
constructor x
j
(h) in X(h), no one can know the sender of
M
S
unless all servers conspire with each other as same as
in usual Mixnet.
However, different from usual Mixnet where all
senders encrypt their messages by using the same single
public encryption key of each mixserver, in ESEBM,
senders encrypt different messages by using different CPs,
Therefore to enable T
j
to identify its CP constructor x
j
(h)
that constitutes X(h) for encrypting M
S
, message M
S
consists of the message part and the tag part as shown in
Fig. 2. The message part maintains encrypted message M
S
,
i.e. M
S
X(h), and the tag part maintains a sequence of
tags, i.e. vector Q(h) = {Q
1
(h), Q
2
(h), ---, Q
N
(h)}, where
server T
j
that had generated the CP constructor x
j
(h)
to
construct X(h) can know x
j
(h) from Q
j
(h). Here, Q
j
(h) is
constructed so that no one can trace the message by it and
no one except T
j
can identify x
j
(h)
from it.
B. Behavior of the CP Generator
To disable entities to trace messages forwarded
through the anonymous channel, not only
correspondences between the message parts of input and
output messages of individual servers but also those
between their tag parts must be concealed. To achieve this,
the CP generator generates 2 kinds of secret encryption
keys shared between senders and individual servers, the
one is CPs and the other is tag vectors (TVs). The CP
generator is a set of server groups, each of which consists
of at least 3 servers that generate their secret CP
constructors and TV constructors independently of others
to construct CPs and TVs jointly with other servers. Here,
senders communicate only with servers in the 1st group,
i.e. with T
1
(1), T
1
(2), ---, and T
1
(N
1
), to disable servers in
the other groups to know the senders as shown in Fig. 1.
As discussed already, concealing pattern X(h) is
calculated as XOR of CP constructor x
j
(h) (j = 1, 2, ---, N)
generated by each server T
j
, and disables anyone to trace
the message parts of a message relayed by the servers. On
the other hand, individual elements of N-dimensional tag
vector Q(h) = {Q
1
(h), Q
2
(h), ---, Q
N
(h)} disable anyone to
trace the tag part of a message relayed by the servers, and
each Q
i
(h) is calculated as XOR of the i-th elements of
each N-dimensional TV constructor q
j
(h) = {0, ---0,
q
j(j+1)
(h), q
j(j+2)
(h), ---, q
jN
(h)} generated by T
j
(j = 1, ---,
N). Here, each q
jk
(h) in vector q
j
(h) is a bit pattern of
length L
T
as discussed later, 0 represents an all zero bit
pattern of length L
T
, and a sequence of j-zero patterns
precedes before the (N-j)-secret bit patterns {q
j(j+1)
(h),
q
j(j+2)
(h), ---, q
jN
(h)}. By XORing CP constructors and TV
constructors of individual serves, concealing pattern X(h)
and tag vector Q(h) are calculated as X(h) =
x
1
(h)x
2
(h)---x
N
(h) and Q(h)={0, q
12
(h),
q
13
(h)q
23
(h), ---, q
1N
(h)q
2N
(h)---q
(N-1)N
(h)}. Here,
the length of bit pattern x
j
(h) is equal to the message frame
length L
M
as mentioned before, and the last server T
N
does
not generate its TV constructor.
CPs and TVs above are generated as follows. Provided
that T
1
(k) in the 1st group of the CP generator corresponds
to T
k*
in the anonymous channel, i.e. T
1
(1) = T
1*
, T
1
(2) =
T
2*
, ---, and T
1
(N
1
) = T
N1*
, firstly, sender S sends a set of
its secret private vectors (PVs) {P
1
(h), P
2
(h), ---, P
N1
(h)}
as a request for a CP to servers T
1*
, T
2*
, ---, T
N1*
,
respectively, as shown in Fig. 3 (a). Here, each P
j
(h)
is
vector {p
j0
(h), p
j1
(h), ---p
jN
(h)} and except p
j0
(h), p
jk
(h)
is a
bit pattern of the same length as element q
jk
(h) in TV
constructor q
j
(h). Bit pattern p
j0
(h) has the same length as
CP constructor x
j
(h).
Then, T
1*
that receives the request with P
1
(h), generates
its CP constructor x
1*
(h)
and TV constructor q
1*
(h) = {0,
---, 0, q
1*(1*+1)
(h), q
1*(1*+2)
(h),
---, q
1*N
(h)}. It also
generates ID
1*
(x
1*
(h), q
1*
(h)) as an address of CP and TV
constructor pair (x
1*
(h), q
1*
(h)). Here, T
1*
maintains its CP
table, a list of CP and TV constructors that it had
generated, and ID
1*
(x
1*
(h), q
1*
(h)) represents the address
of the constructor pair {x
1*
(h), q
1*
(h)} in the table. Also,
the length of each bit pattern q
jk
(h) in TV constructor
q
j
(h) is set as L
T
, the length of ID
j
(x
j
(h), q
j
(h)).
Then, X(1, h) and Q(1, h),
the h-th CP and TV that the
1st group generates, are constructed by 1st server T
1
*
as
X(1, h) = p
10
(h)x
1*
(h) and Q(1, h) = {p
11
(h), p
12
(h), ---,
p
11*
(h)ID
1*
(x
1*
(h), q
1*
(h)), p
1(1*+1)
(h)q
1*(1*+1)
(h),
p
1(1*+2)
(h)q
1*(1*+2)
(h),
---, p
1N
(h)q
1*N
(h)}, respectively.
X(1, h) and Q(1, h) are then forwarded to T
2*
. However,
to protect them from eavesdropping, they are encrypted
by the secret key k
1*
that is shared between T
1*
and T
2*
,
i.e. X(1, h) and Q(1, h) are sent to T
2*
in the form E(k
1*
,
X(1, h), Q(1, h)), where, E(k
1*
, x) represents x encrypted
by key k
1*
. It is also possible that T
1*
encrypts X(1, h) and
Q(1, h) by using a public key of T
2*
, however to decrease
encryption overheads, a symmetric key encryption
function is adopted here.
T
2*
that receives E(k
1*
, {X(1, h), Q(1, h)}) decrypts it
to {X(1, h), Q(1, h)}, and generates its CP constructor
x
2*
(h) to modify X(1, h) to X(1, h)
=
p
10
(h)p
20
(h)x
1*
(h)x
2*
(h). T
2*
also generates TV
constructor q
2*
(h) = (0, ---0, q
2*(2*+1)
(h), q
2*(2*+2)
(h), ---,
q
2*N
(h)) to modify Q(1, h) to {p
11
(h)p
21
(h),
p
12
(h)p
22
(h), ---, p
11*
(h)p
21*
(h)ID
1*
(x
1*
(h), q
1*
(h)),
p
1(1*+1)
(h)p
2(1*+1)
(h)q
1*(1*+1)
(h), ---, p
12*
(h)p
22*
(h)
q
1*2*
(h)ID
2*
(x
2*
(h), q
2*
(h)), p
1(2*+1)
(h)p
2(2*+1)
(h)
q
1*(2*+1)
(h)q
2*(2*+1)
(h), ---,p
1N
(h)p
2N
(h)q
1*N
(h)
q
2*N
(h)}.
Here as same as T
1*
, T
2*
also maintains its CP table,
and ID
2*
(x
2*
(h), q
2*
(h)) represents the address where
tag part message part
(MsX(h)) Q1(h) Q2(h) QN(h)
1536 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
{x
2*
(h), q
2*
(h)} is located in it. Also, it is not necessary
but to simplify the descriptions, it is assumed that servers
in the anonymous channel are arranged so that T
j
(g) is
placed at the earlier position in the anonymous channel
than T
j
(h) when g < h, for every j-th group. Then, X(1, h)
and Q(1, h) are sent to T
3*
while being encrypted by k
2*
, a
secret encryption key shared between T
2*
and T
3*
, and this
process continues until T
N1*
calculates X(1, h) and Q(1,
h). Therefore, X(1, h) and Q(1, h) = {Q
1
(1, h), Q
2
(1, h),
---, Q
N
(1, h)}, the CP and the TV pair generated by the
1st group becomes as shown in equations (1) (3).
(a) 1st group
(b) r-th group
Figure 3. Behaviour of the CP generator
X(1, h) = p
10
p
20
---p
(N1)0
x
1*
(h)
x
2*
(h)---x
(N1)*
(h)
(1)
for g* included in the 1st group
Q
g*
(1, h) = p
1g*
(h)p
2g*
(h)---p
(N1)g*
(h)
q
1*g*
(h)q
2*g*
(h)---q
(g-1)*g*
(h)
ID
g*
(x
g*
(h), q
g*
(h)), where q
0*g*
(h) = 0
(2)
for i not included in the 1st group,
Q
i
(1, h) = p
1i
(h)p
2i
(h)---p
(N1)i
(h)q
1*i
(h)
q
2*i
(h)---q
(gj*)i
(h), where g
j*
< i < g
(j+1)*
(3)
Severs in the r-th group (r > 1) behave in the same way
as the 1st group as shown in Fig. 3 (b), where server T
r
(k),
the k-th server in the r-th group, corresponds to T
k#
in the
anonymous channel. However, different from the 1st
group where senders generate PVs and sends them as a
request for a CP to severs T
1*
, T
2*
, ---, T
N1*
, servers T
1#
,
T
2#
, ---, T
Nr#
in the r-th group generate CP and TV pairs
spontaneously without requests from senders, also the last
server T
Nr#
in the r-th group generates group blinding
vector B(h) = {B
1
(h), B
2
(h), ---, B
Nr
(h)}. Then, the r-th
group calculates X(r, h) and Q(r, h) = {Q
1
(r, h), Q
2
(r, h),
---, Q
N
(r, h)} as its h-th CP and TV values as shown in
equations (4) (6). In the equations, the j-th element
B
j
(h) of B(h) = {B
1
(h), B
2
(h), ---, B
Nr
(h)} is a vector of
patterns {b
j0
(h), b
j1
(h), ---, b
jN
(h)}, where the length of
b
j0
(h) is L
M
and the length of b
jk
(h) is L
T
for each k.
X(r, h) = b
10
b
20
---b
(Nr)0
x
1#
(h)x
2#
(h)
---x
Nr#
(h) (4)
for g# included in the r-th group
Q
g#
(r, h) = b
1g#
(h)b
2g#
(h)---b
(Nr)g#
(h)q
1#g#
(h)
q
2#g#
(h)---q
(g-1)#g#
(h)ID
g#
(x
g#
(h), q
g#
(h)),
where q
0#g#
(h) = 0 (5)
for i not included in the r-th group,
Q
i
(r, h) = b
1i
(h)b
2i
(h)---b
(Nr)i
(h)q
1#i
(h)
q
2#i
(h)---q
(gj#)i
(h), where g
j#
< i < g
(j+1)#
(6)
After calculating X(r, h) and Q(r, h) as equations (4)
(6), T
Nr#
removes group blinding vector B(h) by XORing
them by B(h). Namely, they are transformed as shown in
equations (7) (9).
X(r, h) = x
1#
(h)x
2#
(h)---x
Nr#
(h) (7)
for g# included in the r-th group
Q
g#
(r, h) = q
1#g#
(h)q
2#g#
(h)---q
(g-1)#g#
(h)
ID
g#
(x
g#
(h), q
g#
(h)), where q
0#g#
(h) = 0 (8)
for i not included in the r-th group,
Q
i
(r, h) = q
1#i
(h)q
2#i
(h)---q
(gj#)i
(h),
where g
j#
< i < g
(j+1)#
(9)
The last server T
r
(N
r
) = T
Nr#
in the r-th group also
receives X(r+1, h) and Q(r+1, h), the CP and TV values
generated by the (r+1)-th group, from T
r+1
(N
r+1
), the last
server in the (r+1)-th group, and it calculates X(r,
h)X(r+1, h), and Q(r, h)Q(r+1, h) to combine CPs and
Step-0
P3(h)
Step-(N1+1)
(12) (13)
Step-1
E(k1*, {X(1, h), Q(1, h)})
anonymous
channel sender
T1(1)
T1*
T1(3)
T3*
T1(2)
T2*
T1(N1)
TN1*
From
T2(N2)
Step-2
E(k2*, {X(1, h), Q(1, h)})
Step-0
P1(h)
Step-0
P2(h)
Step-N1
(10) (11)
Step-0
PN1(h)
Tr+1(Nr+1)
Tr-1(Nr-1
)
Step-0
B3(h)
Step-0
B2(h)
Step-0
B1(h)
Tr(Nr)
TNr#
X(r+1, h), Q(r+1, h)
Step-Nr
X(r, h), Q(r, h)
Step-1
E(k1#, {X(r, h), Q(r, h)})
Step-2
E(k2#, {X(r, h), Q(r, h)})
Tr(1)
T1#
Tr(2)
T2#
Tr(3)
T3#
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1537
2011 ACADEMY PUBLISHER
TVs generated by the r-th and the (r+1)-th groups into the
single CP and TV, respectively. Then, T
r
(N
r
) waits for the
arrivals of predefined number of CP and TV pairs, and
shuffles them to sends the results to the last server
T
r-1
(N
r-1
) of the (r-1)-th group. As the result of the
behaviors of all groups, the last server of the 1st group,
i.e. T
1
(N
1
), generates the CP and TV as equations (10)
and (11).
X(h) = p
10
(h)p
20
(h)---p
(N1)0
(h)x
1
(h)
x
2
(h)---x
N
(h)
(10)
Q
g
(h) = p
1g
(h)---p
(N1)g
(h)q
1g
(h)---
q
(g-1)g
(h)ID
g
(x
g
(h), q
g
(h)), where q
0g
(h) = 0 (11)
Then, T
1
(N
1
) sends X(h) and Q(h) = {Q
1
(h), Q
2
(h), ---,
Q
N
(h)} to sender S, and S removes private vectors PVs
from X(h) and Q(h) by XORing them by PVs. As the
result, finally CP and TV values become as (12) and (13).
X(h) = x
1
(h)x
2
(h)---x
(N-1)
(h)x
N
(h)
(12)
Q
g
(h)
= q
1g
(h)---q
(g-1)g
(h)ID
g
(x
g
(h), q
g
(h)),
where q
0g
(h) = 0
(13)
It must be noted that because PVs and group blinding
vectors are secrets of sender S and last server of each
group (except the 1st group), respectively, and each
server T
j
does not disclose x
j
(h) or q
j
(h) to others, any
server cannot know CP or TV constructors of other
servers. No server can know X(h) or Q(h) either unless all
servers conspire with each other.
C. Behavior of the Anonymous Channel
Fig. 4 shows the behavior of the anonymous channel.
Firstly, sender S encrypts its message M
S
by XORing it
by concealing pattern X(h) that it had acquired from
T
1
(N
1
). S also attaches tag vector Q(h) = {Q
1
(h), Q
2
(h),
---, Q
N
(h)} corresponding to X(h), to the message, and
sends {M
S
= x
1
(h)x
2
(h)---x
N
(h)M
S
, Q
1
(h), Q
2
(h),
---, Q
N
(h)} to the 1st server T
1
in the anonymous channel.
Here, Q
1
(h) has the form ID
1
(x
1
(h), q
1
(h)).
Then, T
1
that receives {x
1
(h)x
2
(h)---x
N
(h)M
S
,
Q
1
(h), Q
2
(h), ---, Q
N
(h)} retrieves CP constructor x
1
(h)
and TV constructor q
1
(h) from its CP table based on
ID
1
(x
1
(h), q
1
(h)) in Q
1
(h), calculates XOR of x
1
(h) and
M
S
, and q
1j
(h) and Q
j
(h) for each j as new values of M
S
and Q
j
(h). Therefore, M
S
and Q
j
(h) become M
S
=
x
1
(h)(x
1
(h)x
2
(h)---x
N
(h)M
S
) =
x
2
(h)x
3
(h)---x
N
(h)M
S
and Q
j
(h) =
q
1j
(h)(q
1j
(h)q
2j
(h)---q
(j-1)j
(h)ID
j
(x
j
(h), q
j
(h))) =
q
2j
(h)q
3j
(h)---q
(j-1)j
(h)ID
j
(x
j
(h), q
j
(h)). After that, T
1
removes Q
1
(h) from the tag part, waits for the predefined
number of message arrivals, and shuffles them to forward
each result to server T
2
.
All servers in the anonymous channel perform in the
same way, i.e. each T
j
converts its incoming message to
{M
S
= x
j+1
(h)x
j+2
(h)---x
N
(h)M
S
, Q
j+1
(h), Q
j+2
(h), ---,
Q
N
(h)}, where Q
g
(h) = q
(j+1)g
(h)---q
(g-1)g
(h)ID
g
(x
g
(h),
q
g
(h)). Consequently, when T
N
, the last server in the
anonymous channel, completes its operations on the
message, the message is converted into M
S
, and T
N
can
deliver M
S
to its receiver while extracting the address of
the receiver from M
S
.
The anonymous channel together with the CP
generator protects identities of message senders from
various threats as follows. Firstly, each server T
j
transforms the message part while XORing it by CP
constructor x
j
(h) which is not known to other servers and
also T
j
assigns different values as CP constructors for
encrypting different messages. Therefore no one
including other server T
i
can identify the input and output
pair of T
j
that corresponding to M
S
by comparing message
parts of T
j
s receiving and forwarding messages. For T
i
, 2
input and output pairs of T
j
, e. g.
{x
j
(h)x
j+1
(h)---x
N
(h)M
S
,
x
j+1
(h
1
)---x
N
(h
1
)M
1
}
and {x
j
(h)x
j+1
(h)---x
N
(h)M
S
, x
j+1
(h
2
)---x
N
(h
2
)
M
2
}, have equal possibilities that they are encrypted
form pairs of M
S
. As a consequence, it is impossible for
entities including servers to identify the sender of
message M
S
by tracing the message parts of messages
unless all servers conspire.
Any entity cannot trace M
S
by examining the tag parts
of messages either. Because each T
j
generates different
secret TV constructors for different messages and assigns
different bit patterns to individual elements {q
j(j+1)
(h), ---,
q
jN
(h)} in TV constructor q
j
(h), it is impossible for other
entities to identify links between incoming messages of T
j
and its outgoing messages by examining pattern
transitions in individual tags made by T
j
. Namely,
individual tags change their forms within T
j
in different
ways, and entities except T
j
cannot extract any relation
between transitions of different tags in the tag part to
identify input and output pairs of same messages.
Also, although, each server T
j*
in the 1st group in the
CP generator can know the senders of encrypted
messages from their CP and TV constructors, because T
j*
generates them at requests of the senders, when T
j*
is
placed at the earlier position of the anonymous channel,
its tags disappear in the later positions, i.e. the tag parts of
messages that are received by servers at later positions of
the anonymous channel do not include tags of any server
in the 1st group, therefore even if T
j*
conspires with
servers at the later positions, it is not possible to identify
senders.
Figure 4. Behavior of the anonymous channel
T1 T2 TN T(N-1)
sender
S
receiver
R
tag part message part
x1(h)x2(h)---xN(h)MS Q1(h) Q2(h)
QN(h)
xN(h)MS, QN(h)
MS
x2(h)---xN(h)MS Q2(h)
QN(h)
1538 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
V. REPLIES TO ANONYMOUS SENDERS
Different from other existing mechanisms [2, 7], in
ESEBM, individual servers can handle reply messages to
anonymous senders without any additional mechanism.
This means that servers cannot decide even whether a
message is the reply or not. Sender S can receive reply
messages as follows. Firstly, S obtains 2 CP and TV pairs
{X(h
1
), Q(h
1
)}, {X(h
2
), Q(h
2
)}, and constructs its message
while attaching tag vector Q(h
2
) and its encrypted address
A
S
to its sending message M
S
as shown in Fig. 5 (a).
Namely, S constructs M
S
Q(h
2
)(X
U
(h
2
)A
S
),
concatenation of M
S
, Q(h
2
), and X
U
(h
2
)A
S
. Where bit
strings X
U
(h
2
) and X
L
(h
2
) are upper and lower parts of bit
string X(h
2
), in other words, X(h
2
) = X
U
(h
2
)X
L
(h
2
). Also
it is assumed that message M
S
includes its destination
address at its left most bit positions.
After that, S encrypts M
S
Q(h
2
)X
U
(h
2
)A
S
to
X(h
1
)(M
S
Q(h
2
)X
U
(h
2
)A
S
), and sends
{X(h
1
)(M
S
Q(h
2
)X
U
(h
2
)A
S
), Q
1
(h
1
), Q
2
(h
1
), ---,
Q
N
(h
1
)} to the 1st server T
1
in the anonymous channel.
Then, T
1
decrypts it by x
1
(h
1
), CP constructor of T
1
. As a
result, the message becomes
{x
1
(h
1
)X(h
1
)(M
S
Q(h
2
)X
U
(h
2
)A
S
), Q
2
(h
1
), ---,
Q
N
(h
1
)} = {x
2
(h
1
)---x
N
(h
1
)(M
S
Q(h
2
)X
U
(h
2
)A
S
),
Q
2
(h
1
),---, Q
N
(h
1
)}. Each server T
j
in the anonymous
channel carries out the same procedure until receiver R
receives M
S
Q(h
2
)X
U
(h
2
)A
S
. Then R can extract
message M
S
, encrypted address X
U
(h
2
)A
S
of S and tag
vector Q(h
2
) to construct its reply message as
{(X
U
(h
2
)A
S
)M
R
, Q
1
(h
2
), ---, Q
N
(h
2
)} to be encrypted to
X(h
2
){X
U
(h
2
)A
S
M
R
} = {A
S
X
L
(h
2
)M
R
}, by the
anonymous channel as shown in Fig. 5 (b). Therefore, T
N
can deliver X
L
(h
2
)M
R
to S and finally S that knows
X
L
(h
2
) decrypts X
L
(h
2
)M
R
to X
L
(h
2
)X
L
(h
2
)M
R
= M
R
.
In the above, R receives M
S
Q(h
2
)X
U
(h
2
)A
S
, and it
cannot know A
S
because X
U
(h
2
) is known only to S. Also,
message X
U
(h
2
)A
S
M
R
sent by R is transformed to
A
S
X
L
(h
2
)M
R
in the anonymous channel, therefore, no
one except S can know that X
L
(h
2
)M
R
corresponds to
M
R
, and consequently even receiver R that knows M
R
cannot identify the original sender of M
S
. In this way,
servers in ESEBM can handle original and reply messages
totally in the same way, different from usual Mixnets
where each mixserver adds extra operations on reply
messages.
VI. EVALUATION OF ESEBM
A. Analysis of ESEBM Behavior
ESEBM satisfies the requirements for anonymous
networks listed in Sec. II as follows. Firstly as discussed
in Sec IV. C, no one except senders themselves can trace
messages from senders to receivers. Secondly, the
message reply mechanism discussed in Sec. V enables
receivers to send replies to senders of original messages
without knowing identities of the senders. Also by this
reply mechanism, senders can confirm the deliveries of
their messages. In addition the reply mechanism of
ESEBM does not require additional operations on reply
messages, therefore different from other existing
anonymous networks, servers cannot know even whether
their handling messages are replies or not.
(a) From sender to receiver
(b) From receiver to sender
Figure 5. Anonymous reply mechanism
(MSQ(h2)
XU(h2)AS)
CP
generator
{X(h1), Q(h1)},
{X(h2), Q(h2)}
x1(h1){X(h1)(MS
Q(h2) XU(h2)AS)}
T1 T2
TN T(N-1)
sender S
receiver R
tag part message part
X(h1)(MSQ(h2)
XU(h2)AS)
Q1(h1)
Q2(h1)
QN(h1)
Q2(h1)
QN(h1)
XL(h2)XL(h2)MR
MR
ASXL(h2)MR
T1 T2 TN T(N-1)
receiver R
sender S
tag part message part
XU(h2)ASMR Q1(h2)
Q2(h2)
QN(h2)
x1(h2){(XU(h2)ASMR)}
Q2(h2)
QN(h2)
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1539
2011 ACADEMY PUBLISHER
About the efficiency, the configuration of ESEBM
where senders must obtain CPs before sending their
individual messages is obviously a disadvantage, e. g.
message travelling times increase when durations required
for obtaining CPs are counted. However because senders
can obtain CPs as offline processes, actual message
traveling times can be suppressed at values comparable to
Mixnet. Also, when each server is configured by 2
independent CPUs, tasks for generating CPs and
forwarding messages can be assigned to different CPUs so
that the anonymous channel can forward messages
without being interrupted by tasks for the CP generator.
Then, despite of the disadvantages of the CP obtaining
process, ESEBM configuration enables anonymous
networks to adopt symmetric key encryption functions
that make ESEBM efficient enough as usual
non-anonymous networks to handle messages in practical
scale applications as demonstrated in the next subsection.
ESEBM configuration brings advantageous features not
only about the efficiency but also about security as
follows. Among various threats to networks, DOS attacks
[10], in which meaningless or spam messages are sent to
decrease availabilities of networks, and illegitimate
message forgeries (modifications), in which malicious
entities forge (modify) messages sent from anonymous
senders, are especially serious in anonymous networks.
Different from in usual networks where all entities that
send messages can be identified if costs and efforts are not
considered, in anonymous networks where identities of
senders are completely hidden, entities can behave
dishonestly more easily. In addition, about message
forgeries (modifications), in many cases receivers cannot
notice even if their receiving messages are forged
(modified) because their senders are anonymous.
The CP generator in ESEBM reduces the occurrence of
DOS attacks substantially and makes forged (modified)
messages detectable. Namely, senders must attach
consistent TVs to their messages to let servers transfer the
messages; however, the CP generator gives CPs and TVs
only to authorized entities. Therefore, unauthorized
entities must send their messages while attaching
nonregistered TVs, and servers in ESEBM that cannot
find CPs and TVs from their CP tables discard the
messages immediately, as the consequence, messages
from unauthorized entities do not decrease the availability
of the network. About the malicious message forgeries
(modification), provided that the malicious entity X does
not know the original message M, X cannot forge
(modify) encrypted M consistently because no one except
the sender of M knows the CP used for encrypting M, then
the receiver of M can notice the forgeries (modification)
because its receiving message is meaningless.
In the same way, ESEBM disables entities to carry out
traffic analysis attacks and replay attacks. A traffic
analysis attack is a way to identify the sender S of a
message M by sending multiple replies to it [7, 14].
Namely, when receiver R of M sends many replies at a
time or periodically to S, R can identify S by observing
entities that receives many messages at a time or
periodically. However, in ESEBM every message must
have different CPs and TVs, and this means that every
server discards CP and TV constructors in its CP table
once they are used. Therefore, provided that at least one of
the servers is honest, even when R sends multiple replies
only one of them is delivered to S, and R cannot identify
S. It must be noted that, it is also possible to enable
receivers to send up to predefined number of replies. If
each server T
j
maintains F(h), the number of messages
allowed to send by using tag vector Q(h), in its CP table in
addition to {x
j
(h), q
j
(h)}, T
j
does not invalidate {x
j
(h),
q
j
(h)} until it receives F(h)-messages attached by Q(h).
In a replay attack [11], an entity X identifies sender S
of message M by eavesdropping on the network to pick
M
*
, encrypted form of M, just sent from S, and putting M
*
to the network repeatedly. Then, because M is delivered to
the same receiver R many times, X can easily identify the
correspondence between S and M received by R.
Apparently ESEBM can disable replay attacks in the same
way as disabling traffic analysis attacks.
B. Message Processing Performance
Performance of ESEBM has been compared with that
of the usual non-anonymous networks and Mixnet each of
which consisted of multiple PCs that worked as relay
servers. Where individual PCs were equipped with
1.6GHz CPUs and 1GB of RAM and they were connected
by 100Mbits/sec Ethernet. Because delays of message
arrivals depend on the number of relay servers and the
time that individual servers must wait for shuffling
messages, only the throughput were compared while
changing the sizes of messages. For evaluating ESEBM,
16 tags each of which consisted of 64 bits were attached to
individual messages, therefore for ESEBM, the actual
length of a 10 Kbits message is 11 Kbits for example. For
Mixnet, RSA with 1K bits length key was adopted as the
encryption function. In real applications, a sender must
combine its message M with random secret numbers to
make the encryption function probabilistic. Also to
maintain strengths of encryption keys, different servers
must use different modulo arithmetic. However in this
evaluation, random bit strings were not attached to
messages, and all servers used the same modulo
arithmetic.
Table 1 shows the computation times required by each
server in non-anonymous network, ESEBM and Mixnet to
transfer different sizes of messages, and Fig. 6 graphically
represents them. For example, while ESEBM needs less
than 6 seconds to transfer a 20Mbits message, Mixnet
needs more than 3 minutes to transfer the same message.
Fig. 7 shows the volume of messages that usual
non-anonymous networks, ESEBM and Mixnet can send
within 1 second. These results show that, although the
throughput of ESEBM is 1/4.4 of that of non-anonymous
networks, it is more than 36 times higher than that of
Mixnet. According to statistics [16], e-mail message size
is 59KB on average, therefore, even in the environments
used for evaluations, ESEBM can handle 7 clients at a
time that send usual e-mail messages while the
non-anonymous network can handle 33 clients at a time.
On the other hand, Mixnet can handle only 0.2 clients.
The beneficial thing is that, when multiple processors are
1540 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
available, volume of messages can be processed almost in
parallel. Therefore, ESEBM can transfer the same volume
of messages as usual non-anonymous networks do when
each server is constituted by multiple processors and
memories with 4.4 times of costs. Here, although it
depends on individual applications, value 4.4 can be
considered acceptable. On the other hand, to improve the
performance of Mixnet as non-anonymous networks, 158
times of costs are necessary. Namely, ESEBM can be
used for large scale networks, in which number of clients
exchange usual sizes of messages at less extra costs.
TABLE I. COMPUTATION TIME FOR TRANSFERRING DIFFERENT SIZES
OF MESSAGES
Message size
Mbits
Non-anonymous
(msec)
ESEBM
(msec)
Mixnet
(msec)
10 625 2780 105255
20 1230 5510 207440
30 1924 8556 310986
40 2520 11225 412679
50 3125 14127 528276
60 3745 17342 ---
70 4325 19710 ---
80 4995 22862 ---
90 5643 25595 ---
100 6246 28344 ---
Figure 6. Comparison of computation time for transferring
different sizes of messages
Figure 7. Comparison of throughputs for transferring
different sizes of messages
About the breakdown of message processing time of
each server in ESEBM, it consists of shuffling (31%),
message decryption (26%), and others (43%). On the
other hand, message processing time of each server in
Mixnet consists of shuffling (0.8%), message decryption
(98.6%), and others (0.6%). As shown above, different
from Mixnet in which message decryptions require 123
times of message shuffling time, in ESEBM, message
decryptions require less than 0.84 times of the shuffling
time. When the fact that both ESEBM and Mixnet shuffle
same number of messages is considered, this means that
message decryption process in Mixnet degrades its overall
performance seriously. In other words, symmetric key
encryption functions used in ESEBM had successfully
reduced decryption times. Namely, while RSA used in
Mixnet requires the number of multiplications that is
proportional to log
2
(n), onetime pad used in ESEBM
requires only a single XOR operation, where n is the size
of encryption keys.
SEBM also uses symmetric key encryption functions
[13], and as ESEBM, it can achieve the higher throughput
than other anonymous networks such as Mixnet. However,
when compared with ESEBM, in SEBM, more servers
must be involved in forwarding messages, because it
consists of encryption and decryption servers. Therefore,
message traveling times in SEBM become longer than that
of ESEBM, i.e. different from in ESEBM where messages
are encrypted by their senders, in SEBM, they are
encrypted by a sequence of encryption servers. As other
advantages of ESEBM over SEBM, ESEBM works more
stably because all servers in ESEBM are permanent
servers different from SEBM where senders are included
as servers. Also a mechanism for reply messages is not
straightforward in SEBM.
VII. CONCLUSION
Enhanced symmetric key encryption based Mixnet has
been proposed that removes the drawbacks of many
existing anonymous networks such as Mixnet, DC-net, etc.
It satisfies all the requirements of anonymous networks.
Most importantly, while being supported by concealing
patterns, those requirements are satisfied in a simple and
efficient way. Unlike complicated Mixnet based systems,
the simplified computational requirements of individual
entities make the scheme practical and scalable.
As a drawback of ESEBM, a sender must acquire a
concealing pattern from the CP generator in advance to
send its every message as an offline process. However
because of ESEBM configuration, i.e. by dividing the
network into the CP generator (off-line) and the
anonymous channel (on-line) parts, every time-consuming
task is removed from the anonymous channel part and
highly efficient communication becomes possible.
Moreover, concealing patterns enable receivers not only to
send replies to the original anonymous message senders but
also to receive messages without disclosing their identities.
Namely, when concealing patterns are publicly disclosed
with the receivers interests, the receivers can receive
messages from senders without disclosing their identities.
As a future work, mechanisms that enhance the
0
100
200
300
400
500
600
10 20 30 40 50 60 70 80 90 100
T
i
m
e
(
s
e
c
)
Message size (Mb)
Non-anonymous
ESEBM
Mixnet
0
2
4
6
8
10
12
14
16
18
10 20 30 40 50 60 70 80 90 100
T
h
r
o
u
g
h
p
u
t
(
M
b
/
s
e
c
)
Message size (Mb)
Non-anonymous
ESEBM
Mix net Mixnet
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1541
2011 ACADEMY PUBLISHER
reliability of ESEBM are necessary. When senders or
receivers claim that some server is dishonest, ESEBM
must prove all servers are honest or detect dishonest
servers if exist. Also, ESEBM must continue its
operations even some of servers are out of their services.
REFERENCES
[1] D. Chaum, Untraceable electronic mail, return address and
digital pseudonyms, Communications of the ACM, vol. 24,
no. 2, pp. 84-88, 1981.
[2] D. Chaum, The dining cryptographers problem:
unconditional sender and recipient untraceability, Journal
of Cryptology, vol. 1, pp. 65-75, 1988.
[3] M. G. Reed, P. F. Syverson and D. M. Goldschlag,
Anonymous connections and onion routing, Selected
Areas in Communications, vol. 16, no. 4, pp. 482-494, May
1998.
[4] M. K. Reiter and A. D. Rubin, Crowds: anonymity for Web
transactions, ACM Transactions on Information and
System Security, vol. 1, no. 1, pp. 66-92, Nov 1998.
[5] R. Ingledine, M. J. Freedman, D. Hopwood and D. Molnar,
A reputation system to increase MIX-Net
reliability, Proc. of the 4th international Workshop on
information Hiding. I. S. Moskowitz, Ed. Lecture Notes In
Computer Science, Springer-Verlag, vol. 2137, London, pp.
126-141, April 2001.
[6] A. Beimel and S. Dolev, Buses for anonymous message
delivery, Proc. of the Second International Conference on
FUN with Algorithms, Elba, Italy, pp. 1-13, May 2001.
[7] P. Golle and M. Jakobsson, Reusable anonymous return
channels, Proc. of the 2003 ACM Workshop on Privacy in
the Electronic Society, (Washington, DC), WPES '03,
ACM, New York, NY, pp. 94-100, 2003.
[8] R. Dingledine and N. Mathewson, Tor: The
second-generation onion router, Proc. of the 13th USENIX
Security Symposium, San Diego, CA, USA, pp. 303-320,
August 2004.
[9] P. Golle, M. Jakobsson, A. Juels and P.
Syverson, Universal re-encryption for Mixnets, RSA
Conference Cryptographers' Track '04, Springer-Verlag,
pp. 163-178, 2004.
[10] T. Znati, J. Amadei, D. R. Pazehoski and S. Sweeny, On
the design and performance of an adaptive, global Strategy
for detecting and mitigating distributed DOS attacks in
GRID and collaborative workflow environments,
Simulation, vol. 83, pp. 291-303, March 2007.
[11] S. Y. Kang, J. S. Park and I. Y. Lee, A study on
authentication protocol in offering identification
synchronization and position detection in RFID system,
Proc. of The 2007 International Conference on Intelligent
Pervasive Computing (IPC 2007), pp. 150-154, 2007.
[12] X. Wang and J. Luo, A collaboration scheme for making
peer-to-peer anonymous routing resilient, Computer
Supported Cooperative Work in Design, 2008, CSCWD
2008, pp. 70-75, April 2008.
[13] S. Tamura, K. Kouro, M. Sasatani, K. M. Alam and H. A.
Haddad, An information system platform for anonymous
product recycling, Journal of Software, vol. 3, no. 6, pp.
46-56, 2008.
[14] L. Li, S. Fu and X. Che, Active attacks on reputable Mix
Networks, ispa, 2009 IEEE International Symposium on
Parallel and Distributed Processing with Applications, pp.
447-450, 2009.
[15] H. Haddad, H. Tsurugi and S. Tamura, A mechanism for
enhanced symmetric key encryption based Mixnet, SMC
2009 IEEE International Conference on Systems, Man and
Cybernetics, San Antonio, TX, USA, pp. 45414546, 11-14
Oct 2009, doi: 10.1109/ ICSMC.2009.5346788.
[16] http://www.idc.com/
Hazim A. Haddad received the B.E.
degree in Computer science and
Engineering from Ittihad University, from
UAE (United Arab Emirates) in 2003,
M.S. degree in nuclear and safety
engineering, from the University of Fukui
in 2008. He is currently a doctor course
student of University of Fukui.
Shinsuke Tamura was born in Hyogo,
Japan on Jan. 16, 1948, and received the
B.S., M.S. and Dr. (Eng.) degrees in
Control Engineering from Osaka
University in 1970, 1972 and 1991,
respectively. During 1972 to 2001, he
worked for Toshiba Corporation. He is
currently a professor of Graduate School
of Engineering, University of Fukui,
Japan. Prof. Tamura is a member of IEEJ,
SICE and JSST.
Shuji Taniguchi received the B.E.
and Ph.D. degrees in electronics
engineering from University of Fukui,
Fukui, Japan, in 1973, 1996, respectively.
In 1973-1978, he was with the Hitachi co.
Ltd. He is currently an associate professor
of Graduate School of Engineering in
University of Fukui.
Tatsuro Yanase received the Dr.
(Eng.) degrees in Electric & Electronic
Engineering from Nagoya University in
1977. During 1967 to 1969 he worked for
Nippon Calculating Machine Corporation.
He is now an associate professor of
Graduate School of Engineering,
University of Fukui.
1542 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
Development of a Ubiquitous Industrial Data
Acquisition System for Rotogravure Printing
Press
Yuhuang Zheng
Department of Physics, Guangdong University of Education, Guangzhou, China
Email: [email protected]
AbstractThis paper describes a data acquisition system
developed to solute the problem of different data acquisition
modules communication in ubiquitous industrial
environment. The system can allow faster reconfiguration of
plant-floor networks as applications change. It can achieve
higher throughput, lower average message delay and less
average message dropping rate in wireless communication.
The development of a data acquisition system for
rotogravure printing press in ubiquitous industrial
environment also is reported. It illustrates that the system
can perform well in industrial application.
Index TermsWireless networks, Ubiquitous Computing,
Data Acquisition Module (DAM)
I. INTRODUCTION
Wireless networks have been under rapid development
during recent years. Types of technologies being
developed to wireless personal area network for short
range, point-to multi-point communications, such as
Bluetooth and ZigBee [1]. The application of wireless
technology for industrial communication and control
systems has the potential to provide major benefits in
terms of flexible installation and maintenance of field
devices and reduction in costs and problems due to wire
cabling [2].
Wireless communications from machine to machine
greatly enhance automation of an industrial system.
Ubiquitous industrial environment is coming and allows
the engineers to acquire and control the real-time data of
wireless networks of the factory at anytime anywhere [3].
A key issue currently limiting ubiquitous industrial
environment development involves compatibility among
components in industrial environment from different
suppliers, generally referred to as interoperability. Full
compatibility among components would also provide end
users with the flexibility to connect highly specialized,
high-end sensors with best-in-class wireless interface
devices [4].
Interoperability in ubiquitous industrial environment
means wireless communication protocol and the protocol
of monitoring and controlling industrial equipments are
interoperable. Interoperable wireless protocols are
making or have appeared by some international
organizations and alliances, such as ISO, WINA, ZigBee,
etc. Most industrial equipments have their special
monitoring and controlling protocols. Data Acquisition
Module (DAM) is the most important equipment in
industrial application, but different brands almost have
different inherent monitoring protocols. For example,
Advantech ADAM 4000 series support both ADAM
ASCII and MODBUS protocols. But different ADAM
4000 Modules have different command sets. It is
common for a factory to using different kinds of DAMs
in their product lines. How to make these different DAMs
can communicate with each other is a key problem.
This paper addresses the MPCS protocol to solute this
problem. MPCS is the abbreviation of MODBUS for
Producer/Consumer Services. MPCS protocol applies
ZigBee as wireless protocol among wireless nodes. The
core of MPCS is to use the MODBUS protocol without
polling to carry industrial equipments protocol.
MODBUS protocol is applied to an electronic controller
on the lingua franca. The most important is the protocol
also must be supported by typical DAMs. But most of the
industrial monitoring systems adopt fixed period polling
with less consideration about dynamic period in using
MODBUS. So in ubiquitous industrial environment, the
MODBUS protocol cannot satisfy the latency
requirement of wireless protocol and it cannot guarantee
the real-time monitoring of industrial environment
conditions. And the polling method of MODBUS adds
extra loads and burdens the wireless channel. The MPCS
protocol changes the polling mechanism of MODBUS
and the slave equipments can send the messages by
themselves periodically without receiving query
command from the master equipment.
MPCS protocol applies ZigBee as wireless protocol
among wireless nodes and MODBUS with sending
message periodically as industrial monitoring protocol.
Experiment shows that the combination of MPCS and
ZigBee is a good way to solute the interoperability in
ubiquitous industrial environment. MPCS has the
advantages in saving bandwidth and lightening servers'
Manuscript received January 1, 2011; revised June 1, 2011; accepted
July 1, 2011.
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1543
2011 ACADEMY PUBLISHER
doi:10.4304/jnw.6.11.1543-1548
Figure 1. System Architecture of System
(a) Paper web Unwinder
(b) Infeeding unit
(c) Blue color rotogravure printing unit
(d) Red color rotogravure printing unit
(e) Green color rotogravure printing unit
(f) Outfeeding unit
(g) Paper web Rewinder
Figure 2. Structure of Rotogravure Printing Press
load and enhances the real-time performance of industrial
wireless sensor networks.
This paper first introduces the data acquisition system
for rotogravure printing press in ubiquitous industrial
environment. Secondly ZigBee and ZigBee gateway
design are discussed. Thirdly it introduces MPCS
protocol in different DAMs. There is an experiment to
test MPCS protocol on rotogravure printing press
monitoring system in this section. Finally it is a
conclusion.
II. SYSTEM OVERVIEW
Wireless sensors provide the network with the ability
to reconfigure on the fly without being tied down by
signal cables. The goal of the system is to implement
such a network using DAMs connected by Zigbee
transceivers to a central computer that interfaces with a
database accessible. The three major components consist
of different kinds of DAMs, CC2530 Low Power
Transceivers for ZigBee, and the SCADA software
hosted on the central computer. A block diagram of the
high level design is provided in Fig. 1 below.
There are DAMs in this system, including three
modules of ADAM-4011(thermocouple input module)
and three modules of ADAM-4012(analog input module).
ADAM-4011 and ADAM-4012 use MODBUS protocol.
One ADAM-4011 module records dryer air temperature
of a color unit. One ADAM-4012 module logs tension of
a color unit. All DAMs collect data from the industrial
machinery and transmit them in MODBUS format to
ZigBee gateway in which data are processed. ZigBee
gateway packs data according to the ZigBee protocol, and
transmits them to via radio. Finally, data are transmitted
to ZigBee gateway of the central computer. At the central
computer, incoming data from ZigBee gateway of it are
received and processed by the SCADA software which is
developed in Kingview [5].
Rotogravure printing press consists of paper web
unwinder, infeeding unit, rotogravure printing units,
outfeeding unit, and paper web rewinder. In the
rotogravure printing process, a web from a continuous
roll is passed over the image surface of a revolving
gravure cylinder.
The printing images are formed by many tiny recesses
engraved into the surface of the gravure cylinder. The
cylinder is about one-fourth submerged in a fountain of
low- viscosity mixed ink. The mixed ink is picked up by
the cells on the revolving cylinder surface and is
continuously applied to the paper web. After impression
is made, the web travels through an enclosed heated air
dryer to evaporate the volatile solvent. The web is then
guided along a series of rollers to the next printing unit [8,
9]. Fig. 2 shows the structure of rotogravure printing
press in out application.
III. ZIGBEE
Wireless sensors provide the network with the ability
to reconfigure on the fly without being tied down by
signal cables. The goal of the system is to implement
such a network using DAMs connected by Zigbee
transceivers to a central computer that interfaces with a
database accessible. The three major components consist
of different kinds of DAMs, CC2530 Low Power
Transceivers for ZigBee, and the SCADA software
hosted on the central computer. A block diagram of the
high level design is provided in Fig. 1.
A. ZigBee in Industrial Environment
There are thousands of devices in a factory, such as,
DAM, HMI, IPC, smart sensor, and so on. ZigBee is
focused on control and automation. ZigBee standards
1544 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
CC2530 SP3220E
Central
Computer
TTL
MR25H40
512Kx8MRAM
SPI
Gateway(Coordinator)
Figure 3. A Coordinator Gateway
CC2530 SP485E
Data Acquisition
Modules
TTL
MR25H40
512Kx8MRAM
SPI
Rotogravure
Printing Press
check point
Gateway(Router )
Figure 4. A Router Gateway
have a characteristic of "three low" of low electricity
consumption (years cell life), low cost (less than $5) and
low data rate (250 Kb/s) [6-7]. ZigBee works with small
packet devices and supports a larger number of devices
and a longer range between devices than other
technologies. ZigBee devices can form mesh networks
connecting hundreds to thousands of devices together.
Devices use very little power and can operate on a cell
battery for many years. In timing critical applications,
such as industrial application, ZigBee is designed to
respond quickly. ZigBee is a good wireless technology in
industrial application.
B. ZigBee Gateway Designation
The ZigBee gateway is based on the CC2530 System-
on-Chip, which combines a RF transceiver with an
industry-standard enhanced 8051 MCU, in-system
programmable flash memory, 8-KB RAM, and other
powerful peripherals. The gateway which connects the
central computer operates as the coordinator and the
gateway that connects a DAM runs as the router in
ZigBee network. Because of large acquired data, these
gateways includes 512KB RAM as the data buffer. Fig. 3
is the architecture of coordinator gateway. This gateway
consists of CC2530, SP3220E devices, external RAM and
some interfaces. Fig. 4 is the architecture of router
gateway. This gateway consists of CC2530, SP485E
devices, external RAM.
Because a computer has a UART interface and the
interface usually is RS-232. But UART of CC2530 is
TTL, so RS232-TTL conversion is done with a SP3220E
chip. And a DAM has a UART interface which is RS-485,
so RS485-TTL conversion is done with a SP485E chip.
ZigBee gateway allows device containing UART to
communicate via radio with other devices. Each device
connects to ZigBee gateway. In this system, six ZigBee
gateways provide the radio communication link.
When the ZigBee gateway is used in an application, it
is assumed that a permanent power source will be
available at both ends of the wireless link. This means the
on-chip radio can always be active, eliminating the need
to synchronize the transmission/reception of data. The
link is designed to operate at up to 19200 baud.
The ZigBee gateway of PC must act as a PAN
Coordinator. The PAN Coordinator is responsible for
starting the network and allocating an address to the other
gateway, which acts as a Router. Fig. 5 is the program
flow chart of PAN Coordinator.
The ZigBee gateway of DAM acts as a router device.
The router device scans the radio channels, looking for
the PAN Coordinator. Once it has found the Coordinator,
it associates with it. Data transfer between radio and the
on-chip UART is identical to that described above. Data
received via the radio is output to the connected device
using the on-chip UART, and data received by the on-
chip UART from the device is transmitted over the radio.
This process is repeated every 20ms. Fig. 6 is the
program flow chart of Router.
IV. COMMUNICATION PROTOCOL
The communication protocol between the central
computer and DAMs is MPCS. MPCS is important to
have a protocol at the application layer that allows DAMs
to take advantage of producer/consumer services. Using
producer/consumer, the data "producer" which is a DAM,
puts the PAN Coordinator ID at the front of each packet.
The message is then sent out of the network and the
Coordinator screens the ID to determine if the data is for
its consumption. If so, the Coordinator becomes the
"consumer." As a result, multi-cast communication
happens naturally and efficiently in a producer/consumer
service.
MODBUS is designed for source/destination
communication or master/slave model. MPCS, however,
joins forces with producer/consumer technology to offer a
superior message-delivery mechanism of MODBUS.
MPCS supports all of the major communication
relationships of MODBUS. MPCS is a flexible protocol
and results in efficient use of bandwidth.
MODBUS-compliant devices are common in industrial
application. Users can achieve MODBUS by devices
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1545
2011 ACADEMY PUBLISHER
Figure 5. Coordinator Program
Figure 6. Router Program
internal standard, direct interface and external converter.
Internal standard means MODBUS is the basic protocol
that DAM uses for parsing messages, such as Link-Max
and Advantech DAMs. Some DAMs have their internal
standard protocols, but these DAMs also provide
MODBUS communication interface and MODBUS
communication instruction. For example, the Advantech
DAM family supports RS-485, direct interface to
MODBUS master-slave networks without an external
electrical interface converter. These DAMs are internal
support MODBUS.
In this data acquisition system for rotogravure printing
press, MODBUS is internal standard of Advantech DAM.
To verify the performance and interoperability of MPCS
protocol, we do a test of the system in this section. The
monitoring period of control center is 1 second. The
DAMs are distributed among 30 meters distance. Table 1
shows the result of the test. Six DAMs send 2000 packets
relatively and the central computer records these packets.
Basically, the test results are satisfied, and the MPCS
protocol are suitable in non-critical industrial
environment [10-13].
V. SUPERVISORY SOFTWARE DESIGN
Supervisory software is built on Kingview. Kingview
is a kind of HMI/SCADA software with abundant
functions. Kingview provides integrated development
TABLE I.
TEST RESULT
Data Acquisition
Modules
Transmit
Packages
Receive
Packages
Successful
Rate
1# 2000 1893 94.65%
2# 2000 1972 98.60%
3# 2000 1937 96.85%
4# 2000 1963 98.15%
5# 2000 1952 97.60%
6# 2000 1918 95.90%
1546 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
Figure 8. Program interface
Figure 9. Temperature and Tension Data
the Main
Interface
System
Administration
Module
Database
Module
Extension
Module
Communication Protocol
Device Management
Operation Interface
Logging Data
List Data
Print Forms
Figure 7. Temperature of Dryer Air
environment. The function of the software consists of
system administration module, database and extension
module, which are shown in Fig. 7.
A. System Administration Module
System administration module consists of
administrator logging in, password setting, user
administrating. The part of administrator logging in is to
restrict the operation of users. The part of code setting is
to provide all the users to modify their own code. The
part of which can only be operated by administrator is to
provide the detail information of users and give or repeal
the administration right to some users. According to
requirement analysis, the system should divide all the
users to three parts, and allow them to have different
operations. The administrator can administrate all the
users; browse all pictures and data in supervisory
software. They are the top tier. Engineers have similar
power of administrator except user administrating, so
they are the middle tier. Operators can only browse given
pictures on the industrial computer and they cannot get
the running data of rotogravure printing press. Fig. 8 is
the main program interface of this data acquisition system
on central computer [14].
B. Database Module
The database module is the core of the whole system,
in which the rule searching is carried out. Authorized
users can read real time data in dynamic report forms.
Meanwhile data will be recorded in database. Nobody can
modify information in the database. Authorized users can
browse all history data in static report forms. All report
forms can be printed. Fig. 9 shows some temperature data
of dryer air and tension data recorded by this system [15-
18].
C. Extension Module
The extension module makes the query result of
database can be displayed graphically in curves. User also
can print the curves and the query result. Fig. 10 is the
curves of some temperature data about dryer air.
VI. CONCLUSION
The design and implementation of a ubiquitous data
acquisition system for rotogravure printing press is
discussed and presented for industrial monitoring system.
Tests are carried out to determine system performance for
both the instrumentation and maintenance applications,
and as the results are quite satisfactory. The results show
the performance and interoperability for the wireless data
acquisition system is good enough for some monitoring
and non-critical instrument systems.
Further efforts are necessary to improve reliability of
sensor nodes, security, and standardization of interfaces
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1547
2011 ACADEMY PUBLISHER
Figure 10. Temperature Curves of Dryer Air
and interoperability. In addition, further studies are
necessary to improve the protocols functionality by
checking the impact of the mobility of sensor nodes.
REFERENCES
[1] W. Ning , Z. Naiqian, and W. Maohua, Wireless sensors
in agriculture and food industry - Recent development and
future perspective, Computers and Electronics in
Agriculture, vol. 50, pp. 1-14, January 2006
[2] B. Alvise, C. Luca, and S.V. Alberto, Platform-based
design of wireless sensor networks for industrial
applications, International Conference on Design,
Automation and Test in Europe. Munich, vol. 1, pp.4-10
March 2006
[3] L. K. Soon, W. W. N. Nu and E. M. Joo, Wireless sensor
networks for industrial environments, International
Conference on Computational Intelligence for Modeling,
Control and Automation. Vienna, vol. 2, pp.271-276,
November 2005
[4] Industrial Wireless Technology for the 21st Century,
www1.eere.energy.gov/industry/sensors_automation/pdfs/
wireless_technology.pdf
[5] X. Xueliang, T. Cheng and F. Xingyuan, A Health Care
System Based on PLC and ZigBee, International
Conference on Wireless Communications, Networking and
Mobile Computing. New York, pp.3063-3066, October
2007
[6] L. Zheng, ZigBee wireless sensor network in industrial
applications, International Joint Conference. Korea, pp.
1067-1070, October 2006
[7] G. Vehbi and H. Gerhard, Industrial wireless sensor
networks: Challenges, design principles, and technical
approaches, IEEE Trans. on Industrial Electronics, vol.
56, pp.4258-4265, October 2009
[8] Information on:
www.epa.gov/ttn/chief/ap42/ch04/final/c4s09-2.pdf
[9] Information
on:bbs.ca800.com/html/UploadFile/bbs/20080628/200806
28161526240.doc
[10] S. Sooyeon, K. Taekyoung, J. G. Yong, P. Youngman, and
R. Haekyu, An experimental study of hierarchical
intrusion detection for wireless industrial sensor networks,
IEEE Trans. on Industrial Informatics, vol. 6, pp.744-757,
November 2010
[11] G. Sumeet, V. Shekhar and A. R. Kumar. Intelligent
industrial data acquisition and energy monitoring using
wireless sensor networks, International Journal of Grid
and High Performance Computing, vol. 2 pp.44-59, July
2010
[12] C. Jiming, C. Xianghui, C. Peng, X. Yang and S. Youxian.
Distributed collaborative control for industrial automation
with wireless sensor and actuator networks, IEEE Trans.
on Industrial Electronics, vol. 57, pp.4219-4230,
December 2010
[13] U. Alphan, G. Ozgur, and O. Ahmet, Wireless model-
based predictive networked control system over
cooperative wireless network, IEEE Trans. on Industrial
Informatics, vol. 7, pp.41-51, February 2011
[14] G. M. Coates, K. M. Hopkinson, S.R. Graham and S. H.
Kurkowski, A trust system architecture for SCADA
network security, IEEE Trans. on Power Delivery, vol. 25,
pp.158-169, January 2010
[15] H.T. Snchez, P. Snchez and M. Estrems, SCADA
system improvement in PVC high frequency plastic
welding, International Journal of Advanced
Manufacturing Technology, vol. 40, pp 84-94, January
2009
[16] G. N. Korres and N. M. Manousakis, State estimation and
bad data processing for systems including PMU and
SCADA measurements, Electric Power Systems Research,
vol. 81, pp.1514-1524, July 2011
[17] A.Kusiak and Z. Zhang, Analysis of wind turbine
vibrations based on SCADA data, Journal of Solar
Energy Engineering, vol. 132, pp.0310081-03100812,
August 2010
[18] H. Lee, D.H. Yoon, G. Jang and J.K. Park, Study of the
design of data acquisition and analysis systems for multi-
purpose regional energy systems, Journal of Electrical
Engineering and Technology, vol. 5, pp.16-20, March 2010
Yuhuang Zheng received his B.S. and M.S. degree from the
Faculty of Automation, Guangdong University of Technology,
Guangzhou, China in 2002 and 2006
respectively. In 2009, he received his
Ph.D. from School of Mechanical &
Automotive Engineering, South
China University of Technology. His
main interests are industrial
automation, embedded system design,
and pervasive computing. He is a
lecturer at the Dept. of Physics,
Guangdong University of Education.
1548 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
An Application of the Modification of Slow Start
Algorithm in Campus Network
Guo-hong Gao
School of Information Engineer, Henan Institute of Science and Technology, Henan Xinxiang, 453003, CHINA
[email protected]
Wen-xian Xiao, Zhen Liu
Network Center, Henan Institute of Science and Technology , Henan Xinxiang, 453003, CHINA
[email protected]
Wen-long Wan
Foreign language department,Henan Institute of Science and Technology,Henan Xinxiang, 453003,China
e-mail:[email protected]
AbstractFor the problems existing in TCP/IP congestion
control method, a modified slow start algorithm is
introduced to the internal services of campus network to
make congestion control method more effective. Network
simulation results show that: the modified congestion
control algorithm can effectively improve the network
transmission efficiency in a specific network environment.
I ndex Termscongestion control;slow start;campus
network; simulation
I. INTRODUCTION
Congestion control is one of the key factors for
ensuring the stability and robustness of computer network.
With the expansion of network size, the continuous
increase of network bandwidth and the increasing
diversification of networking forms, congestion control
has encountered some new problems needing solving.
When the packet number which reaches the network is
greater than the processing capacity of network, network
performance would drop dramatically, and then the
congestion will inevitably happen. In order to avoid
congestion, people use congestion control algorithm in
the network to make it work properly. TCP congestion
control algorithms include such four basic algorithms as
slow start, congestion avoidance, fast retransmit and fast
recovery. Slow start and congestion avoidance algorithm
are the methods that the TCP data sender must follow
while sending windows[1,2].
In RFC2581 and RFC2001, the slow start algorithm of
TCP was described, and its pseudo code is described as
follows (in which, swin is the sender window, awin is
notification window for the recipient) :
swin = min (cwnd, awin)
cwnd = 1;
ssthresh = 65535bytes;
When the new data packet confirms that the ACK
arrives:
If (cwnd> = ssthresh)
/ Congestion avoidance /
cwnd = cwnd +1 / cwnd;
else
/ Slow start /
cwnd = cwnd +1;
Timeout:
ssthresh = max (2, min (cwnd / 2, awin));
cwnd = 1;
Re-enter the phase of slow start.
From the above description of congestion control,
algorithm has trouble in efficiency .Its way of using the
progressive increase to find out the right sending
bandwidth makes TCP connections unable to fully use
the available network bandwidth at the beginning of
connections[2,3,4]. For the connections which have small
transmission data and strong paroxysm(such as Web
streaming), the whole connection may have always been
in the slow-start state with a small sending window, thus
making the network transmission efficiency lower. In
addition ,in the process of slow start, whether the initial
window and slow start threshold selected is appropriate
also directly affects the transmission performance of the
network. A continual loss in the connection process will
cause that the slow-start threshold value decreases rapidly;
the system is under the control of small congestion
window with slow growth for a long time, especially
when the loss occurred in the initial window, the slow
start threshold value will be reduced to the size of two
data segments, the connection with the small amount of
data sent may have always been unable to get the right
bandwidth[5].
The packet loss is taken as a basis to judge congestion
in prevailing TCP congestion control .This approach is
successful in cable transmission network. The reliability
of network transmission is relatively low; the
transmission signals are susceptible to be interfered by
external factors. Packet loss is not necessarily caused by
network congestion. The network congestion detection
mechanism has also been challenged. In this case, the use
of traditional TCP algorithm with progressive increase
and reducing in times of multiplication may result in the
decrease in the utilization of network resources, while, in
the end to end TCP congestion control strategies, the
judge of congestion is determined by the feedback
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1549
2011 ACADEMY PUBLISHER
doi:10.4304/jnw.6.11.1549-1556
information at the receiving end, the data sender doesn't
get information about network congestion until
congestion occurs for at least 1/2 RTT, the program that
the sender slows down the sending speed to avoid
network congestion is implemented after congestion
occurs for a period of time , and most of the data packets
sent in the period from the occur of network congestion
to the relieve of the network congestion may be
discarded. If the data sender can detect congestion in a
timely manner, network congestion control mechanism
would be more effective, the utilization of network
resource will be higher, and the costs for data
transmission will be smaller. Timely detection of network
congestion, is also a problem that TCP congestion control
strategies should solve[6,7,8].
From the description of the slow start, the way using
the progressive increase to find out the right sending
bandwidth makes TCP connections unable to fully use
the available network bandwidth at the beginning of
connections. For the connections which have small
transmission data and strong paroxysm, the whole
connection may have always been in the slow-start state
with a small sending window, thus making the network
transmission efficiency lower. If continual packet losses
appear in the process of connection, the value of the
threshold of slow start will sharply reduce[9]. The system
will be under the control of small congestion window
with slow growth for a long time, so it may have always
been unable to get the right bandwidth. In connection
with the deficiency above, the modified strategies have
been put forward in the paper[10].
II. ALGORITHM MODIFICATION
The purpose of congestion control is to ensure the
security, stability and efficiency of network operation, but
because of the complexity of network congestion control,
there are some difficulties to ensure all the above
objectives simultaneously. For the network using TCP/IP
protocol, the congestion control is achieved by TCP
congestion control and IP layer congestion control; TCP's
congestion control algorithm runs on the nodes, whose
algorithm complexity has little influence on the overall
network efficiency; for the IP layer congestion control
algorithm, its congestion control is implemented by core
router. In order to ensure network efficiency, IP layer
congestion control algorithm must be efficient, simple. In
the IP layer congestion control algorithm, the time to
determine whether a packet should be discarded must be
much smaller than the time that the packet is sent,
otherwise, the algorithm has no practical application
value[11,12].
LOW etc proposed TCP/AQM dual model based on
the Theory of Optimization. It regards the existing TCP
congestion control and AQM algorithms in the network
layer as the solution of the distributed algorithm with the
appropriate utility function and the optimal rate allocation
problem; hence it can theoretically analyze network
performance in equilibrium, such as packet loss rate,
throughput, efficiency, delay and queue length. In the
whole congestion control mechanism, the rate of the
sending end and the congestion extent of router influence
each other. According to the congestion extent the router
feeds back, the sending end adjusts its sending rate; on
the other hand, the transmission rate of every sending end
would affect the degree of network congestion, thus
creating a closed loop congestion control system. The
main idea of the model is to think of the transmission rate
as the original variable, the congestion metric as the dual
variables[13]. The existing TCP/AQM algorithm can be
seen as the largest Lagrangian method which makes the
utilization rate of total resource largest, as is the basic
method of optimization theory.Assuming the network
contains L links, which are shared by S roots.
When the root S sends at the rate of
s
x
, its utility
function value is
( )
s s
U x
. Assuming that
s
U
is an
increasing function which is convex, continuous and
differentiable ,Link L adjusts the amount of the degree of
congestion according to stimulated roots passing through
it, while root S adjusts the sending rate according to the
amount of congestion extent of link in its transmission
path. It is written as
( ) ( ( ), )
s
x t x t s S =
( ) ( ( ), )
l
pt p t l L =
and then
( 1) ( ( ), ( ))
( 1) ( ( ), ( ))
x t F x t p t
p t G x t p t
+ =
+ =
(Equation 2.1)
Function
F
is senders congestion control algorithm,
such as TCP Vegas, TCP Reno, etc.; function
G
as a
router queue management, such as RED. This way, TCP /
AQM strategies can be described with a triple
function
( , , ) F G U
. For convenience, when the link
with the capacity of
C
is shared by
S
roots, the goal
whose transmission rate (i.e. the original variables) is the
congestion control is selected to make
0
max ( )
. .
s
s s
x
S
s
S
U x
st x C
(Equation 2.2)
Equation (2.2) becomes the original problem, and the
corresponding dual problem as the amount of congestion
extent selected (i.e., dual variables), so that
0 0
min ( ) max ( )
s
s s s
p x
S
D p U x x p pc
= +
(Equation 2.3)
Based on Kuhn-Tucker theorem in Optimization
Theory, the existence of non-negative value
( * , * ) x p
makes
* x
become the solution of the original problem,
* p
as the solution to the dual problem. Different
mechanisms should adopt the corresponding amount of
congestion extent, such as queuing delay based on TCP
Vegas, TCP Reno based on packet loss, RED queue
based on the length of queue, REM based on the price.
some common TCP/AQM strategies are given in
reference [14], such as Reno/RED, Reno/REM,
Reno/DropTail, Vegas/DropTail and the specific form of
triples of Vegas/REM
( , , ) F G U
, and their steady-
1550 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
nature is analyzed. Recently, PAGANINI, based on
above model, further analyzed the robustness and
stability of optimized congestion control by means of
Feedback Control Theory. KELLY also proposed
congestion control framework of another type based on
Optimization Theory, and use Lyapunov Stability Theory
to analyze the stability of congestion control system.
For each TCP connection, the sender maintains two
parameters, namely the congestion window and slow start
threshold. The congestion window is used to describe the
maximum amount of data the sender can send over the
network before receiving the confirmation message; the
slow start threshold is what the sending side uses to
determine whether slow start algorithm or congestion
avoidance algorithm will be adopted to control the data
transfer. The minimum value of congestion window
(CWND) and the receiver advertised window (RWND)
determines a maximum amount of data the sender can
transmit[15].
The pseudo codes of improved algorithm : (in which,
swin is the sender window, awin is notification window
for the recipient)
cwnd = IW;
swin = min (wnd, awin);
ssthresh = 65535bytes (default value);
Sender receives a packet ACK which is newly
confirmed:
if (cwnd <ssthresh)
{
/ Slow start begins /
cwnd = cwnd +1;
}
else
{
/ Congestion avoidance /
cwnd = cwnd +1 / cwnd;
}
when senderdoesn't receive packet ACK confirmed
because of timeout :
if (cwnd = IW)
ssthresh = ssthresh;
else
{
ssthresh = max (2, min (cwnd / 2, awin));
cwnd = IW;
}
This algorithm can still further improvements, making
the algorithm is more effective, the improved algorithm
in network timeout listed in this way processing .
Overtime not received confirmation bag when an ACK:
If (cwnd< threshold)
{
ssthresh=ssthresh;
}
else
{
ssthresh=max(2,min(cwnd/2,awin));
cwnd= IW;
}
The improvement of algorithm is mainly for handling
packet loss. In the improved algorithm, when packet loss
occurs, the value of the network's congestion window
will firstly be checked. If the value of the network's
congestion window is the initial size of congestion
control window, slow start threshold maintains the same,
so as to effectively prevent the premature entry into
congestion avoidance phase. Improved slow start
algorithm can reduce the impact of packet loss caused by
non-congestion factors on the congestion control
algorithm, which has a positive significance for the
application of the wireless network[16,17].
This algorithm could be further improved to make the
algorithm more efficient. In the improved algorithm,
network timeouts will be handled as follows:
when sender doesn't receive packet ACK confirmed
because of timeout :
If (cwnd <threshold)
{
ssthresh = ssthresh;
}
else
{
ssthresh = max (2, min (cwnd / 2, awin));
cwnd = IW;
During the initial stage of the modified slow start
algorithm, congestion control window is set as the initial
window. After the sender sends the data package, the
retransmission of packet does not change the slow start
threshold in the case of receiving no ACK confirmation
information in the RTO timeout set. Packet loss may
occur in the initial congestion window or a small
congestion window[18]. The modified congestion control
algorithms need to address the small window packet loss.
Modified congestion control algorithms do not affect the
whole network performance, and have some practical
value for specific network environment.
In accordance with the modified algorithm, if packet
loss occurs in the initial congestion window or small
congestion window during the TCP connection, slow start
threshold will not change, so congestion control will be
prevented from early entering congestion avoidance
phase, which has a protective effect on the connection
with a small amount of data sent, thereby enhancing the
system's transmission efficiency[19].
The above analysis shows that: the modified slow start
algorithm mainly deals with packet loss. When the
network packet loss occurs, the value of network
congestion should be firstly checked, if the value of
network congestion window is the initial congestion
control window size, slow start threshold should be
maintained the same, so as to effectively prevent the
premature entry into congestion avoidance phase. The
algorithm can reduce the impact of packet loss caused by
non-congestion on the congestion control algorithm[20].
Modified congestion control algorithm only changes the
sender instead of modifying the entire network or the
receiver protocols, so it does not affect network
performance, and has a practical value for the application
in specific network environment.
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1551
2011 ACADEMY PUBLISHER
III. APPLICATION OF MODIFIED ALGORITHM IN CAMPUS
NETWORK
Campus network is a common local area network
applied within schools, with the characteristics of limited
connectivity, simple topology and high transmission
bandwidth. In campus network, the connection of internal
nodes with Education Network or Internet is usually by
means of the public exports rented. Compared with high
link bandwidth within the campus network, the export
bandwidth of the campus network is usually small, and
often forms a bottleneck in the export areas. Campus
network, in its interior, is usually connected through the
switch of the core, aggregation and access layer, and is
divided into different segments according to different
geographical locations. VLAN approach is generally
adopted in the management of the user within the
network; the users in the same VLAN are with the same
desktop connection bandwidth, typically 100Mbps[21].
The topologies of campus network are roughly similar,
basically based on the three-tier exchange technology;
Figure 1 is the network topology of Henan Institute of
Technology, which is a typical campus network
connection topology, within the campus network, the core
switch is connected with aggregation switches with
Gigabit; aggregation switches are connected with
switches with Gigabit; switches access to the users with
Fast; the school's total export bandwidth as 100M is
uplinked to Henan Normal University. All the campus
networks are roughly the same, typical of the tree.
According to different objects, there are two campus
network services: internal services and external services.
Internal services provide services to internal users
through the connection of core switch with the backbone
network, and provide the campus network users with
WEB, FTP, VOD, MAIL, and other management systems
services; for example, all the servers of Henan Institute of
Technology used HPDL850, which provides dual-gigabit
to core switch (DB10808) for the above services. Within
the school, in terms of all net users, this bandwidth is
adequate, and network congestion will not occur because
of bandwidth problems in internal services; but the
bandwidth at the exit of Henan Normal University is
100M, easy to form a bottleneck here and easy to cause
obstruction here in terms of foreign service of the campus
network.
CERNET CERNET
CHINANET CHINANET
Si Si
Figure 1. Campus network topology structure
Within the campus network, the terminals in the same
VLAN can share network congestion information, so a
node can be designed to represent the connection of an
entire VLAN. Specific topology is described as follows:
node n1 stands for connection node of the internal server;
node n2 stands for source node of fixed rate stream; node
n3 is used to replace multiple network segments within
the campus network. When TCP connections of node n3
are greater than a certain number, it can be assumed that
the total TCP connection bandwidth is close to a stable
value, and network connection between node n2 and n3 is
similar to the fixed flow rate (CBR)[22]. It can also be
assumed that this kind of connection represents campus
network exit bandwidth. Since the fixed rate of flow has a
certain flexibility in the competition of bandwidth with
the TCP connection, so as to simulate the actual network
situation better, that is why the connection between node
n2 and n3 uses fixed-rate flow instead of limiting
bandwidth directly; Designing another two network
connection nodes n4 and n5, the connection between
node n1 and n5 uses the classical TCP congestion control
to transmit data while the connection between node n1
and n4 uses the modified TCP (it is called TCPLAN in
the later sections of the thesis) congestion control
algorithm for data transmission. Network congestion
occurs in the link from R0 to R1.
When network congestion occurs, applied control
strategies use modified slow start. Within the campus
network, small network transfer rate will drop the overall
transmission efficiency; using modified slow start
algorithm, we do not change the slow start threshold, so
that all TCP connections can grow at a smooth rate, thus
we can take advantage of the internal resources of
network.
IV. SIMULATION
NS consists of two basic components, one is the
extended and object-oriented TCL interpreter, and the
other is the NS simulation database. The simulation
database contains event schedules, a variety of simulated
network entity object and the modules related with the
network settings. Ordinary users can general use NS by
the script description language, the specific process is as
follows: First, set the network topology with NS. In the
process of setting the topology, can be achieved using
different network objects, and network objects function
can be obtained through the object database; Then, set the
data source and data receiver object; Finally, tell data
sources what time to begin data transmission and what
time to end it through event schedules. In NS, the user
can add a new network object. Although the user can
write themselves a new object, it is more accepted to
deserve a new object from other libraries. Network flow
simulation is a very complex task, but the use of NS
software makes all the work become very simple[23].
Simulation of network flow using NS contains the
following sections:(1)Programming: Using OTCL
language programming: mainly including the creation of
classes and objects, topology design, event design. As to
the simulation of the underlying network protocol, the
classes and objects existed in library can be used, while
as for the design of the new agreement, new classes and
objects must be generated by means of designing and
1552 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
inheriting. (2) Simulation: to simulate using the NS, just
enter ns file name in the appropriate path. (3) Analysis of
the result: Currently, there are a number of ways to
analyse the results of NS simulation: to simulate using
animation, generating the image, or analyzing the event
trace file. The analysis of trace file can help to thoroughly
learn about the situation of the transmission of each
packet.
In order to achieve a new TCP congestion control
method, new NS components must be developed. In the
NS-2, there are two ways to design a component: First,
to achieve by writing new code, and second, to build a
new component by inheriting existing components[24]. In
the experiment, the inherited methods are used to build a
new component TCPLAN.The construction of
component by means of inheritace includes the
following steps:
(1) TCPLAN header files
The header files of improved congestion control
method are as follows:
New file: tcplan.h
class LANTCPAgent: public TcpAgent {
public:
virtual void set_initial_window ()
{
cwnd_ = var_initial;
}
Private:
Int_cwnd_ = var_initial;
};
(2) to create TCPLAN the C + + class files
The C++ class files of improved congestion control
methods are as follows:
New file: tcplan.cc
static LANTcpClass: public TclClass {
public:
LANTcpClass (): TclClass ("Agent / TCP / Lan") {}
TclObject * create (int, const char * const *) {
return (new LANTcpAgent ());
}
};
LANTcpAgent:: LANTcpAgent () {
bind ("var_inital", var_initial);
}
In addition, the following work needs to be done: to
define OTCL link, compose OTCL code, compile and so
on.
According to the characteristics of intra campus
network, the modified congestion control algorithm is
simulated with NS-2. The simulation results show that the
modified algorithm is suitable for intra-campus network
services.
In the simulation experiment, the campus network
topology and connectivity are shown in Figure 2. In the
actual analysis, the discussion will be made according to
agreements. Assuming there is only one network protocol,
it will be discussed later in heterogeneous network
structure.
Figure 2. Network topology and connection diagram
Let us check the changes of the TCP's cwnd by means
of simulation. In the link from n2 to R0, we use udp
services. The using of a stable cbr flow to transmit above
it does not cause congestion at the link node R0. In the
link from n1 to R0, we use the FTP stream to transmit,
with slow start algorithm as its control strategy.
It is described as follows with the TCL language.
Set ftp [new Agent/ftp/newreno]
$ns attach agent $n1 $R0 $ftp
Set tcplan [new Agent/tcplan/newreno]
$ns attach agent $n1 $R0 $tcplan
Set udp [new Agent/udp/newreno]
$ns attach agent $n2 $R0 $udp
Set sink [new Agent/tcpsink/newreno]
$ns attach agent $n5 $R1 $sink1
$ns attach agent $n4 $R1 $sink2
$ns attach agent $n3 $R1 $null
$ns connect $tcp $sink
$ns duplex-link $R0 $R1 2Mb 10ms DropTail
$ns duplex-link $R0 $n1 10Mb 2ms DropTail
$ns duplex-link $R0 $n2 10Mb 2ms DropTail
$ns duplex-link $n3 $R1 10Mb 2ms DropTail
$ns duplex-link $n4 $R1 10Mb 2ms DropTail
$ns duplex-link $n5 $R1 10Mb 2ms DropTail
$ns duplex-link op $n1 $R0 orient left-up
$ns duplex-link op $n2 $R0 orient left-down
$ns duplex-link op $n3 $R1 orient right-down
$ns duplex-link op $n4 $R1 orient right
$ns duplex-link op $n5 $R1 orient right-up
$ns queue limit $n1 $R0 10;
Set cbr [new application/traffic/cbr]
$cbr attach agent $udp
$cbr set type_CBR
$cbr set packet_size 1000
$cbr set rate_1mb
$cbr set random_false
$ns at 0 $cbr start
$ns at 0 $tcp start
$ns at 0 $tcplan start
$ns at 1.2 finsh
$ns run
$ns trace queue $n1 ,$R0 ,$tracefile
Using gnuplot to analyze, cwnd changes are shown in
Figure 3.
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1553
2011 ACADEMY PUBLISHER
Figure 3. TCP's cwnd changes
The results show that, the value of TCP's Congestion
Window will show repeated periodical change[25]. When
TCP begins to execute, it first begins from Slow-start,
then enters the Congestion Avoidance phase when cwnd
is over Ssthresh. When the packets sent to the network
continuously increase until the number which can be
transmitted on the network is over the tolerance, the
router starts using Drop-tail to discard the packets. When
packet loss occurs, TCP will set ssthresh as 1/2 of
Window value which is the value when packet loss is
found, then set the value of the Window as 1. TCP have
to re-start from the slow-start when each packet loss
occurs.
A. TCPLAN transmission effect
The ftp connection is still used to simulate new
agreement; the network topology is shown in Figure 4.
Figure 4. The cwnd change map of TCPLAN
To describe the following with the TCL language:
Set ftp [new Agent/ftp/newreno]
$ns attach agent $n1 $R0 $ftp
Set tcplan [new Agent/tcplan/newreno]
$ns attach agent $n1 $R0 $tcplan
Set udp [new Agent/udp/newreno]
$ns attach agent $n2 $R0 $udp
Set sink [new Agent/tcpsink/newreno]
$ns attach agent $n5 $R1 $sink1
$ns attach agent $n4 $R1 $sink2
$ns attach agent $n3 $R1 $null
$ns connect $tcp $sink
$ns duplex-link $R0 $R1 2Mb 10ms DropTail
$ns duplex-link $R0 $n1 10Mb 2ms DropTail
$ns duplex-link $R0 $n2 10Mb 2ms DropTail
$ns duplex-link $n3 $R1 10Mb 2ms DropTail
$ns duplex-link $n4 $R1 10Mb 2ms DropTail
$ns duplex-link $n5 $R1 10Mb 2ms DropTail
$ns duplex-link op $n1 $R0 orient left-up
$ns duplex-link op $n2 $R0 orient left-down
$ns duplex-link op $n3 $R1 orient right-down
$ns duplex-link op $n4 $R1 orient right
$ns duplex-link op $n5 $R1 orient right-up
$ns queue limit $n1 $R0 10;
Set cbr [new application/traffic/cbr]
$cbr attach agent $udp
$cbr set type_CBR
$cbr set packet_size 1000
$cbr set rate_1mb
$cbr set random_false
$ns at 0 $cbr start
$ns at 0 $tcp start
$ns at 0 $tcplan start
$ns at 1.2 finsh
$ns run
$ns trace queue $n1 ,$R0 ,$tracefile
The results show that in the network environment with
a large amount of data transmission, TCPLAN can
always improve their sending rate in a very short period
of time. With the increase in the amount of data transfer,
according to the data 2 seconds later from the start of
connection, it is faster for TCPLAN to probe the
available bandwidth than TCP with the increase of
connection time.Under the same condition, the
transmission rate of TCPLAN is still higher than the
transmission rate of TCP.
This kind of congestion control method, which is the
one that network topology and connectivity features have
been known, can be used within the campus network
services, but the algorithm is lack of certain versatility.
B. The transmission effect of TCPLAN in the environment
of heterogeneity
The above experiment is only based on the operation of
only one TCP protocol, but in the actual network, it is
bound to consider running in co-existence with other TCP
versions. The following is the comparison on the
condition of heterogeneous structure between TCPLAN
and TCP Vegas.
In simulation experiment, the campus network is
simplified as Figure 5:
Figure 5. Network topology and connection diagram
1554 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
In which, the delay between r0 and r1 is assumed to be
20ms, while the one among other links is assumed to be
1ms.
The main code is described as follows with TCL
language:
$ns duplex-link $n0 $r0 10Mb 1ms DropTail
$ns duplex-link $n2 $r0 10Mb 1ms DropTail
$ns duplex-link $r0 $r1 1Mb 20ms RED
$ns duplex-link $r1 $n1 10Mb 1ms DropTail
$ns duplex-link $r1 $n3 10Mb 1ms DropTail
set buffer_size 15
$ns queue-limit $r0 $r1 $buffer_size
set tcp0 [new Agent/TCP/Vegas];
$tcp0 set v_alpha_ 1
$tcp0 set v_beta_ 3
$tcp0 set debug_ 0
$tcp0 set window_ 24
$tcp0 set fid_ 0
$ns attach-agent $n0 $tcp0
set tcp0sink [new Agent/TCPSink]
$tcp0sink set fid_ 0
$ns attach-agent $n1 $tcp0sink
$ns connect $tcp0 $tcp0sink
set ftp0 [new Application/FTP]
$ftp0 attach-agent $tcp0
Set tcp1 [new Agent / TCP / FTPLAN]
set tcp1 [new Agent/TCP/FTPLAN]
$tcp1 set window_ 24
$tcp1 set fid_ 1
$ns attach-agent $n2 $tcp1
set tcp1sink [new Agent/TCPSink]
$tcp1sink set fid_ 1
$ns attach-agent $n3 $tcp1sink
$ns connect $tcp1 $tcp1sink
set ftp1 [new Application/FTP]
$ftp1 attach-agent $tcp1
Figure 6 shows the cwnd change of Vegas and
TCPLAN
Figure 6 The cwnd change map of TCPLAN and Vegas
It can be seen, TCPLAN always places a higher
vibration, while the Window of Vegas is always kept at a
lower position. As TCPLAN uses a more aggressive
congestion control strategy, the transmitter will continue
to send packets on the network, while Vegas uses a more
conservative approach. In contrast, TCPLAN has higher
bandwidth occupation capabilities.
The results show that: the modified slow start
algorithm can improve network transmission efficiency in
the particular network environment; Because the
algorithm only changes the agreement at the sending end,
and there is no requirement on the receiving end, so the
using of the algorithm does not affect the internal service
performance of network. The algorithm is suitable for the
connection within the campus network, and also applies
to other network connections which have same
connection characteristics.
V. CONCLUSION
Although in network congestion control has been doing
a lot of research, but because the network rapid
development and the complexity of congestion control,
network congestion control will still faced a series of
problems need to be solved. With the expanding of
network size and networking mode diversification, need
more reliable, more timely network congestion detection
mechanism, single depend on lost package or rely on the
judging repeat and confirm receipt method of network
congestion already can not adapt to the needs of the
development of the network.
For congestion control problem of research, based on
theoretical analysis method and using the simulation
software simulation methods have some shortcomings,
congestion control complexity is a long-standing
problems. As IPv6 technology development and mature,
how to play the IP layer in congestion control function, as
well as TCP and IP congestion control cooperation and is
also a new problems waiting to be solved. TCP/IP
congestion control the design and realization of facing
hordes of compromise, the existing congestion control
method and technology in multi-objective various
environments faced with many challenges, there are
many areas in need of improvement, congestion control
will attract many researchers for congestion control
solution of the problems but work hard to struggle.
REFERENCES
[1] Floyd S. A report on some recent developments in TCP
congestion control [J]. IEEE Communication Magazine,
2005, 35(4):84-90
[2] Tranenbaum A S Computer networks 2006,137~152
[3] Low, S.H. Paganini, F. Doyle, J.C. Internet congestion
control . 2006, 22 (1): 28-43
[4] J. Nagle Congestion control in TCP/IP internetworks
ACM SIGCOMM Computer Communication Review,
2008,127~139
http://www.sprintlabs.com/People/diot/publications.html
[5] Feng, W., Kandlur, D., Saha, D., et al. A self-configuring
RED gateway. In: Doshi, B., ed. Proceedings of the IEEE
INFOCOM.New York: IEEE Communications Society,
2005, 1320-1328
[6] H. Yousefizadeh, and H. Jafarkhani ,A Finite-State
Markov Chain Model for Statistical Loss Across a RED
Queue , Systems Communications, 2005,213~215
[7] Allman, M., Floyd, S., Partridge, C. Increasing TCPs
Initial Window. RFC 2414, 2009,321~325
[8] D Katabi, M Handley, C Rohrs, Internet congestion control
for future high bandwidth-delay product environments -
ACM SIGCOMM, 2002 pittsburgh,august,2004,45~48
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1555
2011 ACADEMY PUBLISHER
[9] S Floyd, S Ratnasamy, S Shenker Modifying TCPs
Congestion Control for High Speeds,- Preliminary Draft.
http://www. icir. org/floyd/papers/ hstcp.html,2010.
[10] S Floyd , HighSpeed TCP for Large Congestion Windows,
IETF draft, work in progress, 2004,12~14.
[11] S.Floyd, Limited slow-start for tcp with large congestion
window, IETF,2006,43~45.
[12] Athuraliya S, LOW S H. Optimization flow control (2):
implementation [DB/OL]. http://netlab.caltech.edu,2010.
[13] Low S H. A duality model of TCP and queue management
algorithms [DB/OL]. http://netlab.caltech.edu,2009.
[14] MISRA V. Fluid-based analysis of a network of aqm
routers supporting TCP flows with an application to RED
[DB/OL]. http://www.net.cs.umsaa.Edu/ misra,2008
[15] Soohyun Cho, Riccardo Bettati. Collaborative Congestion
Control in Parallel TCP Flows , 2005 ,1026-1031
[16] Internet Engineering Task Force. TCP Friendly Rate
Control (TFRC): Protocol Specification. Internet Draft,
2006,63~64
[17] MISRAV. Fluid-based analysis of a network of aqm
routers supporting TCP flows with an application to RED
[DB/OL]. http://www.net.cs.umsaa.Edu/ misra,2008
[18] Soohyun Cho, Riccardo Bettati. Collaborative Congestion
Control in Parallel TCP Flows , 2005 ,1026-1031
[19] Low S H. A duality model of TCP and queue management
algorithms [DB/OL]. http://netlab.caltech.edu,2010.
[20] F. Kelly, A. Maulloo, and D. Tan, Rate control for
communication networks:Shadow price proportional
fairness and stability, J. Oper. Res.Soc, 2004,237252
[21] Jeonghoon Mo and Jean Walrand,Fair End-to-End
Window-Based Congestion Control IEEE/ACM
TRANSACTIONS ON NETWORKING, 2005,94~102
[22] T.Kelly,Scalable TCP:Improving performance in
highspeed wide area networks,Submitted for
publication,December 2004,211~237
[23] S.Floyd, Limited slow-start for tcp with large congestion
window, IETF,2006,43~45
[24] S Floyd, S Ratnasamy, S Shenker Modifying TCPs
Congestion Control for High Speeds,- Preliminary Draft.
http://www. icir. org/floyd/papers/ hstcp.html,2009.
[25] Allman, M., Floyd, S., Partridge, C. Increasing TCPs
Initial Window. RFC 2414, 2004,321~325
Gao Guohong (1975-), was born
in Zhengzhou, China. He received
his B.S degree in 2000 form
Computer and Applications, Henan
normal university in Xinxiang, his
M.S degree in 2008 form School of
Computer Technology, Huazhong
University of Science and
Technology, and enroll in Wuhan
University of Technology in 2009, work hard at D.S
degree. Currently he is a professor in the School of
Information Engineer, Henan Institute of Science and
Technology, Henan Xinxiang,China. The main
publications include: Compute Operating
System ;network and information security computer
software.
Xiao Wenxian (1975-), was born in Nanyang, China.
He received his B.S degree in 2000 form Computer and
Applications, PLA information engineering university ,
his M.S degree in 2009 form School of Computer
Technology, Huazhong University of Science and
Technology, he is a associate professor in Henan Institute
of Science and Technology, The main publications
include: Compute Operating System (Beijing, China,
National Defense University Press, 2010), Asp.Net Web
Programming (Beijing, China, Nation Defense University
Press, 2008). His research interests include: network and
information security computer software. He is a advanced
membership of WASE society.
1556 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
Cryptanalysis and Improvement of Selvi et al.s
Identity-Based Threshold Signcryption Scheme
Wei Yuan
Department of Computer Science and Technology, J ilin University, Changchun, China
Email: [email protected]
Liang Hu
Department of Computer Science and Technology, J ilin University, Changchun, China
Email: [email protected]
Hongtu Li
Department of Computer Science and Technology, J ilin University, Changchun, China
Email: [email protected]
J ianfeng Chu
Department of Computer Science and Technology, J ilin University, Changchun, China
Corresponding author, Email: [email protected]
Yuyu Sun
College of Computer Science and Technology, J ilin University, Changchun 130012, China,
Software Institue, Changchun University, Changchun 130022, China
E-mail: [email protected]
AbstractSigncryption can realize the function of
encryption and signature in a reasonable logic step, which
can lower computational costs and communication
overheads. In 2008, S. S. D. Selvi et al. proposed an identity-
based threshold signcryption scheme. In this paper, we show
that the threshold signcryption scheme of S. S. D. Selvi et al.
is vulnerable if the attacker can replaces the group public
key. Then we point out that the receiver uses the senders
public key without any verification in the unsigncrypt stage
cause this attack. Further, we propose a probably-secure
improved scheme to correct the vulnerable and give the
unforgeability and confidentiality of our improved scheme
under the existing security assumption.
Index Termsidentity-based, Signcryption, bilinear pairing,
cryptanalysis, attack
I. INTRODUCTION
Encryption and signature are the two basic
cryptographic tools offered by public key cryptography
for achieving confidentiality and authentication.
Signcryption can realize the function of encryption and
signature in a reasonable logic step which is proposed by
ZHENG [1] in 1997. Comparing to the traditional way of
signature then encryption or encryption then signature,
signcryption can lower the computational costs and
communication overheads. As a result, a number of
signcryption schemes [2][3][4][5][6][7][8]were proposed
following ZHENGs work. The security notion for
signcryption was first formally defined in 2002 by Baek
et al. [9] against adaptive chosen ciphertext attack and
adaptive chosen message attack. The same as signature
and encryption, signcryption meets the attributes of
confidentiality and unforgeability as well.
In 1984, A.Shamir [10] introduced identity-based
public key cryptosystem, in which a users public key can
be calculated from his identity and defined hash function,
while the users private key can be calculated by a trusted
party called Private Key Generator (PKG). The identity
can be any binary string, such as an email address and
neednt to be authenticated by the certification
authentication. As a result, the identity-based public key
cryptosystem simplifies the program of key management
to the conventional public key infrastructure. In 2001,
Boneh and Franklin [11] found bilinear pairings positive
in cryptography and proposed the first practical identity-
based encryption protocol using bilinear pairings. Soon,
many identity-based [12][14][15][16] and other relational
[13][17][18] schemes were proposed and the bilinear
pairings became important tools in constructing identity-
based protocols.
Group-oriented cryptography [19] was introduced by
Desmedt in 1987. Elaborating on this concept, Desmedt
and Frankel [20] proposed a (t,n) threshold signature
scheme based RSA system [21]. In such a (t,n) threshold
signature scheme, any to out of n signers in the group can
collaboratively sign messages on behalf of the group for
sharing the signing capability.
Identity-based signcryption schemes combine the
advantages of identity-based public key cryptosystem and
Signcryption. The first identity-based threshold signature
scheme was proposed by Baek and Zheng [22] in 2004.
Then Duan et al. proposed an identity-based threshold
signcryption scheme [23] in the same year by combining
the concepts of identity based threshold signature and
encryption together. However, in Duan et al.s scheme,
the master-key of the PKG is distributed to a number of
other PKGs, which creates a bottleneck on the PKGs. In
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1557
2011 ACADEMY PUBLISHER
doi:10.4304/jnw.6.11.1557-1564
2005, Peng and Li proposed an identity-based threshold
signcryption scheme [24] based on Libert and
Quisquaters identity-based signcryption scheme [25].
However, Peng and Lis scheme dose not provide the
forward security. In 2008, another scheme [26] was
proposed by Fagen Li et al., which is more efficient
comparing to previous scheme. However, S. S. D. Selvi
et al. pointed out that Fagen Li et al.s scheme is not
equilibrium between the usual members and a dealer
called clerk in Fagen Li et al.s scheme and proposed an
improved scheme [27].
In this paper, we show that the threshold signcryption
scheme of S. S. D. Selvi et al. is vulnerable if the attacker
can replaces the group public key. Then we point out that
the receiver uses the senders public key without any
verification in the unsigncrypt stage cause this attack.
Further, we propose a probably-secure improved scheme
to correct the vulnerable and give the unforgeability and
confidentiality of our improved scheme under the existing
security assumption.
II. PRELIMINARIES
A. Bilinear pairing
Let
1
be a cyclic additive group generated by P,
whose order is a prime q, and
2
G be a cyclic
multiplicative group with the same order q. A bilinear
p
1 1 2
: e G G G with the f
G
pairing is a ma ollowing
pr
1. Bilinearity: for all
Non-dege rative: There exists such
th
putabl re is an fficient algorithm to
B. mp ona
Let and
be a bilinear pairing and let P be a
arithm P DLP)
Giv such that
tational Diffie-Hel Problem
en for , find the
r Diffie-Hellman Pro (BDHP)
Given for compute
r Diffie-Hellman Problem (DBDHP)
operties:
( , ) ( , )
ab
e aP bQ e P Q =
1
,Q G , ,
q
a b Z .
2. ne
P
1
, P Q G
at ( , ) 1 e P Q .
3. Com e: The e
compute ( , ) e P Q for all
1
, P Q G .
Co utati l problems
be two groups of prime order q, let
1 1 2
: e G G
1
G
2
G
G
generator of
1
G .
Discrete Log roblem (
en
1
, P Q G , find
q
n Z
P nQ = whenever such n exists.
Compu lman (CDHP)
Giv ( ) P
1
, , aP bP G
*
,
q
a b Z
elementabP .
Bilinea blem
P ( )
1
, , , P a bP cP G
*
, ,
q
a b c Z ,
2
( , )
xyz
e P P G
Bilinea
Given( )
4
1 2
, , , , P aP bP cP G G fo
*
, ,
q
a b c Z r ,
alg
en
the systems public
pa
s als
h
iven a users identity ID, the PKG will
co
decide whether ( , )
abc
e P P =
C. ty Based Threshol Identi d Signcryption
A generic identity-bases threshold signcryption scheme
with total n players and t threshold limit consists of the
following five orithms:
Setup: Giv a security parameter k, the private key
generator (PKG) generates
rameters. Among the parameters produced by Setup is
a public key
pub
P . There i o a corresponding master
key s t at is kept secret by PKG.
Extract: G
mpute a public key
ID
Q , nerate the private
key
ge
ID
S and transmit the private key to its owner in a
secure way.
Keydis: Given a private key
ID
S
of u
associated with an
identity ID that stands for a group mber of
signcryption members n and a threshold parameter t, this
algorithm generates n shares of
sers, the nu
ID
S and provides each
one to the signcryption members
1 2
, , ,
n
M M M . It also
generate ver eys that can be used to
check dity of each ared private key. We de ote
the sh red priv
ke
s a set of ification
the vali sh n
a ate keys and t fication
k
h tching veri e ma
ys by
1, ,
{ }
i i n
S
=
and
1, ,
{ }
i i n
y
=
, respectively. Note that
each ( ) ,
i
S nt to
i i
y is se M , then
i
M publishes
i
y but
keeps
i
S sec
Signcrypt: Give a message m, the private eys of t
members{
i i
S
ret.
k
in a sender group , the receivers
pu
1, ,
}
t = A
U
blic key
B
ID
Q , the Signcrypt gorithm outputs an
identity-based (t,n) threshold signcryption
al
on
ive
the
message m.
Designcrypt: G a ciphertext , the private key of
the receiver
B
ID
S , the public key the sender group
A
ID
Q , it
outputs the plain text m orif is an invalid ciphertext
be
id
yption scheme is
ilistic
tween the group
A
U and the receiver.
D. Security notions for identity-based Threshold
signcryption
The notion of semantic security of public key
encryption was extended to identity-based signcryption
scheme by Malone-Lee [28]. This was later modified by
Sherman et al. [29] which incorporates
indistinguishability against adaptive chosen ciphertext
and identity attacks (IND-IDTSC-CCA2) and existential
unforgeability against adaptive chosen message and
entity attacks (EUF-IDTSC). We describe below the
security notions for confidentiality and unforgeability
given in [30], this is the strongest security notion for this
problem.
Confidentiality: A signcr
semantically secure against chosen ciphertext and identity
attacks (IND-IDTSC-CCA2) if no probab
1558 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
po
d se
i. T
E
essage m, send
id
lynomial time adversary Eve has a non-negligible
advantage in the following game:
1. The challenger C runs the Setup algorithm an nds
the system public parameters to the adversary Eve.
2. In the first phase, Eve makes polynomial bounded
number f queries o to the following oracles.
Extract Oracle: Eve produces an identity
i
ID and
queries for the secret key of user he Extract Oracle
returns
i
S to ve.
Signcrypt Oracle: Eve produces a m er
entity
A
ID and receiver identity
B
ID . C computes the
secret ke m Extract Oracle and retu Eve, the
signcrypted c hertext from Signcrypt {
y
A
S f
i
ro rns to
p }
1, ,
, ,
i j
i t
m S ID
=
.
U ncrypt Oracle: Eve produces a sender
ver identity
nsig
identity and recei
A
ID
B
ID and a
sig cry tion n p . The challenger C computes the secret
key
B
S from Extract Oracle, returning the result of
( , , )
A
ID B
Unsigncrypt Q S to Eve. The result returned
isif is a valid signcryption from
A
U to
B
U .
3. A p
0
and
1
m of equal length
rbitrary sender
identity
A
ID . The challenger C flips a co
roduces two messages
from the message space
in, sampling a
m
M and an a
bit { } 0,1 b and computes
*
1, ,
( ,{ } , )
b i i t B
Signcrypt m S ID
=
=
.
*
is return to Eve
as challenge signcr ed ciphertext.
4. Eve is allowed t ke mial bounded number
of new queries as in step 2 with the restrictions that it
should query the Unsigncryption oracle for the
unsigncryp
ypt
o ma polyno
not
tion of
*
, the Signcryption Oracle for the
sig
le for the secret keys
ncryption of
0
m or
1
m under the sender
identity
A
ID and xtract the E Orac
of
B
ID .
5. At the end of this game, Eve outputs a bit
'
b . Eve
wins the game if
'
b b = .
Unforgeability: A signcryption scheme is existentially
unforgeable under chosen message attack (EUF-IDTSC)
if no probabilistic polynomial time adversary Eve has a
non-negligible advantage in the following game.
1. The challenger C runs the Setup algorithm to
generate the master public and private keys params and
msk respectively. C gives system public parameters
params to Eve and keeps the master privat msk
cret from Eve.
2. The adversary Eve makes po omial bounded
number of queries to the oracles scribed in step 2 of
the e.
3. E pted ciphertext
e key
se
lyn
as de
confidentiality gam
ve produces a signcry and wins
the game if the private key of sender
A
U was not ied
in the
quer
previous step and returned
by
is not
( , , )
A
ID B
nsigncrypt Q S U
and i
t Oracle with
s not th tput of a
pr as sender.
. REVIEW OF S. S. D. SELVI ET AL.S IDENTITY-BASED
sche trust
e ou
A
ID evious query to the Signcryp
THRESHOLD SIGNCRYPTION SCHEME
The me involves four roles: the PKG, a
de { aler, a sender group }
1 2
, , ,
A n
U M M M = with
identity
A
and eceiver Bob with identity ID a r
B
I .
S p: Given a security parameter k, the KG chooses
group prime order q (with
1
G additive
2
map : e G G G
D
etu P
s and of
and ltip e), a generator P of bilinear
2
1
G
mu
2
G
licativ G
1
G , a
1 1
m )
{ }
*
1 1
: 0,1 H G , { }
1
2 2
: 0,1
n
H G ,
}
*
*
3 q
. Th hooses a master-key
*
, a secure sym r (E,D
and
etric ciphe
hash f
{ : 0,1
unctions
H Z e PKG c
R q
s Z PKG publishes
system
and c putes om
pub
P sP = . The
ame par ters
{ }
1 2 1 1 2 3
, , , , , , , , , ,
pub
G n e P P E D H H H and keeps the
master-key
Extract: Given an identity ID, the PKG
compu
G
s secret.
tes ( ) Q H ID
1 ID
= and the private key
ID ID
S sQ = .
Th
:
en PKG sends the private key to its owner in a secure
way.
Keydis Suppose that a threshold t and n
satisfy1 t n q < . To share the private key
A
ID
am
the group
A
U , the trusted
S ong
dealer performs th below. e steps
1) Choose
1 1
, ,
t
F F
nomial
unif rmly at ran m from
co
1
o do ,
nstruct a poly
1
*
1
G
1
( )
A
t
ID t
F x S xF x F
= + + +
2) Compute ( )
i
S F i = fo . ( r 0, , i n =
0
A
ID
S S = ).
mb Send o me
i
S t er
i
M for 1, , i n = secretly.
3). Broadcast
0
( , )
A
ID
y e S P = and , )
j j
y e F P = for
1, ,
(
1 j t = .
4) Each
i
M then checks whether his sha is v
co
re
i
alid by
1
S
mputing
0
( , )
j
t i
P
=
i j j
e S y
=
. If
i
S is not
valid,
i
,
1
t
i i
=
=
2
, ( ) k H = ,
( ) m
k
c E = , andh H
3 1
, ) m R k ( , = .
3) Then the clerk C sends h to
i
M for 0, , i t = .
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1559
2011 ACADEMY PUBLISHER
4) Each
i
M computes the partial signature
i i pub i
W x P S
i
h =
1,
(
t
j j
=
+
i
j i =
and sends it to the clerk C, where
mod q.
1
) j
W
6) The final threshold signcryption is
1
( , , ) c R W = .
Unsigncrypt: When receiving , Bob follows the
steps below.
1) Compute
1
( , )
B
ID
e R S = and
2
( ) k H = .
2) Recover ( )
k
m D c =
3) Compute
3 1
and accept ( , , ) h H m R k = if and only
if the following equation holds:
1
( , ) ( , )
A
pub ID
e P W e P R hQ = +
. CRYPTANALYSIS OF S. S. D. SELVI ET AL.S SCHEME
The two schemes are both insecure from the view of
attack by a malicious attacker who can control the
communication channel.
The attacker intercepts the
ciphertext
1
( , , ) c R W = from sender.
1) Randomly choose
*
,
q
x Z and prepare a forged
message
'
m
2) Compute
'
1
R xP = ,
'
2 pub
R xP = , ,
' '
2
( , )
B
ID
e R Q =
'
2
( ) k H = , , .
'
' '
k
c E = ( ) m
'
h H =
' ' '
3 1
( , , ) m R k
3) Compute , set as a
public key ofU
'
pub
W P =
A
'
( ) /
A
Q x P =
'
h
'
4) The final ciphertext is .
' ' ' '
1
( , , ) c R W =
5) Attacker sends the forged ciphertext and the
replaced public key to the receiver.
After receiving the ciphertext , the
receiver
' ' ' '
1
( , , ) c R W =
1) Compute
' '
1 2
( , ) ( , )
B B
ID ID
e R S e R Q = =
' '
) k =
=
'
'
'
,
2 2
( ) ( k H H = =
2) Recover ,
'
.
'
' '
( ) ( )
k
k
m D c D c m = = =
' ' '
3 1
( , , ) h H m R k h = =
3) Verify
?
' '
1
( , ) ( , )
A
pub ID
e P W e P R hQ = +
' ' '
1
( , ) ( , ( ) / ) (
A
pub ID pub
e P R hQ e P xP h x P h e + = + =
'
, ) ( , )
pub
P P e P W =
The equation set.
' '
1
( , ) ( , )
A
pub ID
e P W e P R hQ = +
Discussion
In the view of the attacker, [27] can be simulated as
following basic Signcryption scheme:
A sender Alice with key pairs
{ }
1 1
( ), (
A A
Q H Alice S sH Alice = = )
A receiver Bob with key pairs
{ }
1 1
( ), ( )
Bob B
Q H Bob S sH Bob = =
Alice chooses
*
q
x Z ,
1
R xP = ,
2 pub
R xP = ,
2
( , )
B
e R Q = ,
2
( ) k H = , , ( )
k
c E m =
3 1
( , , h H m R ) k = ,
pub
P h
A
W x S = + and sends
1
( , , ) c R W = to Bob as the ciphertext of his message.
There is a small mistake of the definition
. We think the authors real intention
is to meet
{ }
*
*
3
: 0,1
q
H
{ } : 0,1
3 1
( , , ) h H m R k
Z
Z { }
* *
*
3 1
0,1
q
H G
= . In this hash function, any message
about the sender is not contained. If an attacker Eve say
I am Alice to Bob, Bob can not distinguish only with
the hash value h. Our attack just utilizes this attribute of
Lis scheme.
Suppose that
3
H is defined as
, and { } { }
* *
*
3 1 1
: 0,1 0,1
q
H G G
3 1
( , , , )
A
ID
h H m R k Q
Z
= . The attacker Eve intercepts the
ciphertext
1
( , , c R W) = from sender Alice and she runs
the algorithm of forging ciphertext like:
1) Randomly choose
*
,
q
x Z and prepare a forged
message
'
m
2) Compute
'
1
R xP = ,
'
2 pub
R xP = , ,
' '
2
( , )
B
ID
e R Q =
'
2
( ) k H = , , .
'
' '
k
c E = ( ) m h H =
' ' ' ' '
3 1
( , , , )
A
m R k Q
3) Compute , set as a
public key ofU
'
pub
W P =
A
' '
( ) /
A
Q x P = h
4) The final ciphertext is .
' ' ' '
1
( , , ) c R W =
5) Send the forged ciphertext and the replaced public
key to the receiver.
She will meet a hard problem that if she wants to
compute , is necessary or if she wants to
computes , must be known. As a result, if she can
succeed in forging the ciphertext, she must own the
ability to solve the DL problem.
'
h
'
A
Q
'
A
Q
'
h
. THE IMPROVEMENT OF S. S. D. SELVI ET AL. SCHEME
The scheme involves four roles: the PKG, a trust
dealer, a sender group { }
1 2
, , ,
A n
U M M M = with
identity and a receiver Bob with identity
A
ID
B
ID .
1560 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
Setup: Given a security parameter k, the PKG chooses
groups and of prime order q (with additive
and multiplicative), a generator P of , a bilinear
map , a secure symmetric cipher (E,D)
and hash functions , ,
. The PKG chooses a
master-key
1
G
: e G
{ } : 0,1
2
G
*
3 1
1
G
2 2
2
G
1
G
H G
1 1
G G
{ } 0, H G
*
2
{ }
*
1 1
: 0,1 H G
*
*
1
1
q
G
{ }
1
: 0,1
n
Z
R q
s Z and computes . The PKG
publishes system
parameters
pub
P = sP
{ }
1 2
G G
1
, , , , ,
pub
e P P E
1
, H
2
, H
3
, D H , ,n and
keeps the master-key s secret.
Extract: Given an identity ID, the PKG
computes and the private key
1
( )
ID
Q H ID =
ID I
S sQ
D
= .
Then PKG sends the private key to its owner in a secure
way.
Keydis: Suppose that a threshold t and n
satisfy1 . To share the private key t n q <
A
ID
S among
the group , the trusted dealer performs the steps below.
A
U
1) Choose
1
, ,
t 1
F F
= + + +
2) Compute for . ( ( )
i
S F i = 0, , i n =
0
A
ID
S S = ).
Send to member
i
S
i
M for secretly. 1, , i n =
3). Broadcast and
for .
0
( ,
A
ID
y e S P =
1
)
( , )
j j
y e F P = 1, , j t =
4) Each
i
M then checks whether his share is valid by
computing
i
S
1
0
( , )
j
t i
i j
e S P y
=
=
j
. If is not
valid,
i
S
i
M broadcasts an error and requests a valid one.
Signcrypt: Let
1
, ,
t
M M are the t members who want
to cooperate to signcrypt a message m on behalf of the
group .
A
U
1) Each
i
M chooses
*
i R q
x Z , computes
1i i
R x P = ,
2i i pub
R x P = ,
2
( ,
i i
e R Q )
D
B
I
= and sends
( )
1
,
i
R to the clerk C.
2) The clerk C (one among the t cooperating players)
computes
1 1
t
i i 1
R R
=
=
,
1
t
i i
=
=
,
2
( ) k H = ,
, andh H . (
k
c E = ) m =
3 1
, , )
A
ID
R k Q ( , m
3) Then the clerk C sends h to
i
M for . 0, , i t =
4) Each
i
M computes the partial signature
i i pub i
W x P S
i
h = +
=
=
mod q.
5) Clerk C verifies the correctness of partial signatures
by checking if the following equation holds:
1
1 0
( , ) ( , )( )
j
i
h t i
i i pub j j
e P W e R P y
=
=
If all partial signatures are verified to be legal, the
clerk C computes
1
t
i
W
=
=
i
W
i
on C
1
. Else
Z tes
i
, compu
ID
Q
i
P = ,
i
ID i pub
S P = ,
returns
i
ID
to E and sets( , , , )
i i i i
ID Q S Q on
1
L .
2
H query: C maintains a list
2
L to
record
2
H queries.
2
L has th f ( , ) k e form o . C receives
a qu
If
ery about
i
, selects
*
i q
k Z , returns
i
k to E, and
sets( ,k )
i i
on
2
L .
3
H query: C maintains to
record
3
a list L
3
H queries.
3
L ha , , , , ) R k Q
receives a query about(m cts
s the form of
, , , R k
(
Q
m h
) , sele
. If C
*
i q
h Z
1
i
i i i ID
,
returns
i
h to Eve, and sets
1
( , , , , )
i i i ID i
m R k Q h on
3
L .
query: If C receive
i
ry about Signcry Signcrypt
ith messag d
s a
e
i
m , i entity
i
ID
Z
que pt
w
1. Select
*
i q
c ,
1 i
W G
-up
1
L ,
2
L set
i
2. Loo , k
ID i
Q P = in
1
L ,
i i
k k = in
2
L .
Compute
i i
R c P = ,if
i j
ID ID . Else, if ID
i j
ID = ,
compute
i
R cP =
( ) H m 3. Set , , , h R k Q
3
i
i i i i ID
= .
the tit
m
4. Return( , )
i i
h W to Eve.
After the first stage, Eve chooses a pair tities on
which he wishes to be challenged on( , )
i j
ID ID . Note
that Eve can not query iden of
A
ID . Then Eve
wo plaintexts
0
and
1
m . C choos t
{0,1 b
of iden
es
y
outputs t
}
a bi
and signcrypts
b
m . To do so, he sets
*
1
R cP = ,
obtains
*
k
2
( ) H = from the hash function
2
H , and
computes
*
1
(
b b
k
c E m ) = . Then C chooses
*
1
W G and
sends the ciphertext
* * *
1
( , , )
b
c R W = to Eve. Eve can
performs a second series of queries like at the first one.
At the e the simulatio
'
b for
which he b
rel
*
nd
n
of n, she
e
pr
liev
oduces a bit
es the
atio =Signcrypt
'
( ,{
b
m S
1, ,
} , )
t j
ID
i i =
holds.
If b b
'
= , C
m.
outputs
*
1
) (
j
ID
S e c
( , )
abc
e P P . So C can so
( , R , ) ( , )
abc
e P abP e P P = = = . Else, C
outputs lve the BDDH
proble
Further, we propose a probably-secure improved scheme
. CONCLUSION
In this paper, we show that the threshold signcryption
scheme of S. S. D. Selvi et al. is vulnerable if the attacker
can replaces the group public key. Then we point out that
the receiver uses the senders public key without any
verification in the unsigncrypt stage cause this attack.
1562 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
to correct the vulnerable and give the unforgeability and
confidentiality of our improved scheme under the existing
security assumption.
e
Excellent Talents in University (NCET-06-0300).
9
LNC
CS, vol.2729, pp.383-399, Spr
CS, vol.1977, pp.155-164
CS, vol.3650, pp.218-23
NCS, v
S vol.2947, pp.187-200, Springer
02 LNCS vol.227
O84 LNCS 196, pp
, LNCS 213
Conference
-
lin Gui, Multi-recipient
the standard model,
igncryption scheme in the standard
n scheme without random
oriented
generation of authenticators
natures and public-key
ational
rom pairings, International
rity,
schemes from pairings, 2003 IEEE
n efficient and Provably
Identity-Based Threshold Signcryption
ACKNOWLEDGMENT
The authors would like to thank the editors and
anonymous reviewers for their valuable comments. This
work is supported by the National Natural Science
Foundation of China under Grant No. 60873235 and
60473099, the National Grand Fundamental Research
973 Program of China (Grant No. 2009CB320706),
Scientific and Technological Developing Scheme of J ilin
Province (20080318), and Program of New C ntury
signcryption for secure group communication, ICIEA
2009, pp.161-165.
[16] Zhengping J in, Qiaoyan Wen, Hongzhen Du, An
improved semantically-secure identity-based
signcryption scheme in
REFERENCES
[1] Zheng Y Digital signcryption or How to achieve cost
(signature & Encryption)<<cost (signature) +cost
(encryption), In Proc. Advances in CRYPTO 7,
Co
LNCS 1294, pp.165-179, Springer-Verlag,1997.
[2] Bao F., Deng R H, A signcryption scheme with
signature directly verifiable by public key. PKC98
LNCS, vol.1431, pp55-59, Springer-Verlag, 1997.
[3] Chow S.S.M., Yiu S.M., Hui L.C.K., Chow K.P.,
Efficient forward and provably secure ID-based
signcryption scheme with public verifiability and
public ciphertext authenticity. ICISC03 S,
or
vol.2971, pp.352-269, Spring-Verlag, 2004.
[4] Boyen X, Multipurpose identity based signcryption:
a swiss army knife for identity based cryptography.
CRYPT03 LN inger-
,
and signatures, CRYPTO91, LNCS 576, pp.457-469,
Springer-Varlag, 1991
[21] R. L. Rivest, A. Shamir, L. Adleman, A method for
obtaining digital sig
Varlag, 2003.
[5] Mu Y., Varadharajan V., Distributed signcryption,
INDOCRYPT00. LN
Springer-Varlag, 2000
[6] Yang G., Wong D.S., Deng X., Analysis and
improvement of a signcryption scheme with key
privacy, ISC05. LN 2,
cr
Springer-Varlag, 2005
[7] SteinFeld R., Zheng Y., A signcryption scheme
based on integer factorization. ISW00. L ol
C
1975, pp.308-322, Springer-Varlag, 2000
[8] Libert B., Quisquater J., Efficient signcryption with
key prevacy from gap Diffie-Hellman groups.
PKC04 LNC -
Con
Varlag, 2004
[9] Baek J ., Steinfeld R., Zheng Y., Formal proofs for
the security of signcryption, PKC 4,
Co
pp.80-98, Springer-Varlag, 2002
[10] A. Shamir,Identity-based cryptosystems and
signature schemes, CRYPT .47-
53, Springer-Varlag, 1984.
[11] D. Boneh, M. Franklin, Identity-based encryption
from well pairing, CRYPTO01 9,
S
pp.213-229, Springer-Varlag, 2001
[12] P.S.L.M. Barreto, B. Libert, N. Mccullagh, J .J.
Quisquater, Efficient and provably-secure identity-
based signatures and signcryption from bilinear maps
ASIACRYPT05, LNCS 3788, pp.515-532,
Springer-Verlag, 2005
[13] X. Huang, W. Susilo, Y. Mu, E Zhang, Identity-
based ring signcryption schemes: cryptographic
primitives for preserving privacy and authenticity in t
he ubiquitous world, 19
th
International
on Advanced Information Networking and
Applications, pp.649-654, Taiwan, 2005
[14] Fagen Li, Hu Xiong, Xuyun Nie, A new multi
receiver ID-based signcryption scheme for group
communications, ICCCAS2009, pp.296-300, 2009
[15] Yiliang Han, Xiao
mputers and Electrical Engineering 36(2010),
pp.545-552,Elsevier, 2010
[17] Zhenhua Liu, Yupu Hu, Xiangsong Zhang, Hua Ma,
Certificateless s
model, Information Sciences 180(2010), pp.452-464,
Elsevier, 2010.
[18] Yong Yu, Bo Yang, Ying Sun, Sheng-lin Zhu,
Identity based signcryptio
acles, Computer Standards & Interfaces 31(2009),
pp.56-62, Elsevier, 2009
[19] Y. Desmedt, Society and group
cryptography: a now concept, CRYPTO87, LNCS
293, pp.120-127, Springer-Varlag, 1987
[20] Y. Des. Frankel, Shared
yptosystems, Communications of the ACM, Vol.21,
No.2, pp.120-126, 1978
[22] J . Baek, Y. Zheng, Identity-based threshold signature
scheme from the bilinear pairings, Intern
onference on Information Technology 2004,
pp.124-128, Las Vegas, Nevada, USA, 2004
[23] S. Duan, Z. Cao, R. Lu, Robust ID-based threshold
signcryption scheme f
ference on Information security, pp.33-37,
Shanghai, China, 2004
[24] C. Peng, X. Li, An identity-based threshold
signcryption scheme with semantic secu
mputational Intelligence and Security 2005, LNAI
3902, pp.173-179, Springer-Varlag, 2005
[25] B. Libert, J .J . Quisquater, Anew identity based
signcryption
information theory workshop, pp.155-158, Paris,
France, 2003
[26] Fagen Li, Yong Yu, A
ecure ID-Based Threshold Signcryption Scheme,
ICCCAS 2008, 488-492
[27] Selvi S.S.D., Vivek S.S, Rangan C.P, Cryptanalysis
of Li et al.s
Scheme, Embedded and Ubiquitous Computing 2008,
pp.127-132
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1563
2011 ACADEMY PUBLISHER
[28] Malone Lee J :Identity based signcryption. In:
Cryptology ePrint Archive. Report 2002/098, 2002.
[29] Chow S.S.M., Yiu S.M., Hui L.C.K., Chow K.P.:
Efficient forward and provably secure ID-based
signcryption scheme with public verifiability and
public ciphertext authen
ticity. In: Lin, J .-I., Lee, D.-
ed cryptography.
In: Boneh, D. (ed.) CRYPTO 2003. LNCS, vol. 2729,
pp.383-399. Springer-Varlag, 2003
one National Grand Fundamental
China and published more tha
rticipant, Dr Liang
u has finished more than
ministerial level research proj
2006, Li Hongtu got
bachelors degree of computer
J ianfeng Chu, Correspondin
of the
stworthy
distributed network should be made.
t. She has
published 14 academic articles in English and Chinese,
four of that has been retrieved by EI.
H. (eds.) ICISC 2003. LNCS, vol. 2971, pp.352-369.
Springer-Varlag, 2004
[30] Boyen X.: Multipurpose identity based signcryption:
a Swiss army knife for identity bas
Wei Yuan was born in Chengde
of Hebei province of China in
1984. He began the study of
computer science at J ilin
University in 2003 and got his
bachelor degree in 2007. Then he
continued his research on
information security and received
his master degree in 2010. Now
he is a PhD candidate of the
college of computer science and
technology of J ilin University.
His main research interests
include cryptography and
information security. he have participated in several projects
include two National Natural Science Foundations of China and
Research 973 Program of
n 10 research papers from 2007.
Liang Hu was born in 1968. He
has his BS degree on Computer
Systems Harbin Institute of
Technology in 1993 and his PhD
on Computer Software and
Theory in 1999. Currently, he is
the professor and PhD supervisor
of College of Computer Science
and Technology, J ilin University,
China.
His main research interests
include distributed systems,
computer networks,
communications technology and information security system,
etc. As a person in charge or a principal pa
20 national, provincial and
ects of China.
Li Hongtu was born in Siping
of J ilin, China on Mar. 17
1984. In 2002, Li Hongtu
began the study of computer
science at J ilin University in
J ilin, Changchun, China. And
science. In the same year, Li Hongtu began the masters
degree study in network security at J ilin University. After
3 years study, Li Hongtu got his masters degree in 2009.
From then on, Li Hongtu began the doctors degree in the
same field of study at the same University.
From 2009, he has got a fellowship job. He worked in
grid and network security laboratory as an ASSISTANT
RESEACHER at J ilin University. From 2006 to now, he
has published several papers.
g author, was born in 1978, Ph.D.
Now he is the teacher of the
College of Computer Science and
Technology, Jilin University,
Changchun, China. He received
the Ph.D. degree in computer
structure from J ilin University in
2009. His current research
interests focus on information
security and cryptology.
An important objective
projects is to probe the trend of
network security, which can
satisfy the need of constructing high-speed, large-scale and
multi-services networks. Various complex attacks can not be
dealt with by simple defense. And to add mechanisms to
network architecture results in decreasing performance. In a
word, fundamental re-examination of how to build tru
Yuyu Sun, female, born in
1977,Lecturer, Ph.D. of Jilin
University. She graduated
from the Department of
Computer Science and
Technology of J ilin
University in 2005,and
obtained an MA degree. From
2008,she began to start her
doctorate in computer in Jilin
University, now she is
working in Changchun
University. Her current research interests include network
and information security. She mainly engaged in
Teaching and research on information security and
Application software development. She has participated
in one National Natural Science Foundation of China, one
Major Project of Chinese National Programs for
Fundamental Research and Development (973 Program),
five Science and technology support key project plan of
J ilin Provincial Science and technology Department ,
three S&T plan projects of J ilin Provincial Education
Department .She has Wrote 4 textbooks as ye
H
in
1564 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
An Independent Set Packet Classification
Algorithm Using Priority Sorting
Rong Hui-Gui, Chen Hao
School of Information Science and Engineering, Hunan University,
Changsha, 410082, China
[email protected] [email protected]
AbstractIndependent set algorithms, as a kind of packet
classification algorithms with space efficiency, has lower
execution efficiency for the lack of priority consideration in
linear matching process. In addition, new independent sets
created frequently as a result of dynamic updates greatly
increase its dependence on the consumption of storage space.
In order to overcome these above disadvantages, an
improved algorithm based on independent sets using
priority sorting (ISSP) is proposed and an improvement
strategy of split rule is designed for higher storage efficiency
in dynamic updates. The simulation results further show
that the improved algorithm, compared with IS algorithm,
reduces its dependence on storage space in dynamic updates
and has higher execution efficiency.
I ndex Termspacket classification, independent sets (IS),
priority sorting, dynamic updates
I. INTRODUCTION
With the rapid development of network technology and
the emerging network applications, Internet users are
demanding more for reliability, security and diversity of
the network service [1]. It is necessary for routers to
provide differentiated network services to meet the needs
of different users, such as packet filtering firewall, traffic
accounting, differentiated services, QoS and so on.
Routers should have the ability of fast packet
classification to support these differentiated services. Fast
packet classification algorithms have become a key
technology for high-speed routers and also have been the
key of avoiding the router being as the bottleneck of
network performance.
Packet classification algorithms in general may be
divided into two categories: one group is algorithms
implemented by pure hardware implementation, such as
content access memory (CAM). This group algorithms
have a good lookup efficiency, but it is difficult to
promote since own deficiencies (bulky, high power
consumption, supporting no range type of rules) [2]; the
other group is through software implementation, and they
are subdivided into algorithms based on Terry tree and
collection location. Xuehong Sun presents a new fast
packet classification algorithm based on independent sets,
in the IEEE Transaction On Networking meeting [3], and
this algorithm has become the most popular and efficient
packet classification algorithms in recent years.
This paper deals with the traditional problems of IS
packet classification algorithms and analyzes some key
factors affecting the performance of IS algorithm, then an
improved algorithm (ISSP) is proposed. The approved
algorithm maintains the original characteristics of IS
algorithm and it solves the linear matching issues by
introducing a priority-sorted mechanism since the first
matching rule is the final rule after sorting rather than
traverse the whole rule index table. As a result, this new
algorithm raise the performance efficiency; on the same
time, new independent sets created frequently for
dynamic updates greatly increase its dependence on the
consumption of storage space, an improvement strategy
by split rule is designed for higher storage efficiency in
dynamic updates.
PROBLEM DESCRIPTIONS
Independent sets originate from the concept of
independent set [4] in graph theory, in which independent
sets mean the subset of vertex set, and any two vertices
are not connected. The idea that get independent elements
together to determine the relevance have been reflected in
many mathematical models, above all, it is widely
applied to different scientific areas, such as fault
diagnosis, computer vision, computer networks and so on.
This paper focuses on the application and expansion of
independent sets in packet classification fields and uses
"independent sets" to distinguish from the independent
set concept in graph theory.
A. Calculation of independent sets
The first step of using independent sets to packet
classification is to construct several independent sets
based on rule base and each independent sets should
contain as many as rules for raising the storage and
lookup efficiency. The rule base of packet classification
may be mapped to an undirected graph according to the
overlapping relationship among rules, and then the
problem of constructing independent sets based on the
rule library becomes that of finding independent sets in
graph theory. Solving Independent sets always is a classic
Foundation item: Project (531107021115) supported by Fundamental
Research Funds for the Central Universities; Project (61070194)
supported by the National Natural Science Foundation of China.
Manuscript received: January 1, 2011; revised June 1, 2011; accepted
July 1, 2011.
Corresponding author: Rong Hui-gui, Lecturer of Hunan University,
Doctor of Wuhan University; Tel: +86-731- 88828148; E-mail: ronghg
@163.com
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1565
2011 ACADEMY PUBLISHER
doi:10.4304/jnw.6.11.1565-1571
and complex problem in graph theory [5]. This paper uses
a solution for finding independent sets based greedy
algorithm which generates a maximal independent set of
local optimum after once iteration, and maximal
independent sets of local optimum will be constructed
after a number of iterations. Fig.1 illustrates packet
classification rule base represented by an undirected
graph, where the left part denotes rule base, the right part
is its undirected graph mapped by the rule base.
Figure 1. The rule base denoted by the undirected graph
Greedy algorithm [6] is a method with the goal to
get local optimal solution instead of pursuing the overall
optimal solution. Many computer algorithms have used
the idea of greed, such as the knapsack problem, game
theory-related issues and so on.
Given a graph G, V represents all of its vertices, E
represents all the sides, Nv (u) represents all the vertices
connected to u and S means the independent sets of G.
Firstly a vertex x selected from G will be added to the set
S that is initially an empty set. Then all the vertices in Nv
(x) should be deleted from G in order to ensure that the
next vertex from G is not connected with all the vertices
in S, that is independent. The next step is to determine
whether the graph G is a complete graph after Nv (x)
being deleted. If G becomes a complete graph or only one
vertex is left, then we add any vertex of G to S and S will
become a maximum independent set of local optimum; if
G is still not a complete graph and also have more than
one vertex, then the above steps should be repeated until
G becomes a complete graph or only a vertex is left. Fig.2
shows the calculation of using greedy algorithm to get the
independent set.
Figure 2. The process of using greedy algorithm to calculate the
independent set
In Fig.2, there are six vertices in total. Firstly, add
vertex R1 to the independent set S and delete R2, R3, R5
and R6 that connected with A from G, and all the deleted
vertices and edges are represented with dotted lines. Only
one vertex R4 is left in G after deletion, then R1 and R4
will form an independent set named I
1
, thus, the first
iteration is completed. At the beginning of the second
iteration, vertex R1 and R4 have been removed, only the
vertices R2, R3, R5 and R6 are left, then we add R3 to
independent set S' and remove R6 from G, the remaining
vertices R2 and R6 will form a complete graph. Then, we
take any vertex together with R3 to form an independent
set, if the selected vertex is R2, then R2 and R3 will form
the second independent set I
2
, thus, the second iteration is
finished. Just like this, the third independent set
composed by R3 and R6 may be found.
Although the greedy algorithm in graph theory does
not guarantee that each independent set is the best overall,
but will ensure the independent set of each iteration being
global optimal by improving the greedy algorithm since
the substantial overlapping relationship among connected
vertices, in the undirected graph constructed by the rule
base of packet classification.
R represents the rule base; S denotes independent sets,
r means a rule and the calculation of independent sets is
carried out specifically through the following two steps:
Step 1: Choose r with the smallest destination from R.
If several items satisfy the condition, select one randomly
to add to S and delete all the rules overlap with r from S.
Then jump to step 2.
Step 2: If R is an empty set, then S is a locally optimal
independent set, and the iteration is finished. If R is
nonempty set, jump to step 1.
This method needs a pre-sort for all the rules and its
time complexity is O(nlogn)+O(n), where n is the number
of rules. The method can be proved by mathematics that
independent sets calculated every time is the largest
independent set in current rule base.
B. Limitations of IS algorithm
IS algorithm center on using the calculations of
independent sets to dispatch rules of final rule base into
several independent sets. We use I
1
to denote the first
independent set constructed by the rule R, and suppose
that R
1
= R - I
1
, which represents the remaining rules.
Then, we may carry out the calculation of the
independent set once again to generate I
2
, and then R
2
=
R
1
I
2
. The iteration repeats itself until R
m
becomes an
empty set, at last, the rules of final rule base will be
stored separately in a number of independent sets { I
1
I
2
I
s
} and IS algorithm may be seen
references[3] in detail.
For a given rule base R = { r
1
r
2
r
n
}, the
first step is to conduct a calculation of independent set
based on the mentioned method in section 2.1, then use
the IS algorithm to construct rule index tables for all the
basic sections of B
0
. Fig.3 shows the process.
1566 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
Figure 3. Constructing rule index tables of B0.
Then, in accordance with IS algorithm, the rule index
tables of all the points in B
0
is described in Fig.4:
Figure 4. The rule index tables of all the points in B
0
Unlike FIS[7] algorithm to dealing with the starting
point and end point of the rules, this packet classification
algorithm based on independent only needs to process the
starting point of the rules since no overlapped rules at the
given dimension in the same set, and thus, it saved a lot
of storage consumption. However, this algorithm only
presents one dimensional divisions for the rule base, and
as a result, there exist a larger number of rules in rule
index tables of each point in B
0
. After packets are located
to the basic section of B
0
, the algorithm should begin a
linear matching for these rules, while this group
algorithm needs to lookup the whole rule index table for
getting the rule with highest priority as a result of
ignoring the rules priority in linear matching process,
which greatly reduces lookup efficiency.
In addition, the core structure of packet classification
algorithm based on independent sets consists of several
independent sets, and when new rules arrive, they need to
be added to the appropriate independent sets. If an
independent set can meet the mutual independent
condition among all rules after the new rule being added,
then we add the new rule to it. If the new rule can not be
added to any current independent set, IS algorithm will
create a new independent set, and then add the new rule
into it. Although this method is a relatively simple
implementation, obviously, it greatly increases the
number of independent sets and the length of the rule
index tables, which greatly increased the consumptions of
storage space from dynamic updates.
PROPOSED IMPROVED ALGORITHM
From the before-mentioned discussion and analysis of
IS algorithm, an improved algorithm using priority
sorting is proposed in order to overcome the weakness of
IS algorithm.
A. The implementation of ISSP algorithm
There usually has more than one matching rule for a
packet, so we should the define the priority of the tuples
to ensure the uniqueness of matching results, which
means that the rule found is the highest priority [8-10].
Generally speaking, the longer the prefix is or the smaller
the range is, the higher the corresponding priority is [11].
IS algorithm has lower execution efficiency due to the
lack of priority consideration for rules in linear matching
of data packet, in addition, new independent sets created
frequently for dynamic updates greatly increase the
consumption of storage space. In order to overcome these
above disadvantages, an improved algorithm (ISSP) is
proposed, and the detailed algorithm flow chat is
described in Fig.5.
Figure 5. The detailed algorithm flow chat of ISSP
In this new algorithm, for any b
i
0
in B
0
, constructing its
rule index bale still needs to lookup all points of the
B
k
(k=1 2 s) to find the eligible maximum
value satisfies the condition b
p
k
b
i
0
for each starting
point, and add the corresponding rule of b
p
k
to the index
table of b
i
0
. When a rule being added to the index table,
ISSP algorithm will conduct an insertion sort by priority
for all rules exist in the index tables. Although, the rules
in rule index tables are in order from highest to lowest
priority, and then the linear lookup for the rule index table
should only return to first rule of meeting the needs
instead of traversing the entire index table.
If b
p
k
is a virtual point and its corresponding rule index
is given the value -1, which means no corresponding
rules in B
k.
Once the sorting finishes, these virtual points
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1567
2011 ACADEMY PUBLISHER
will be at the end of the index table. Therefore, the
lookup should stop and return the negative results when
the value of -1 is met, which indicates no corresponding
rules matching the data packets during searching the rule
index table, since all the values of the rest indexes are -1
and have no actual rule indexes. Obviously, this
improvement of filtering the index entry valued -1 is very
significant when the values of -1 occupy the major
positions in many rule index tables.
The concept of rule priority is basically in agreement
with the longest prefix match, and it means the longer
rule prefix or the smaller rule scope, the higher priority.
In rare circumstances, some exceptions exist, where some
rules with a larger scope have been artificially defined as
a high priority, only when special traffic should be
protected, such as the necessary protection of VOIP
traffic to ensure the screen session smoothly in a large
enterprise. Assumed that the priority assignment of the
rules in Fig.3 is described as follows:
Table 1. the rules list sorted by priority
The number
of rules
priority The number
of rules
priority
1 high 2 very high
3 low 4 high
5 very high 6 low
7 Very high 8 high
9 Very low 10 low
11 high 12 high
13 low
The corresponding rule index table will become the
next Fig.6:
Figure 6. The rule index table after priority sorting
Obviously, ISSP algorithm may improve the lookup
efficiency of rule index tables, while this improvement
also undermined the overall structure of IS algorithm. In
IS algorithm, the i-index in rule index table stores a rule
in a independent set I
i
, and after priority sorting, the rules
that the i- index stored do not necessarily belong to I
i
,
only ensure that the priority of the index is larger than i.
despite some original features of the improved structure
being lost, but this improvement increases the processing
speed of packet classification and only the real-time
updates of rules base willget negative effect. ISSP
algorithm is better than that of IS algorithm in overall if it
is not required for router to support dynamic updates.
Provided the need for real-time update rule base, then the
original rules index table should be kept and the rule
index table constructed by ISSP algorithm is also added
to each point of B
0
. Therefore, there are two rule index
tables in each point of B
0
, and one table is sorted by
priority, the other maintains the original order. This
strategy may increase the storage overhead, but can
reduce the time-consuming, and maintaining the support
for dynamic updates feature.
The core structure of packet classification algorithm
based on independent sets consists of a number of
independent sets. When a new rule is added to the rule
library, it will be accepted finally only if the rule meets
the mutual independent conditions among the rules. So, if
existing independent sets are not able to accept new rules,
then IS algorithm will create a new independent set for
accepting new rules. Although the above method is
simple to implement, but it will greatly increase the
number of independent sets, at the same time, the rule
index table also is enlarged. So, ISSP algorithm proposed
a dynamic updates strategy by split rules. When there are
no independent sets to accommodate the new rules, the
new rules will be split into several sub-rules that are
ensured to be added to the existing independent sets
according to the actual situation of each independent set.
This strategy will maximize the use of existing
independent sets and the rule index table, thereby the
storage space consumption caused the dynamic updates
will be greatly reduced.
B. Limitations of dynamic updates in IS algorithm
b
new
and e
new
are used to indicate the start and end
points, respectively. For determining whether new rules
may be added to the existing independent sets, e
new
, as a
key value, is usually used to search the range lookup tree
of B
0
and once the lookup finished, the maximum value
b
x
0
less than or equal e
new
will be returned. If the new
rules are mutually independent with the rules connected
to the rule i, in rule index table of b
x
0
, then the new rules
may be added to the independent set B
i
, but it doesnt
work conversely. If the value of the rule i, in rule index
table is -1, then you need to compare the new rule with
the former rule of B
i.
When a new rule is added to independent set, both
cases may appear:
1) If the starting point of the new rule already exists in
B
0
, all rule index tables with b
new
b
k
0
e
new
should be
updated.
2) If the starting point of the new rule is not included
in B
0
, all rule index tables with b
new
b
k
0
e
new
should be
updated, then b
new
must be added to B
0
and a rule index
table should be created for it.
Based on Fig.3, Fig.7 shows the adding process of the
two new rules 14 and 15. The process of the new rule 14
1568 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
being added according IS algorithm is illustrated below:
point b
4
0
will be returned through searching the range
query tree of B
0
and the item 3 of the rule index table b
4
0
is -1, then the rule 14 will be added to B
3
since they are
mutual independent by comparing the rule 14 with the
previous non -1 rule namely the rule 5 in B
3
. It is
unnecessary for creating a new point for B
0
since the
starting point of the rule 14 is b
4
0
, and then the rule index
table of b
4
0
should be updated from (3, 4, -1) to (3,4,14).
When we add the new rule 15, the lookup tree will return
the point b
7
0
through searching the range lookup tree of
B
0
, and then we find the item 1 of the rule index table of
b
7
0
being the rule 10 which is ensured no mutual
independent with the new rule 15, so we could not add
rule to B
1
. The item 1 of the index table is -1, and then
the rule 15 should be compared with the previous non -1
rule namely the rule 7 in B
2
, since they are not mutual
independent, so the rule 15 could not be add to B
2
here;
next, the item 3 of the index table is -1, so the rule 15
should be compared with the previous non -1 rule namely
the rule 14 in B
3
. The rule 15 will eventually be added to
B
3
since they are mutual independent. Finally, in Rule 15,
the starting point b
new
, with no new endpoints included in
B
0
, should be added to B
0
, and then we should create a
rule index table (-1,7,15) and also update the index table
b
7
0
(10, -1, 15).
Figure 7. The structure graph of IS with new added rules
The new rules 14 and 15 may be added to existed
independent sets because of finding the appropriate
location, while some rules cant be added due to the
overlap of range, such as the rule16 in Fig.8. When this
happens, you can use a simpler approach that create a
new independent set and add the new rules into it. The
disadvantages of this above-mentioned strategy lie in
having a relatively large memory consuming that needs
space to store a new set and also increases the length of
the rule index table of all points in B
0
.
C. Dynamic updates strategy of ISSP
ISSP algorithm will present a new dynamic update
strategy through splitting the new rules into several sub-
rules will be ensured to be added to the existing
independent sets.
Fig.8 is a sketch diagram of adding new rules by
applying division strategy based on Fig.7. Since the range
of the new rule 16 may overlap with that of rule 9 and
rule 10 in B
1
, overlap with rule 7 in B
2
and overlap with
rule 11 in B
3
, so, the new rule 16 can not be added to the
existing independent set B1, B2 and B3. The dashed part
of Fig. 8 means constructing a new independent set B
4
for
the rule 16, and obviously the new independent set
contains only one rule. After the constructing B
4
, the
length of rule index in all points of B
0
should increase 1
and the original 3 items are increased to 4 items.
Figure 8. The add sketch based split rules
Assumed that the coverage area of rule 16 is
[begin
16
end
16
], in Fig.8, begin
16
= b
6
0
. It's not difficult
to find that the rule 16 may be split into two sub-rules
[b
6
0
b
9
0
] and [b
9
0
end
16
] denoted as 16A and 16B
respectively, where 16A may be added to independent set
B
3
and 16B may be added to independent set B
1
. Then the
addition operation will be finished only by updating the
rule index table of b
6
0
b
7
0
b
8
0
and b
9
0
with (12,7,16),
(10, -1,16), (16,8,11). This strategy may make full use of
existing space of independent sets, instead of creating
new independent sets for storing one new rule, and the
addition operation is a bit complicated. IS algorithm only
needs to traverse the rule index table of one point in B
0
,
while the division rules need to traverse the rule index
tables of all points from the range of [begin
16
end
16
] in
B0
Compared with the addition operation, the deletion
process only needs to traverse the endpoints of rule index
table in deletion range in B
0
and the corresponding rule is
assigned to -1, obviously is a bit simple. However, the
independent set may no longer be the largest independent
set after deletion, which means that the remaining rules of
independent set with some rules removed may be added
to other independent sets. In Fig.9, the dotted lines denote
the rules to be deleted, and with the rules 1, 4 and 5
removed, we easily find the rule 2 and rule 7 in B
j
can be
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1569
2011 ACADEMY PUBLISHER
added to the independent set B
i
, then they form a
maximum independent set together with other rules in B
i
.
Figure 9. The sketch of the rules deletion
Notice that rebuilding an independent set cost a lot,
and the rule deletion and addition are bound to exist
simultaneously under the dynamic updates [12-15], so we
should not cling to constructing the largest independent
sets, and the vacancy with the removed rules is naturally
supplemented by the new rules.
Therefore, the improved algorithm realizes the
dynamic updates by splitting rules to maximize the
existing independent sets and rule index tables, as a result
this reduce the consumption on storage space from
dynamic updates, greatly.
SIMULATION EXPERIMENT
Simulation platform, whose runtime environments
include Pentium 4 3.06G CPU, 512MB RAM and
Windows XP operating system, is programmed by C++.
The rule sets adopted by simulation experiment is not
randomly created but is derived from the rule tables of
core routers in the real computer network (data sources
CAIDA. Because the length of the rule index table
determine the maximum number of linear matches, and
the value is consistent with the number of independent
sets that determined by the scale of the rule library,
therefore, this article conducts some granularity analysis
based on the scale of rule base (from tens of thousands to
hundreds of thousands). The simulation experiments are
conducted based on the rule base with a large order of
magnitude since packet classification algorithms in high-
speed networks should support classification rules (in
millions). When the magnitude variable of the rule base,
respectively is valued 319337963714709620828,
the performance of dealing with data packets in different
orders of magnitude (from one million to ten million) is
discussed and described between IS algorithm and ISSP
algorithm. The simulation experiment is just a
preliminary examination for algorithm performance with
less consideration to the actual environmental factors in
real networks, and the packets used in experiments are
randomly generated.
Simulation program consists of several components:
the rule base pretreatment of packet classification, whose
main task is to extract rules information required by
algorithms and translate IP address in the form of
character string into a digital representation, for instance,
IP address: 202.103.96.1, its corresponding binary code is
1,100,011,001,100,111 0,110,000,000,000,001, we can
convert this into a binary integer code 3395772417. The
second component is the construction of independent set,
whose main work is to divide the rule base into several
independent sets in accordance with IS algorithm, and
then generate a rule index table for each point in B
0
and
sort every rule by priority. The third part is to construct
the balance tree, and generate the balance lookup tree of
scope for all the basic section of B
0
.
The results take the running time as the contrast
parameter in seconds (S), vertical axis dictates the
running time, and the abscissa dictates the number of the
packet (million).
When the scale of the rule base reaches = 319337, the
simulation results are showed in Fig.10:
Figure 10. The simulation results of the rule base scale: = 319337
When the scale of the rule base reaches = 96371, the
simulation results are showed in Fig.11:
Figure 11. The simulation results of the rule base scale: = 96371
When the scale of the rule base reaches = 47096, the
simulation results are showed in Fig.12:
1570 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
Figure 12. The simulation results of the rule base scale: = 47096
When the scale of the rule base reaches = 20828, the
simulation results are showed in Fig.13:
Figure 13. The simulation results of the rule base scale: =20828
From the simulation results: ISSP algorithm is better
than IS algorithm in running time, moreover, the larger
the size of rule base, the more obvious of the advantages.
As the rule base increases, the running time may be saved
20%-50% because ISSP algorithm may determine the
highest priority rule matching the data packet with only a
small amount of linear searches, instead of traversing the
whole rule index table. While IS algorithm only
determine the adopted rules after traversing the full rule
index tables, and the larger of the size of rule base, the
more time-consuming of the traversal. Overall, compared
with IS algorithm, ISSP algorithm can run more quickly
in the large magnitude rule base.
CONCLUSIONS
This paper analyzed the factors influencing the
performance of IS algorithms and proposed an improved
IS algorithm. This new algorithm maintains the original
characteristics of IS algorithm instead of traversing the
whole index table, as a result, the linear matching process
is greatly shortened. At the same time, it analyzes the
shortage that new independent sets created frequently
from dynamic updates greatly increase its dependence on
the consumption of storage space, and proposed an
improvement strategy of split rule for higher storage
efficiency in dynamic updates. The simulation results
show that the improved algorithm is more efficient in
running time and the split rules increase the storage
efficiency in dynamic updates.
ACKNOWLEDGMENT
I would like to acknowledge the wonderful work of our
team for this paper. This research is supported by the
Fundamental Research Funds for the Central
Universities (531107021115); Project (61070194)
supported by the National Natural Science Foundation of
China.
REFERENCES
[1] WANG Yong-gang, SHI Jiang-tao, DAI Xue-long, YAN
Tian-xin. Simulated Testing and Comparison of
Algorithms for Packet Classification[J]. Journal of
University of Science and Technology of China, 200434
4400-409. (in Chinese)
[2] S. Zezza, E. Magli, G. Olmo, and M. Grangetto,
"SEACAST: a protocol for peer-to-peer video streaming
supporting multiple description coding," in Proc. of ICME
2009, New York, USA, Jun. 28--Jul. 3, 2009.
[3] Sun X H, Sartaj S. Packet Classification Consuming Small
Amount of Memory. IEEE Transaction On Networking,
2005, 13(5): 1135-1145.
[4] Karp R, Wigderson A. A fast parallel algorithm for the
maximal independent set problem. Journal of the ACM ,
1985, 32(4): 762-773.
[5] S. Milani and G. Calvagno, "A Game Theory Based
Classification for Distributed Downloading of Multiple
Description Coded Videos," in Proc. of the IEEE ICIP
2009, Cairo, Egypt, Nov. 24--28, 2009.
[6] Chazelle B, Guibas L J. Fractional cascading I: A data
structuring Technique. Algorithmica, 1986, 1(2): 133162.
[7] [Geraci F, Pellegrini M, Pisati P, et al. Packet classification
via improved space decomposition techniques. In Proc of
INFOCOM. Miami, 2005, 13-17.
[8] SUN Yi,LIU Tong, CAI Yi-bing, HU Jin-long, SHI Jing-
lin. Research on Packet Classification Algorithm [J].
Application Research of Computers20072445-
11. (in Chinese)
[9] Haoyu Song, Fang Hao, Murali Kodialam, T.V. Lakshman,
IPv6 lookups using Distributed and Load Balanced Bloom
Filter for 100Gbps Core Router Line Cards, INFOCOM,
2009.
[10] V. Pus and J. Korenek. Fast and scalable packet
classification using perfect hash functions. In FPGA '09:
Proceeding of the ACM/SIGDA international symposium
on Field programmable gate arrays, pages 229--236, New
York, NY, USA, 2009. ACM.
[11] Gupta P, Lin S, Mckeown N. Routing Lookups in
Hardware at Memory Access Speeds. In: Proc of IEEE
INFOCOM. San Francisco, 1998, 12401247
[12] Song H Y, Jonathan T, John L. Shape Shifting Tries for
Faster IP Route Lookup. In Proc of ICNP. Boston, 2005,
358-367.
[13] Kamath P, Lan K C, Heidemann J, et al. Generation of
high bandwidth network traffic traces. In: Proc of
International Symposium on Modeling, Analysis and
Simulation of Computer and Telecommunications Systems.
FortWorth, 2002, 401-410.
[14] Song H Y, Jonathan T, John L. Shape Shifting Tries for
Faster IP Route Lookup. In: Proc of ICNP. Boston, 2005,
358-367.
[15] A. G. Alagu Priya, Hyesook Lim. Hierarchical packet
classification using a Bloom filter and rule-priority tries[J].
Computer Communications, 2010, 33(10): 1215-1226
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1571
2011 ACADEMY PUBLISHER
Enabling Awareness Driven Differentiated Data
Service in IOT
Haoming Guo
Beihang University Computer School, Beijing, China
Email: [email protected]
Shilong Ma and Feng Liang
Beihang University Computer School, Beijing, China
Email:{slma, Fengliang} @nlsde.buaa.edu.cn
AbstractIOT needs to organize sensor resources to
monitor events in real world for all time. As result, huge
number of data will be concentrated in the system. Due to
difference of sensors awareness on event, the concentrated
datas qualities are different. The data process tasks
performance will be affected without discrimination of
datas quality. This paper introduced an approach, called
Awareness Driven Schedule (ADS) that enables involved
sensor resources to provide differentiated data service by
their awareness, to address the issue. In the approach,
higher a sensor resources awareness on the event is, more
detailed data service it should provide. Requirement that
specify relation of sensors awareness and rules of data
collect job will be submitted initially. Constant and
continuous data channels are created to organize sensors. In
accordance of their awareness and task initial requirement,
sensor resources are scheduled to collect data and aggregate
to task through the channels. Sensor resources involvement
and service depend on its awareness and tasks
requirement. Upon ADS, a middleware is built for CEAs
(China Earthquake Administration) SPON(Seismological
Precursors Observation Net) and applied for Chinas
earthquake research applications. In the applications, Dull
data of low awareness sensors could be banned out,
applications may by more efficient.
Index Terms: Sensor; Resource Schedule; IOT; Web Service
I. INTRODUCTION
In IOT (Internet of things) systems, there are thousands
of sensor resources deployed all over the areas
[1]
. Upon
web technology, the sensor resources are accessible any
where and anytime. The primary work of sensor resource
is to monitor environment around and collect data.
Through the data, system can be aware of events
development and implement related proceedings. By the
approach, IOT is constructed as new frontiers between
human and real world
[2][3]
. Sensor can constantly and
continuously provide data service for applications
[4]
. The
mechanism, however, may lead to problems that affect
systems data processing performance.
For example, in earthquake application: Earth
Viberation Detect(EVD). EVDs goal is to catch
exceptional vibration exactly. For the purpose, a large
number of sensor resources are spread all over the area.
Once exceptional vibration takes place, the sensor nearby
may catch the signal. A number of sensors data are
aggregated by EVD to find out detailed information about
the event.
In EVDs case, sensor resource has two distinguished
features: 1.Datas quantity is huge. 2.Data quality is
changing. In IOT, sensor is to watch real world constantly.
It generates data continuously as it works. As result, a
large number of data may be concentrated for further
processing. Meanwhile datas accuracy is affected by its
working conditions and relations with event. As target
event changes, data accuracy may change either. Some
data may be highly valuable while others may be dull to
application. Because datas quality is not stable as the one
in conventional web, resources service to task should be
differentiated.
The goal of sensor schedule is to enhance
differentiated continuous services and dynamic resource
involvement in task in accordance with resources data
quality and its awareness. In this schedule approach, task
publishes data requirements. All resources are involved in
accordance with whether they could provide required data.
Meanwhile, layered data channels are built for tasks.
Resources link to corresponding data channel to transfer
data with different frequency. For example, for EVD,
higher accurate the data is, more frequent it is to be
transferred. If one resources data accuracy changed, it
cuts off current data channel link and recreate new link
with corresponding level channel. If resource could not
provide required data, it quit from the working group.
Once a datas data accuracy reached tasks requirement, it
creates connection with related data channel and provide
data to task. Through this awareness driven schedule,
resources are organized in accordance with their leveled
data so that unnecessary huge data transfer may be
reduced while guarantee credible and continuous data
service for tasks.
II. RELATED WORK
Derived from the traditional sensor network, Sensor
Web is now widely used in fields such as Bio-complexity
mapping of the environment, Military applications
[5]
,
flood detection
[6]
, traffic management
[7]
and etc.
1572 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
doi:10.4304/jnw.6.11.1572-1577
In traditional sensor web research, coverage is one
primary issue. It concerns with problem of how to
optimize schedule policy to improve energy efficiency
and guarantee sensor networks coverage under
requirement of performance. Paper
[8]
introduced an
approach, called Coverage Configuration Protocol(CCP),
to address the issue by analyzing sensors connectivity
and coverage relations. In the approach, the policy that
rules whether a sensor will be activated is set to be
whether the sensor is in an area not covered by other
activated sensor. Upon CCP, paper
[9]
introduced an
algorithm to reduce unnecessary activated sensor nodes
while avoiding blinding. The research introduced above
address issues to create possible coverage by least sensors
under requirement of energy conservation. As sensors
work continuously, sensors involvement of providing data
services will be dynamic in the coverage net. Paper
[10]
introduce an approach to address continuous time sensor
scheduling problem in which part of involved sources are
to be chosen to collect data at each time point. In the
approach, the sensors that are chosen at a particular time
are represented by controls. The control variables are
constrained to take values in a discrete set, and
switchings between sensors can occur in continuous time.
The researches concerns with how to organize least
sensors to provide measurements in dynamic. However,
the measurements of activated sensors are processed
equally. All sensors are viewed as data producer with
same accuracy and quality.
In 2005, the OGC (Open Geospatial Consortium) has
proposed a new sensor web integrated framework: SWE
(Sensor Web Enablement)
[11]
, which has become the De
facto standard in industry. SWE adopts SOAP and XML
from Web Service Architecture and aims at a unified
management of the heterogonous sensor resources via
Internet, including discovery, access, controlling and
notification with the plug-and-play feature.
Conforming to the SWE standard, NICTA Open Sensor
Web Architecture (NOSA) is a software infrastructure
aimed at harnessing massive computation power of grid
computing in sensor web. The core middleware includes
planning, notification, collection and repository services.
By splitting the information sensing and processing, it
harnesses the Grid Services to process the information,
which not only greatly reduces the load of sensor network,
but also simplifies the heterogeneous sensor network
management. The architecture allows the data
interoperability. However, the quality of data is not
considered as a standard way for result processing.
Resource scheduling is a NP-complete question
[12]
in
distributed systems, therefore up to now only locally
optimal solution is available. According to the mentioned
research above, it can be seen most of the
information-driven middleware implements the
scheduling by processing the data indiscriminately, this
scheduling mechanism is suitable for information
browsing activities, but inefficient for emergencies which
involves variety of parties and monitors distributed
dynamic event sources because of longer processing time
and more resource. Therefore a sophisticated and
effective mechanism is required for data filtering and
scheduling.
III. DESIGN OF AWARENESS DRIVEN SCHEDULE
ADSs goal is to enable differentiated continuous
services and dynamic resource involvement in task in
accordance with resources data quality and its awareness.
In ADS, tasks register awareness requirement(AR) to
Awareness Requirement Registration(ARR) and create
Task Awareness Schedule(TS) in Task Awareness
Scheduler Manager(TSM). In TS, data channels are
defined and built for data differentiation and service
forwarding. In the AR, information type and data value
definition are listed. TSR searches all resources who can
provide same data as defined by AR and invoke.
Resources create task object handler(TOH) in local task
object handler pool(THOP). Once a resource is aware of
target event, it collects data and check data channels
definition from related TS. Data transfer frequencies are
listed in TSs data channel definition. Resource retrieves
the frequency information by which it transfers data. In
TSs data channel definition, data process services are
defined. All data from one channel is about to forward to
the specific service. If resources data value shift from
one range to another, TS may link related data channel to
resource and reassign data transfer job. If resource lost
awareness of event, it cut off link to TS. If a resource
finds the event, it checks TSR with the event information
and retrieve related AR upon which links to TS are built.
Through this approach ADSs goal is rtealized. The
whole view of ADS is shown as below.
Figure 1. Whole view of ADS.
A. Definitions in ADS
Definition 1: Awareness requirement (AR). AR is
tasks awareness requirement definition. In AR, task
specifies requirements of sensor resources to pool as
working group and tasks data channel definitions. Its
definition is shown as below:
AR=(ID, taskID, resPTable, dataChannelList)
(1) ID is the ARs identity;
(2) taskID is current tasks identity
(3) resPTable={p
i
|i=1,2,..n}, it specifies what kind of
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1573
2011 ACADEMY PUBLISHER
resource task needs. It consists of a table of property. p
i
=(name,type,value) name is propertys name. type is
propertys type. While looking for resources from
resource registration at the beginning, if ARs all property
requirements matche one resources properties, the
resource may be included in tasks initial working group.
(4) dataChannelList={dcf
i
| i=1,2,n}.
dataChannelList is to specify data channels link between
resource and related data process in accordance with
resources awareness or its data accuracy.
dcf=(ID, taskID, proc, maxValue, minValue,
frequency, transMod, cacheSize);
proc is target data process in task to process the data
with required accuracy.
maxValue and minValue are to define range of the
channel. If a resources data is within the range, the
resource will be linked to the channel and the resource
will send data by the frequency.
transMod = { flow , periodic}. It specifies by
which way the involved sensor resources transfer data.
Definition 2: Resource Handler Object(RHO)
RHO is for task to receive data from corresponding
sensor resource. Once a sensor resource is invoked, a
RHO will be created and pooled in related data channel.
RHO is defined as below:
RHO=(ID, taskID, resBinding, dataCache);
(1) ID is the RHOs identity. Through ID, RHOPool
seek and retrieve the object.
(2) taskID is to maintain RHOs hosted task identity.
(3) resBinding is to specify the binding information.
Through the information, RHO may redirect commanding
messages to right resource.
(4) dataCache ={value
i
| i=1,2,..n}. its used to cache
data collected by resource by time order.
Defination 3: Task Handler Object(THO)
THO is created by sensor resource for data service
request. In accordance with data quality, THO collects
data and send data back to paired RHO. If its data range
changed, RHO may change from original pool to other
data channels pool and the new channels information is
forwarded to THO to adjust its data collection job. THOs
definition is shown as below:
THO ={ID, taskID, RHOID, dataCache, dcf}
(1) ID: is THOs identity.
(2) taskID is to maintain THOs hosted task identity.
(3) RHOID is paired RHOs identity.
(4) dataCaches definition is same as RHOs.
(5) dcfs definition is same as ARs
Definition 4: Task Awareness Scheduler(TS)
TS is to keep contact with resources, receive and
forward data to right data process object in task and
schedule resources service. Its definition shown as
below:
TS={ARID, dcs}
(1) ARID is corresponding with ARID. One AR has
one TS created.
(2) dcs is data channel list in TS. Its consist of a
group of data channel: dcs={dc
j
| j=1,2,.m };
dc={ID, dcf ID, RHOPool}
ID is the data channels identity, dcfID is data
channels definition identity through which data channel
may retrieve information from corresponding AR.
RHOPool is pool of resource handler object(RHO).
B. Registration of Tasks Awareness Requirement
Awareness Requirement Registration(ARR) is to
organize tasks awareness requirement(AR) through
which resource could involve into tasks working group.
Once tasks AR is registered, its TS will be created in
Task Awareness Scheduler Manager(TSM). TSM is
consist of a group of TS as: TSM={TS
i
|i=1,2,.n}.
After ARs TS is created, ARR searches for all
resources corresponding to ARs resPTable
specification. The result resources of the search are
organized as initial working group of task. All resources
in the working group would be invoked to create link
with ARs TS.
The whole AR registration is shown as below:
Figure 2. Communication for AR registration.
. . . .
. . ( )
. @
. ( ) . .0
RegProc Task regAR AR ARR
createTS AR TSM createTS TSID
ARRlookFor resPTable AR
RRlookFor resList ARRinvokProc
= < >
< >
< >
. .
. .
. ( ).
.0
invokeProc ARRivkTHO TSID THOPool
createCon TSID THO
getTS TSID TSM getTS TS
createConnProc
= < >
< >
< >
Figure 3. Communication for TS registration.
1574 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
. .
( ). .
( ).0
createConnProc THOgetDC dataCache TS
getDC DC createRHO resBinding DC
createRHORHOID
= < >
< >
C. Data Linkage for Resource and Task Process
After data channel connection, the resource needs to
get relevant data channels information. By the
information, resource retrieves data channels definition
dcf from ARR and. In dcf, data channels data range is
defined and the resource transfer data by rules of dcf
while data is within the range. Resources THO return
data to tasks data channel first. Data channel forward the
data to related RHO. RHO looks for data channels
definition from ARR and retrieve process object of task
which is persisted in dcf. Then RHO transfer the data to
the process object.
Figure 4. Communication for data collection task.
@
. ( ).
. .
.
.
.0
. . .
.
k k
k K
k k
k k k
k
k k m m
m
serProc THO getDcf dcfID DC
ARR getDcf dcf
dataCollectProc THO
returnData dataCache DC
forwardData dataCache RHO
TaskDataProc
TaskDataProc RHO findDcf dcfID DC
mapDcf dcfID ARR
mapDcf
= < >
< >
< >
= < >
< >
( ) .
. . ( ).
.0
m m m m
k m m m m
k m k
dcf findDcf dcf
RHO getProc NULL dcf getProc Proc
RHO transData dataCache
< >
< >
< >
In the in tasks awareness requirement, tasks process
object is persisted. During implementation, THO
transfers data back to RHO through it working data
channel. RHO looks for the process object linked to
hosted data channels definition and forward data to it. In
ADS, resource may provide data service constantly by
this approach.
Resources are to monitor real worlds event and collect
data. In ADS, resources data collection job is ruled by its
linked data channel. In data channels specification,
maxValue and minValue are to define current data
channels range. If one resources collected is within the
range, it keeps data collection for current data channel.
Otherwise, resource looks for new channel in current TS
and collect data by the new ones rule. In data channels
definition, transMod is defined as flow or periodic. If
one data channel is defined as flow, the linked
resources should cache all data of the ruled intervals
which will be transferred back by the frequency. If one
data channel is defined as periodic, the resources
calculate average value of the ruled intervals and only the
average value will be transferred back instead of whole
data cache.
D. Resource Awareness Orientation
With development of monitored event, resources
awareness may change. In data channels definition,
maxValue and minValue are to define current data
channels range. Once a resources collected data is out
of current data channels range definition. It may check
ARR for new oriented data channel and shift related RHO
from old hosted data channel to the new one. If resource
lost awareness of the event, it will be removed from data
channel. During this process, the ROH shift request
message is defined as:
changeDCReq=(ID, RHOID, resBinding, taskID,
oldDCID, newDCID)
In the request message, RHOID is related RHO
identity. TS retrieves the object through the identity.
resBinding is information about resource. taskID is
current tasks identity through which locates related TS.
oldDCID is current linked data channels ID and
newDCID is the new data channel to link. The RHO
remove request message is defined as :
removeRHOReq=( ID, RHOID, resBinding, taskID,
oldDCID)
The process is shown as below:
Figure 5. Communication for awareness shift.
.
( ).
(( ). .
. @ . .0
( ). .
.
k K
k k
j k k
l k k j
k
AwChangeProc THO chkDC dataCache ARR
chkDC chkResult
chkResult NULL removeROH removeROHReq
TS deleteROH RHO removeROHReq
chkResult dcf changeDC changeDCReq TS
shiftROH changeDC
= < >
= < >
< > +
= < >
< .
. .0)
k m
k k l
Req DC
moveRHO RHO DC
>
< >
If a
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1575
2011 ACADEMY PUBLISHER
resource begins to sense the event, it checks ARR with
the data and its own property for tasks which are
require the data from ARR. ARR may return a list of
available tasks AR. The resource create connection with
the tasks TS and begin to provide data service. The
check message is defined as:chkTaskReq=(ID, dataCache,
pTable) ;the ARR returned message is defined as:
tskResp=(TSID
1
, TSID
2
,TSID
n
)
the process is shown as below:
Figure 6. communication for service shift.
1 1 1 1
1 1 1 1
2 2
. .
( ). . . .
( @
@ . ( ). .0|
@
chkNewTsk dataCollectProc chkTsk chkTaskReq ARR
chkTsk tskResp createTHOs tskResp THOPool
createCon TSID tskResp THOgetTS
TSID tskResp TSM getTS TS createConnProc
createCon TSID tsk
= < >
< >
< >
< >
<
2 2
2 2 2 2
@ . ( ). .0|......
@
@ . ( ). .0)
n n n n
n n n n
Resp THO getTS
TSID tskResp TSM getTS TS createConnProc
createCon TSID tskResp THO getTS
TSID tskResp TSM getTS TS createConnProc
>
< >
< >
< >
IV. APPLICATION AND TEST
Upon ADS, a Seismological Sensor Resource Data
Service System(SSRDSs) is built for CEAs
Seismological General Scientific Data Platform(SGSDP)
built for SPON.
In test, 12 resources are deployed to simulate
application environment. The resources data collection
working frequency is about 60Hz. Two tasks were
implemented for comparison. Task 1 collected all data
directly from resource without discrimination. Task 2
created 4 data channel and resources provided
differentiated data services. During implementation, 12
resources transferred data back to No1 task at about 360
data per second. In task 2, resources transferred at about
168 data per second. For task 2, data load was 47% of
task 1. Data lost may lead to certain accuracy lost. Figure
7 shows two tasks data aggregation curve. Task 2s
results accuracy is lower than task 1s. However, it was
within applications accuracy requirement.
The test above shows effectiveness of ADS for data
concentrated applications of IOT. Through ADS, task
may organize resources to provide differentiated data
service on their awareness capability that enable
applications gain data within requirement of accuracy and
reducing unnecessary dull datas burden.
0.86
0.88
0.9
0.92
0.94
0.96
0.98
1
1 2 3 4 5 6 7 8 9 10 11 12 13 14
time(minute)
a
m
p
l
i
t
u
d
e
o
f
v
i
b
r
a
t
i
o
n
No1 task
No2 task
Figure 7. Comparison of ADS and conventional.
Task may gain higher accuracy by adjust data
channels setting. In another test, 500 sensor resources are
organized to simulate real application of EVD. In test, by
different data channel setting, application can gain
different quality data. The graphic below shows details
about data process nodes load with different data
accuracy.
resource load
0%
10%
20%
30%
40%
50%
60%
70%
80%
a
c
c
u
r
a
c
y
=
0
.
7
a
c
c
u
r
a
c
y
=
0
.
7
5
a
c
c
u
r
a
c
y
=
0
.
8
a
c
c
u
r
a
c
y
=
0
.
8
5
a
c
c
u
r
a
c
y
=
0
.
9
a
c
c
u
r
a
c
y
=
0
.
9
5
a
c
c
u
r
a
c
y
=
1
data accuracy
r
e
s
o
u
r
c
e
l
o
a
d
p
e
r
c
e
n
t
a
g
e
cpu load memory load
Figure 8. Resource load of ADS.
V. CONCLUSION AND FUTURE WORK
This paper introduced an approached called
(Awareness Driven Schedule)ADS. Through tasks
requirement, ADS organize all involved resource and
enable them to provide differentiated data service to task.
Higher a resources awareness is, more detailed data it
should collect and transferred. As result, low awareness
of resources only need to provide limited and periodic
data service. Resources data services are differentiated
with their awareness. Dull data are banned out.
In real world, event may be changing so that a tasks
awareness requirement over resources could be dynamic.
With development of event, data channels range and rule
should change accordingly to ensure better surveillance
work. As result, Awareness requirements data channel
that links process object and resources need a forecasting
approach to capture change tendency of data and help
task to readjust its requirement. Currently, ADS leaves
the work to task. In future work, research should be done
in this field to provide better data service for task.
VI. ACKNOWLEDGEMENT
This research work was supported by China
1576 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
Earthquake Administrations program for
Seism-Scientific Research "Research in Online
Processing Technologies for Seismological Precursory
Network Dynamic Monitoring and Products" (NO.
201008002)
REFERENCES
[1] Hakima Chaouchi:The Internet of Things: Connecting
Objects, Wiley-ISTE,2010
[2] Lu Yan, Yan Zhang, Laurence T. Yang, Huansheng Ning:
The Internet of Things: From RFID to the
Next-Generation Pervasive Networked Systems (Wireless
Networks and Mobile Communications) Auerbach
Publications,2008
[3] Hu W, Bulusu. N. Chou, C. T, J ha. S, Design and
evaluation of a hybrid sensor network for cane toad
monitoring.ACM Trans. Sen. Netw., ACM, vol.5,
pp.1-28, 2009
[4] J . Schelp and R. Winter, "Business application design and
enterprise service design: a comparison", International
J ournal of Services Sciences, vol. 1, pp. 206--224, 2008.
[5] Akyildiz. I. F, Su.W, Sankarasubramaniam. Y and
Cayirci. E, Wireless sensor networks: a survey.Computer
Networks, Computer Networks , Vol.38(4), pp.393-422,
March 2002
[6] Faradjian. A,Gehrke. J and Bonnet. P, GADT: A
Probability Space ADT for Representing and Querying
the Physical World. Proceedings of the 18th International
Conference on Data Engineering, pp. 201-211, 2002
[7] Shih.E, Cho. S.-H, Ickes.N, Min. R , Sinha. A,Wang. A
and Chandrakasan. A, Physical layer driven protocol and
algorithm design for energy-efficient wireless sensor
networks MobiCom '01: Proceedings of the 7th annual
international conference on Mobile computing and
networking, ACM, pp. 272-287, 2001
[8] Xiaorui. Wang, Guoliang. Xing, Yuanfang Zhang,
Chenyang Lu, Robert Pless and Christopher Gill,
Integrated coverage and connectivity configuration in
wireless sensor networks, SenSys '03 Proceedings of the
1st international conference on Embedded networked
sensor systems , ACM, pp. 28-40, 2003
[9] Yuheng Liu, J uhua Pu, Shuo Zhang, Yunlu Liu and Zhang
Xiong, A Localized Coverage Preserving Protocol for
Wireless Sensor Networks, Sensors,vol 9(1), pp. 281-302,
2009
[10] H. W. J. Lee, K. L. Teo and Andrew E. B. Lim, Sensor
scheduling in continuous time, Automatica,vol 37(12),
pp. 2017-2023, 2001
[11] Botts. M, Percivall. G, Reed. C and Davidson. J , Sensor
Web Enablement: Overview And High Level
Architecture.OGC White Paper, OGC, pp. 07-165, 2007
[12] D. Fernandez-Baca, Allocating Modules to Processors in
a Distributed System. IEEE Transactions on Software
Engineering, vol.15, pp.1427-1436, 1989.
Haoming Guo, male, born in 1978. Post
Doctor the College of Mathematics and
Systems Engineering), Beihang University.
His main research interests include grid,
computation model in network, Data
Integration, and IOT Application, etc.
Shilong Ma, male, born in 1953. Professor and PhD
supervisor of the College of Computer Science and Technology,
Beihang University. His main research interests include grid,
computation model in network, and logic and behavior in
computing, etc.
Feng Liang is currently a Ph. D student in National
Laboratory for Software Development and Environment,
Beihang University with the research interests in grid
computing and cloud computing.
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1577
2011 ACADEMY PUBLISHER
Enhancement of an Authenticated 3-round
Identity-Based Group Key Agreement Protocol
Wei Yuan
Department of Computer Science and Technology, J ilin University, Changchun, China
Email: [email protected]
Liang Hu
Department of Computer Science and Technology, J ilin University, Changchun, China
Email: [email protected]
Hongtu Li
Department of Computer Science and Technology, J ilin University, Changchun, China
Email: [email protected]
J ianfeng Chu
Department of Computer Science and Technology, J ilin University, Changchun, China
Corresponding author, Email: [email protected]
Yuyu Sun
College of Computer Science and Technology, J ilin University, Changchun 130012, China,
Software Institue, Changchun University, Changchun 130022, China
E-mail: [email protected]
AbstractIn 2008, Gang Yao et al. proposed an
authenticated 3-round identity-based group key agreement
protocol, which is based on Burmester and Desmedts
protocol proposed at Eurocrypt 94. However, their protocol
can only prevent passive attack. If the active attack is
allowed, the protocol is vulnerable and an internal attacker
can forge her neighbors keying material. It is obvious that
the protocol do not achieve the aim of authentication. In this
paper, we discuss the flaws to attack this protocol and
propose an enhanced provably-secure protocol based on
their protocol. At last, we make a detailed security analysis
of our enhanced authenticated identity-based group key
agreement protocol.
Index Termsauthentication, identity-based, key agreement,
bilinear pairing, cryptanalysis, attack
I. INTRODUCTION
Secure and reliable communications [1] have become
critical in modern society. The centralized services such
as file sharing, can be changed into distributed or
collaborated system based on multiple systems and
networks. Basic cryptographic functions such as data
confidentiality, data integrity, and identity authentication
are required to construct these secure systems.
Key agreement protocol [2] [3] [4] allows two or more
participants, each of whom has a long-term key
respectively, to exchange information over a public
communication channel with each other. However, the
participants can not ensure others identity. Though Alice
wants to consult a session key with Bob, Alice can not
distinguish it if Eve pretends that she is Bob. The
authenticated key agreement protocol overcomes this
flaw and makes unfamiliar participants to ensure others
identities and consult a common session key in the public
channel.
A.Shamir [5] introduced an identity-based public key
cryptosystem in 1984, in which a users public key can be
calculated from his identity and defined hash function,
while the users private key can be calculated by a trusted
party called Private Key Generator (PKG). The identity-
based public key cryptosystem simplifies the program of
key management and increases the efficiency. In 2001,
Boneh and Franklin [6] found bilinear pairings positive
applications in cryptography and proposed the first
practical identity-based encryption protocol with bilinear
pairings. Soon, the bilinear pairings become important
tools in constructing identity-based protocols and a
number of identity-based encryption or signature schemes
[7], [8], [9], [10], [11] [12] and authenticated key
agreement protocols [13], [14], [15], [16] [17] were
proposed.
In 2008, Gang Yao, Hongji Wang, and Qingshan J iang
[18] proposed an authenticated 3-round identity-based
group key agreement protocol. The first round is for
identity authentication, the second round is for key
agreement, and the third round is for key confirmation.
Their protocol is based on the protocol of Burmester and
Desmedt [19] which was proposed at Eurocrypt 94. They
declared the proposed protocol provably-secure in the
random oracle model.
1578 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
doi:10.4304/jnw.6.11.1578-1585
In this paper, we show that an authenticated 3-round
identity-based group key agreement protocol proposed by
Gang Yao et al. is vulnerable: an internal attacker can
forge her neighbors keying material. Then we propose an
improved provably-secure protocol based on Burmester
and Desmedts as well. At last, we summarize several
security attributes of our improved authenticated identity-
based group key agreement protocol.
II. PRELIMINARIES
A. Security attributes
To get a rational key agreement protocol, Marko
Hlbl, Tatjana Welzer and Botjan Brumen defined some
security attributes which have to be fulfilled by their
secure authenticated key agreement protocol. Assume A,
B and C are three honest entities. It is desired for
authenticated key agreement protocol to possess the
following security attributes [15]:
1. Known-Key Security. A unique secret session key
should be generated in each round of a key agreement
protocol. Each session key generated in one protocol
round is independent and should not be exposed if other
secret session keys are compromised, i.e. the compromise
of one session key should not compromise other session
keys.
2. Forward Secrecy. If long-term private keys of one or
more of the entities are compromised, the secrecy of
previously established session keys should not be
affected. We say that a protocol has forward secrecy if
some but not all of the entities' long-term keys can be
corrupted without compromising previously established
session keys, and we say that a protocol has perfect
forward secrecy if the long-term keys of all the
participating entities may be corrupted without
compromising any previously established session key.
3. Key-Compromise Impersonation Resilience.
Suppose that the long-term secret key of one participating
entity is disclosing (e.g. A). Obviously, an adversary who
knows this secret key can impersonate this entity to other
participating entities (e.g. A to B and C). However, it is
desired that this disclosure does not allow the adversary
to impersonate other entities (e.g. B and C) to the entity
whose long-term secret key was disclosed (e.g. A).
4. Unknown Key-Share Resilience. After the protocol
ran, one entity (e.g. A) believes she shares a key with the
other participating entities (e.g. B and C), while those
entities (e.g. B and C) mistakenly believe that the key is
instead shared with an adversary. Therefore, a rational
authenticated key agreement protocol should prevent the
unknown key-share situation.
5. Key Control. The key should be determined jointly
by all participating entities (e.g. A, B and C). None of the
participating entities can control the key alone.
The inclusion of identities of the participating entities
and their roles in the key derivation function provide the
resilience against unknown key share attacks and
reflection attacks. The inclusion of transcripts in the key
derivation function provides freshness and data origin
authentication.
B. Bilinear pairing
Let P denote a generator of
1
G , where
1
G is an additive
group of large order q and let
2
G be a multiplicative group
with
1 2
| | | | G G = . A bilinear pairing is a map
1 1 2
: e G G G which has the following properties:
1.Bilinearity:
Given
1
, , Q W Z G , ( , ) ( , ) ( , ) e Q W Z e Q W e Q Z + = an
d ( , ) ( , ) ( , ) e Q W Z e Q Z e W Z + = . There for
any ,
q
q b Z :
( , ) ( , ) ( , ) ( , ) ( , )
ab a
e aQ bW e Q W e abQ W e Q abW e bQ W = = = =
.
2. Non-degenerative:
( , ) 1 e P P , where 1 is the identity element of
2
G .
3. Computable:
If
1
, Q W G , one can compute
2
( , ) e Q W G in
polynomial time efficiently.
C. Computational problems
Let
1
G and
2
G be two groups of prime order q, let
1 1 2
: e G G G be a bilinear pairing and let P be a
generator of
1
G .
Discrete Logarithm Problem (DLP)
Given
1
, P Q G , find
q
n Z such that
P nQ = whenever such n exists.
Computational Diffie-Hellman Problem (CDHP)
Given ( )
1
, , P aP bP G for
*
,
q
a b Z , find the
elementabP .
Bilinear Diffie-Hellman Problem (BDHP)
Given ( )
1
, , , P xP yP zP G for
*
, ,
q
x y z Z ,
compute
2
( , )
xyz
e P P G
D. Introduction of BR security model
To describe the security model for entity authentication
and key agreement aims, M. Bellare and P. Rogaway
proposed the BR93 model [13] for two-party
authenticated key agreement protocol in 1993 and the
BR95 model [14] for three-party authenticated key
agreement protocol in 1995. In BR model, the adversary
can control the communication channel and interact with
a set of
,
x y
i
U U
oracles, which specify the behavior
between the honest players
x
U and
y
U in their i
th
instantiation. The predefined oracle queries are described
informally as follows:
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1579
2011 ACADEMY PUBLISHER
Send (
x
U ,
y
U , i, m): The adversary sends message
m to the oracle
,
x y
i
U U
.The oracle
,
x y
i
U U
will return the
session key if the conversation has been accepted
by
x
U and
y
U or terminate and tell the adversary.
Reveal (
x
U ,
y
U , i): It allows the adversary to
expose an old session key that has been previously
accepted. After receiving this query,
,
x y
i
U U
will send this
session key to the adversary, if it has accepted and holds
some session key.
Corrupt (
x
U , K): The adversary corrupts
x
U and
learns all the internal state of
x
U .The corrupt query also
allows the adversary to overwrite the long-term key of
corrupted principal with any other value K.
Test (
x
U ,
y
U , i): It is the only oracle query that
does not correspond to any of the adversarys abilities.
If
,
x y
i
U U
has accepted with some session key and is being
asked a Test(
x
U ,
y
U , i) query, then depending on a
randomly chosen bit b, the adversary is given either the
actual session key or a session key drawn randomly from
the session key distribution.
Freshness. The notion is used to identify the session
keys about which adversary should not know anything
because she has not revealed any oracles that have
accepted the key and has not corrupted any principals
knowing the key. Oracle
,
i
A B
is fresh at the end of
execution, if, and only if, oracle
,
i
A B
has accepted with
or without a partner oracle
,
i
B A
, both oracle
,
i
A B
and its
partner oracle
,
i
B A
have not been sent a Reveal query,
and the principals A and B of oracles
,
i
A B
and
,
i
B A
(if
such a partner exists) have not been sent a Corrupt query.
Security is defined using the game G, played between
a malicious adversary and a collection lection
of
,
x y
i
U U
oracles and instances. The adversary runs the
game simulation G, whose setting is as follows.
Phase 1: Adversary is able to send any Send, Reveal,
and Corrupt oracle queries at will in the game simulation
G.
Phase 2: At some point during G, adversary will
choose a fresh session on which to be tested and send a
Test query to the fresh oracle associated with the test
session. Note that the test session chosen must be fresh.
Depending on a randomly chosen bit b, adversary is
given either the actual session key or a session key drawn
randomly from the session key distribution.
Phase 3: Adversary continues making any Send,
Reveal, and Corrupt oracle queries of its choice.
Finally, adversary terminates the game simulation and
outputs a bit b, which is its guess of the value of b.
Success of adversary in G is measured in terms of
adversarys advantage in distinguishing whether
adversary receives the real key or a random value. A wins
if, after asking a Test (
x
U ,
y
U ,i) query, where
,
x y
i
U U
is
fresh and has accepted, adversarys guess bit b equals
the bit b selected during the Test (
x
U ,
y
U ,i) query.
A protocol is secure in the BR model if both the
validity and indistinguishability requirements are
satisfied:
Validity. When the protocol is run between two
oracles in the absence of a malicious adversary, the two
oracles accept the same key.
Indistinguishability. For all probabilistic,
polynomial-time (PPT) adversaries A, AdvA (k) is
negligible.
III. REVIEW OF GANG YAO ET AL.S PROTOCOL
Let
1
, ,
n
U U be n participants, and PKG be the
private key generator. Let
i
ID be the identity of
i
U .
Suppose that
1
G and
2
G are two cyclic groups of order q
for some large prime q.
1
G is a cyclic additive group
and
2
G is a cyclic multiplicative group. Let P be an
arbitrary generator of
1
G , and
1 1 2
: e G G G be a
bilinear pairing.
In Gang Yao et al.s protocol, the following two steps
prepare the system parameters:
Setup:
The PKG chooses a random number
*
q
s Z and
set R sP = . The PKG also chooses { }
*
*
0 1
: 0,1 H G to
be a Map-to-Point hash function, and H is a
cryptographic hash function. Then the PKG publishes
system parameters{ }
1 2 0
, , , , , , , q G G e P R H H , and keeps
s as its master key.
Extract:
Given a public identity { }
*
0,1 ID , the PKG
computes the public key
0 1
( ) Q H ID G = and generates
the associated private key S sQ = . The PKG passed S as
the private key to the user via some secure channel.
Let n users
1
, ,
n
U U with respective public
keys ( )
0
( ) 1
i i
Q H ID i n = decide to agree upon a
common secret key.
i i
S sQ = is the long term secret key
of
i
U sent by the PKG on submitting
i
U s public
identity( ) 1 i n . Let U denote
1
|| ||
n
U U .
We assume that
1
U is the protocol initiator. The
protocol may be performed in three rounds as follows:
Round 1: Identity Authentication
Every participant
i
U generates a random
number
*
i q
r Z , computes
1580 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
Figure 1. Attacking model.
i i
E r P = , ( , ( , ))
i i i i
F H U e E R S r R = +
And broadcasts
i
E and
i
F .
After receiving every
j
E and
j
F ( ) 1 , j n j i ,
i
U verifies that none of them
equals 1. If the check succeeds,
i
U verifies whether
( , ) ( ( , ( , )) , )
j j j j
j i j i
e F P e H U e E R Q E R
= +
holds or not. If the verification succeeds,
i
U continues with the next round. Otherwise, the
protocol execution is terminated and a notification
of failure will be broadcasted.
Round 2: Key Agreement
i
U computes
0 1 1
( || || || || )
n n
T H ID E ID E = ,
then he computes
i
Y ,
i
X as follows.
i i
Y rT = ,
1 1
( )
i i i i
X r E E T
+
= + ,
And broadcasts
i
X and
i
Y .
After receiving every
i
X and
i
Y ( ) 1 , j n j i ,
i
U verifies whether
( , ) ( , )
j j
j i j i
e Y P e E T
=
holds or not. IF the verification
succeeds,
i
U continues with the next round.
Otherwise, the protocol execution is terminated and
a notification of failure will be broadcasted.
Round 3: Key Confirmation
i
U computes the keying material
i
Z as
( )( )
1
1
1
( 1 , )
n
i i i i j i j
j
Z e nr E n j X Y R
+ +
=
= +
,
then he computes
( )
1 1 1
|| || || || || || || || || || ||
i n n n i
C H i U E E X X Y Y Z =
and broadcasts
i
C .
After receiving every
j
C ( ) 1 , j n j i ,
i
U
computes the session key as
1 1 1
( || || || || || || || ||
i n n
K H U E E X X Y =
1
|| || || || || )
n i n
Y Z C C .
Otherwise,
i
U terminates the protocol execution and a
notification of failure will be broadcasted.
IV. CRYPTANALYSIS OF GANG YAO ET AL.S PROTOCOL
In our view, two important principles should be
attention: Before we want to protect a message, we
should know whether it really needs to be protected.
After finishing our protocol, we should ensure all the
valuable messages have been protected. In Gang Yao et
al.s protocol, to derive the session
key,
1 1 1
( || || || || || || || || ||
i n n
K H U E E X X Y =
1
|| || || || )
n i n
Y Z C C , all the parameters except
i
Z can
be gained from the broadcast messages. It is important to
ensure the transmitted messages are not modified, forged,
or deleted by attackers. In the round
1,
i
E ( ) 1 i n should be protected and in the round 2,
i
X and
i
Y should be protected. However,
only
i
E and
i
Y here are protected in Gang Yao et al.s
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1581
2011 ACADEMY PUBLISHER
protocol. Because the equation
( , ) ( , )
j j
j i j i
e Y P e E T
=
, where
0 1 1
( || || || || )
n n
T H ID E ID E = , user
i
U can ensure
that
j
Y is not modified or forged but
i
X is transmitted
without any verification. Actually, both
i j
X
+
and
i j
Y
+
are
needed in the equation
( )( )
1
1
1
( 1 , )
n
i i i i j i j
j
Z e nr E n j X Y R
+ +
=
= +
in the
round 3. Thus,
i
X can be replaced by
i i
X Y = .
Then
i
Z can be expressed as
1
( , )
i i i
Z e nr E R
= . Due to
the characteristic of bilinear
pairing:
1 1
( , ) ( , )
i i i i i
Z e nr E R e nr r P sP
= =
1 1
( , ) ( , )
n n
i i i i
e r sP r P e r R E
= = That is, with the random
number
i
r , any user
i
U can generate
1 i
U
+
s keying
material
1 i
Z
+
.The attacking model is described as the
figure 1.
As a result, an attacker who can control the
communication channel has the ability to intercept and
forge all the
i
X s. If a malicious user
e
U wants to
forge
1 e
U
+
s keying material
1 e
C
+
, she can compute
1 1
( , )
n
e e e
Z e r R E
+ +
= and
'
1 1
( 1|| || || || ||
e n
C H e U E E
+
= +
1 1 1
|| || || || || || )
n n e
X X Y Y Z
+
. Finally, she can
broadcast
'
1 e
C
+
to replace
1 e
C
+
.
V. IMPROVEMENT OF GANG YAO ET AL.S PROTOCOL
In this section, we first review Bermester and
Desmedts group key exchange protocol. Then we
propose a non-authentication protocol based on their
protocol with bilinear pairing. Finally, we improve the
non-authentication group key agreement protocol to an
authentication group key agreement protocol
A. Bermester and Desmedts group key exchange
protocol
Let n be the size of the group, the Bermester and
Desmedts group key exchange protocol works as
follows:
Each participant
i
U chooses a random number
i
x and
broadcasts
i
x
i
z g = ;
Each participant computes
1
i
x
i i
Z z
= and
1
1 1
i i
x x
i i i
Z z z
+
+ +
= = , and broadcasts
1 i
i
i
Z
X
Z
+
= ;
Each participant computes his session key
as
1 2
1 2
n n n
i i i i i n
K Z X X X
+ +
= .
It is easy to see that each
i
U can compute the same
session key
1 2 2 3 1
1
n
n
x x x x x x
i j
j
K Z g
+ + +
=
= =
B. Non-authentication protocol transformed from
Bermester and Desmedts protocol
1
G and
2
G are two cyclic groups of order q for some
large prime q.
1
G is a cyclic additive group and
2
G is a
cyclic multiplicative group. Let P be an arbitrary
generator of
1
G ,
1 1 2
: e G G G be a bilinear pairing
and n be the size of the group, the non-authentication
protocol works as follows:
Each user
i
U chooses a random number
*
i q
r Z and
broadcasts
i i
z r P =
Each user
i
U computes
1 i i i
Z r z
= ,
1 1 i i i
Z r z
+ +
= , and
broadcasts
1 i i i
X Z Z
+
=
Each player
i
U can computes his session key as:
1 2
( 1) ( 2)
i i i i i n
K nZ n X n X X
+ +
= + + + +
It is easy to see that for each
i
U ,
( )
1 2 2 3 1
1
n
i j n
j
K Z r r r r r r P
=
= = + + +
C. Our authenticated identity-based group key
agreement protocol
Let
1
, ,
n
U U be n participants, and PKG be the
private key generator. Let
i
ID be the identity of
i
U .
Suppose that
1
G and
2
G are two cyclic groups of order q
for some large prime q.
1
G is a cyclic additive group
and
2
G is a cyclic multiplicative group. Let P be an
arbitrary generator of
1
G , and
1 1 2
: e G G G be a
bilinear pairing.
Our protocol is described as follows:
Setup:
The PKG chooses a random number
*
q
s Z ,
sets R sP = , chooses two hash functions,
0
H and H,
where { }
*
*
0 1
: 0,1 H G . Then the PKG publishes
system parameters{ }
1 2 0
, , , , , , , q G G e P R H H , and keeps
the master key s as a secret.
Extract:
Given a public identity { }
*
0,1 ID , the PKG
computes the public key
0 1
( ) Q H ID G = and generates
the associated private keyS sQ = . The PKG outputs S as
the private key to the user via some secure channel.
Let n users
1
, ,
n
U U with respective public
key ( )
0
( ) 1
i i
Q H ID i n = decide to agree upon a
common secret key.
i i
S sQ = is the long term secret key
of
i
U sent by the PKG on submitting
i
U s public
identity( ) 1 i n . Let U denote
1
|| ||
n
U U .
We assume that
1
U is the protocol initiator. The
protocol may be performed in three rounds as follows:
1582 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
Round 1:
Each participant
i
U chooses a random number
*
i q
r Z ,
computes
i i
z r P = , ( , )
i i i
B H ID z = ,
i i i
v B S = and
broadcasts( ) , ,
i i i
ID z v .
Round 2:
After receiving each( ) , ,
i i i
ID z v ( ) 1 i n , each user
can compute
( , )
i i i
B H ID z = , ( )
0
( ) 1
i i
Q H ID i n =
and verify whether the equation
?
( , ) ( , )
i i i
e v P e BQ R =
sets or not. If the equation sets,
i
U can ensure
that( ) , ,
i i i
ID z v is not modified or forged by attackers.
Then he
computes
1 i i i
Z r z
= ,
1 1 i i i
Z r z
+ +
= ,
1 i i i
X Z Z
+
= ,
( , )
i i i
C H ID X = ,
i i i
w C S = and
broadcasts( ) , ,
i i i
ID X w .
Round 3:
After receiving each( ) , ,
i i i
ID X w ( ) 1 i n , each
user can compute
( , )
i i i
C H ID X = , ( )
0
( ) 1
i i
Q H ID i n =
and verify whether the equation
?
( , ) ( , )
i i i
e w P e CQ R =
sets or not. If the equation sets,
i
U can ensure
that( ) , ,
i i i
ID X w is not modified or forged by attackers.
Then he computes the keying material
1 2
( 1) ( 2)
i i i i i n
D nZ n X n X X
+ +
= + + + +
Actually,
( )
1 2
1 2 2 3 1
1
( 1) ( 2)
i i i i i n
n
j n
j
D nZ n X n X X
Z r r r r r r P
+ +
=
= + + + +
= = + + +
Then each user computes the session key as
1 1
( || || || || || || || )
i n n i
K H U z z X X D =
VI. SECURITY ANALYSIS AND SECURITY ATTRIBUTES
Theorem 6.1 Any modification can be found by the short
signature if the hash function H is collision resistance.
Proof: In the function of
?
( , ) ( , )
i i i
e v P e BQ R = , the
parameters P and R are public, which can not be forged
or modified, and ( , )
i i i
B H ID z = ,
0
( )
i i
Q H ID = are
computed by the receiver. Though
i
v and
i
z may be
modified by the attacker, the collision resistance hash
function H will make it impossible to gain suitable pairs
of
i
v and
i
z to pass the verification function. So if attacker
modifies any elements of ( ) , ,
i i i
ID z v , other users can
find it. The function
?
( , ) ( , )
i i i
e w P e CQ R = has a similar
situation with
?
( , ) ( , )
i i i
e v P e BQ R = . That is why any
modification can be found by the short signature.
Theorem 6.2 The attacker cant obtain the session key
from the intermediate messages if CDH problem is hard.
Proof: Suppose the challenger C wants to solve the CDH
problem. That is, given ( , ) aP bP , C should
computeabP . In our protocol, the intermediate messages
transmitted in the public channel are( ) , ,
i i i
ID z v in the
first round and ( ) , ,
i i i
ID X w in the second round. The
efficient elements are
i
z and
i
X , and other elements are
used to protect them. Supposed that attacker can obtain
the session
key
1 1
( || || || || || || || )
i n n i
K H U z z X X D = . That is,
she can obtain the keying
material
i
D .For
1 2
( 1) ( 2)
i i i i i n
D nZ n X n X X
+ +
= + + + + , she
can obtain
i
Z according to the equation
[ ]
1 2
( 1) ( 2)
i i i i n
i
D n X n X X
Z
n
+ +
+ + +
=
,
where
i
X and n had been obtained by the attacker. As it is
known to us,
i i
z r P = and
1 1 i i i i i
Z r z r r P
= = .
Define
i
z aP = and
1 i
z bP
=
=
1, 1
1
i
n
i j i
j
=
=
(3)
2 , 2
1
i
n
i j i
j
=
=
(4)
i
: A constant that is inversely proportional to the
processing (data fusion) speed of cluster head .
i
Ch
, i j
y : A constant that is inversely proportional to the
measuring speed of intra-cluster node in the network.
ij
n
1i
z : A constant that is inversely proportional to the
speed of link between the first sink and the i th cluster
head in the network
2i
z : A constant that is inversely proportional to the
speed of link between the second sink and the i th cluster
head in the network
, i j
z : A constant that is inversely proportional to the
speed of link between the cluster head in the
network.
i
Ch
ms
T : Measurement intensity constant. This is the time it
takes the intra-cluster node to measure the entire
load when
ij
n
,
1
i j
y = . The entire assigned measurement
load can be measured on the intra-cluster node in
time
ij
n
, i j ms
y T
cm
T : Communication intensity constant. This is the time
it takes to transmit the entire processing load over a link
when 1
i
z = . The entire load can be transmitted over the
th link in time i
i cm
z T
cp
T : Data fusion intensity constant. This is the time it
takes to fuse the entire load on a cluster head
when 1
i
= . The entire load can be fused on cluster
head in time
i
Ch
i cp
T .
i
: The information utility constant of cluster head .
i
Ch
The operation process of the entire application is as
follows:
1. Sink firstly divided the general task and assigned the
sub-tasks to each cluster head.
2. Each cluster head partitioned the tasks it received
then distributed to the nodes within its cluster.
3. Intra-cluster nodes performed measurement while
reported the results to the cluster head.
4. Cluster head fused the data it received from intra-
cluster nodes while sent the fused data to the sink
node.
IVOPTIMAL SCHEDULING ALGORITHM
Wireless sensor networks construct clusters several
times in its life cycle. Each cluster will have a set-up
phase and a steady-state phase. We discuss our multi-
rounds task scheduling algorithm in a steady-phase phase.
The original tasks received by sink are divided into
two stages: inter-cluster task scheduling and intra-cluster
task scheduling. First, inter-cluster task scheduling
partitions the entire tasks into each cluster, and then the
sub-tasks in a cluster is assigned to each intra-cluster
sensor node by intra-cluster task scheduling. To improve
1588 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
overlap of communication with computation, inter-cluster
task scheduling assigned sensing tasks among all clusters
in multiple rounds.
According to divisible load theory, to remove
performance degradation caused by communications
interference, sinks sends tasks to cluster head sequentially.
After each cluster finishing its tasks and fusing the data,
the cluster heads also send this rounds results to SINK
sequentially. That in every moment only allows SINK
node sends sub-tasks to a cluster head, or a cluster head
return fusion data to the sinks.
Two generic techniques for solving linear divisible
load schedule problems are linear equation solution and
linear programming. Analytical closed form solutions
have the advantage of giving insight into system
dependencies and tradeoffs. Furthermore, analytical
solutions, when they can be realized, usually require only
a trivial amount of calculation. Linear programming has
the advantage of being able to handle a wide variety of
constraints and producing numerical solutions for all
types of linear models. Alternately one can often, though
not always, set up a set of linear equations that can be
solved either numerically or, in special cases, analytically.
In this subsection A, a typical closed form solution for
task scheduling of heterogeneous wireless sensor
networks is achieved. In subsection B, a representative
task scheduling problem with bi-level programming
solution is discussed.
A. A closed form solution
A.1 Intra-cluster task scheduling
Fig.3 illustrates the timing diagram for a set of sensor
nodes, indexed from to , in one cluster. From
Fig.3, it can be observed that there is no time gap
between every two successive nodes because the divisible
workload can be transferred in the cluster. All sensor
nodes start to measure data at the same time. Once the
previous node finishes transmitting data, the other one
completes its measuring task and starts to report its data.
As a result, the proposed timing diagram minimizes the
finish time by scheduling the measuring time and
reporting time of each senor node. Moreover, since the
intra-cluster scheduling tries to avoid the transmission
conflicts at the cluster head, energy spent on
retransmission are conserved.
1
n
k
n
The working time of a sensor node can be divided
into two parts: measuring time and reporting time.
Measurement Datareporting
n
1
n
2
,1 ,1 i i ms
y T
,2 ,2 i i ms
y T
,2 ,2 i i cm
z T
,3 ,3 i i cm
z T
,1 ,1 i i cm
z T
,3 ,3 i i ms
y T
, , i ni i ni cm
z T
, , i ni i ni ms
y T
t
i
n
3
n
ni
C
i
i i cp
wT
Datafusion
Figure 3. Timing diagramfor intra-cluster task-processing
The task scheduling model considered in this paper is
shown in Figure 3. The intra-cluster nodes began
collecting data at the same time, and report the results
collected to cluster head.
In order to fully utilize the link bandwidth, and avoid
the waiting between different nodes, intra-cluster nodes
completed reporting data collected to cluster head
synchronously. Cluster head received the reported data
from intra-cluster nodes, then fused those data, and sent
the fused results to sink.
Similarly, in order to remove the performance
degradation caused by idle, and to improve efficiency,
cluster head completed reporting the fused data to the
sink node.
For Cluster head , based on the timing diagram
shown in Fig. 3, one can write the following set of
equations:
i
C
1, 1 , 1 1, , 1, , i j i j ms i j i j ms i j i j cm
y T y T z T
= + ,
2,3,... i = k (5)
A general expression for the above set of recursive
equations can be written as
1 , , 1, 1
=
i j i j i j
s
(6)
where
, , 1 , ,
/ ( )
i j i j ms i j ms i j cm
s y T y T z T
= + and
2,3,... i k =
The above recursive equation for
1 ,1 i
can be
rewritten in terms of
1i
only as
(7)
1 ,1 1 ,
2 2
/ (1 )
i
n j
i i i
j k
s
= =
= +
k
The cluster head will use the above value of
1 ,1 i
to
obtain the amount of data that has to be measured by the
rest of the 1
i
n sensors by using
1 , 1 , ,
2 2
( ) / (1
i
n j j
i j i i k i k
k j k
s
= = =
= +
2
) s (8)
The minimum measuring and reporting time of the first
sinks sub-task
1i
will then be given as
1 1 ,1 ,1 , 1
2 2
= ( ) / (1 )
i
n j
i i i ms i cm i k i i cp
j k
t y T z T s w
= =
+ + +
T
(9)
Similarly we can get the minimum measuring and
reporting time of the second sinks sub-task
2i
is :
2 2 ,1 ,1 , 2
2 2
= ( ) / (1 )
i
n j
i i i ms i cm i k i i cp
j k
t y T z T s w
= =
+ + +
T
(10)
A.2 Inter-cluster task scheduling
After cluster heads fused the clusters measured data,
cluster heads can sent the fused data to sinks concurrently
because each cluster head has a separate channel to the
sinks.
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1589
2011 ACADEMY PUBLISHER
In order to remove the performance degradation
caused by idle, and to improve efficiency, as shown in
Fig. 4, we can get
2 2 2 1 1 i i i cm i i i i cm
z T t z T = + (11)
In eq. (11) and (12), we make
,1 ,1 ,
2 2
( ) / (1 )
i
n j
i ms i cm i k i cp i
j k
y T z T s wT
= =
+ + +
s =
2i
, then take to eq. (13), we can get
i
s
1i i
r = (12)
where
2 1
/ ( )
i i i cm i i i cm
r z T s z T = +
The total tasks cluster head get is :
i
Ch
1 i i 2i
= + (13)
From Fig. 4 one can see that:
(14)
1
1 1 1 1 1 1, 1 1, 1 i i i i i cm i i i i i cm
s z T s z
+ + + + +
+ = + T
l
From eq. (14) to eq. (16), we can get
1 1 i i i i
l
+ +
= (15)
where
1
/ (1 )
i i i i i cm i
l s r z T r = + +
Now using the eq. (1), one can solve for
i
as
(16)
1
(1/ ) / (1/ )
k
i i
i
l
=
=
i
l
Hereto, we can get that the tasks cluster head
and the intra-cluster nodes within it received from the
first sink
i
Ch
1
i
and
1
, i j
. Similarly, the tasks from the
second sink
2
i
and
2
, i j
. And the total task execution
time
2 1 1 1 f i i i i i c
T t t z T
m
= + + (17)
C
1
S1
C
n
C
3
C2
S
2
2
1
t
1
1
t
1
2
t
2
2
t
3
3
t
2
n
t
3
3
t
1
n
t
1 1
1 1 1 cm
z T
1 1
n n n cm
z T
2 2
n n n cm
z T
2 2
1 1 1 cm
z T
1 1
3 3 3 cm
z T
2 2
3 3 3 cm
z T
Tf
1 1
2 2 2 cm
z T
2 2
2 2 2 cm
z T
Intra-cluster scheduling Intra-cluster scheduling
Figure 4. Timing diagramfor inter-cluster task scheduling
B. A bi-level programming method
In this subsection, a bi-level programming model is
constructed in the task scheduling problem of wireless
sensor networks.
We can regard the task scheduling problem as a
Leader-Follower problem.
The upper-level can be described as the load allocation
ratio of sinks allocated to each cluster head satisfying the
divisible load theory, which make the makespan
minimum. The lower-level can be described as the load
allocation ratio of cluster head allocated to each intra-
cluster sensor divisible load theory, which make the intra-
cluster task completion times minimum.
The problem of minimizing the total task finish time
in scheduling algorithm is described below:
2 1 1 1 i i i i i cm
t t z T T
f
+ +
k
(18)
So, for the upper-level programming, the
mathematical model is as follow:
2 1 1 1
1
( )
, 1,...,
1, 0, 1,...,
f
i
i i i i i cm f
k
i i
i
Min T
subject to Cluster head C
t t z T T i
i k
=
+ + =
= =
(19)
When the upper-level programming achieves optimal,
the most reasonable load allocation ratio
i
on each
cluster head could be derived. According to the load
allocation ratio
i
on each cluster head, the optimal load
allocation ratio
, i j
on each intra-cluster sensor can be
determined by the lower-level programming.
For the lower-level programming, the mathematical
model is as follow:
,
, , , ,
1
, ,
1
1,...,
( )
, 1,...,
, 0, 1,...,
i
i
i j
j
i k i k cm i j i j ms i i
k
n
i j i i j i
j
Min t i k
subject to Sensor S
z T y T t j n
j n
=
=
=
+ =
= =
(20)
From the above, a bi-level programming model is
constructed in the synthetic Problems of task scheduling
for wireless sensor networks. The most reasonable load
allocation ratio
i
on each cluster head could be fixed by
the upper-level programming, and the lower-level
programming established the most suitable load
allocation ratio
, i j
on each intra-cluster sensor. In the
above programming,
i
and
f
T are the target function
and the decision variable of upper level respectively, and
1590 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
, i j
and are the target function and the decision
variable of lower level
i
t
respectively.
On minimum makespan
f
T as the target function,
the most reasonable load allocation ratio
i
on each
cluster head as the decision variable, the programs to
realize the optimization hauling project of minimum
expenses and to output various forms are compiled
according to the demand.
V. WIRELESS ENERGY USE
In this section, the energy model of the OTSA-WSN
algorithm is presented in detail and the equations of
energy consumption of individual sensor nodes are
derived. The model is based on first-order radio model
[10].
There are three kinds of energy consumption in the
wireless sensor network: measurement, data fusion, and
communication. Because nodes in the sensor networks
cooperate with each other via data transmission, energy
consumption of communications exists in sensor nodes,
cluster heads and sink. It is not necessary for cluster
heads and sinks to perform any sensing task. Thus, there
is no energy cost for cluster heads due to the
measurement of these nodes, while the additional energy
cost of cluster heads attributes to data fusion. The energy
to sense, fuses, and transmits a unit sensory data are
denoted by
s
e , and , respectively. Sensor nodes
also consume the energy of to receive one unit of
data. The distance between the sender and the receiver is
.
p
e
tx
e
rx
e
d
The energy use for each kind of nodes is outlined as
follows:
Energy use for individual sensor nodes j in
clusteri :
2
, ,
( )
i j i j s tx
E e e d = + , , 1, , i k = 1, ,
i
j n =
(21)
Energy use for individual cluster head:
2
( )
i i rx p i tx
E e e e d = + + 1, ,k =
e
, i (22)
Energy use for sink:
1
k
SINK i i tx
i
E
=
=
(23)
VI. PERFORMANCE EVALUATION
In this section, we investigate the effects of different
measurement/communication speed under homogeneous
network environment on the total task finish time
(makespan) and energy consumption of every intra-
cluster nodes, and compare the 2-sinks model to the
traditional single sink structure.
In the simulation, the following energy parameters are
adopted: transmitting a unit of sensor reading over a unit
distance takes , receiving one unit of sensor
reading consumes , measuring one unit of
sensor reading needs , fusing one unit of
observation consumes and the distance
between the sender and the receiver is . There
are 30 sensor nodes in each cluster.
=200
tx
e nJ
nJ
J
J
m
=150
rx
e
=100
s
e n
=20
p
e n
=100 d
The simulation results are shown in Figure 5 to
Figure 7.
Firstly, the makespann against the number of
clusters are plotted in Fig. 4. In Fig. 4(a), the value of
measurement speed is chosen from 0.8 to 1.6, while
communication speed is fixed to 1.0. This figure shows
that measurement speed almost does not affect the
makespan because sensing takes a small fraction of the
entire execution time. Fig. 4(b) shows that when the
communication speed of nodes increases, the makespan
of a given task is reduced. It can be found that the five
lines in Fig. 4(b) converge when the number of clusters
becomes large.
(a)
(b)
Figure 5. Impact of measuring speed and bandwidth on the makespan
Next, the second simulation is about the energy
consumption of intra-cluster nodes. Sinks and cluster
heads are not taken into account because generally, sinks
has no energy constraint and the chosen cluster heads
have the possibly enough energy. The network is
configured with 20 clusters. Without loss of generality,
the intra-cluster nodes in the first cluster are chosen to
study the energy consumption, as shown in Fig.5. Fig. 5(a)
shows the higher the intra-cluster nodes measuring speed,
the more evenly the tasks allocated to each nodes, hence
the smaller the energy consumption of the nodes. Fig. 5(b)
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1591
2011 ACADEMY PUBLISHER
presents the larger communication speed between senders
and receivers, the smaller the energy consumption of the
intra-cluster nodes.
(a)
(b)
Figure 6. The impact of measuring speed and bandwidth on the energy
consumption in intra-cluster nodes
Then, Fig.6 reflects the comparison of time-
consuming and energy-consuming of two network
architecture in dealing with the same task. In the
simulation, we supposed that: y= . As can be
seen from Fig. 6(a), the task completion time is reduced
by 20% in network with 2 sinks due to better computation
and communication overlap. Fig. 6(b) shows that the
energy-consuming of sensors is more balanced, so the
networks lifetime is prolonged.
z=w=1.0
(a)
(b)
Figure 7. Comparison of time-consuming and energy-consuming of the
two network architecture
VII. CONCLUSIONS
As the nodes in wireless sensor network have limited
energy, so the tasks should be completed as soon as
possible. In this paper, we present a new task scheduling
strategy in heterogeneous clustered wireless sensor
networks with multiple sinks based divisible load theory,
to solve the problem how to complete the tasks within the
possibly shortest time. In this strategy, the tasks are
distributed to wireless sensor network based on the
processing and communication capacity of each sensors
by multiple sinks. After received the sub-tasks, the intra-
cluster sensors perform its tasks simultaneously, and send
its results to cluster head sequentially. By removing
communications interference between each sensor,
reduced makespan and improved network resource
utilization achieved. Cluster heads send fused data to
sinks sequentially after fused the data got from intra-
cluster sensors, which could overlap the task-performing
and communication phase much better. The strategy
consists of two phases: intra-cluster task scheduling and
inter-cluster task scheduling. Intra-cluster task scheduling
deals with allocating different fractions of sensing tasks
among sensor nodes in each cluster; inter-cluster task
scheduling involves the assignment of sensing tasks
among all clusters. Solutions for an optimal allocation of
fraction of task to sensors in heterogeneous wireless
sensor networks are obtained via closed-form solution
and bi-level programming solution, respectively.
ACKNOWLEDGMENT
The authors thank the editors and the anonymous
reviewers for their valuable comments that helped to
improve the paper. The work was supported by the
National Natural Science Foundation of China
(No.60972047), and the 111 project (No.B08038).
REFERENCES
1592 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
[1] V. Shah-mansouri, A. Mohsenian-rad,Lexicographically
Optimal Routing for Wireless Sensor Networks With
Multiple Sinks, IEEE Transactions on Vehicular
Technology, 2009, 58(3): 1490 1500.
[2] K. Yuen, L. Ben, B. C. Li, A Distributed Framework for
Correlated Data Gathering in Sensor Networks, IEEE
Transactions on Vehicular Technology, 2008, 57(1) :578
593.
[3] V. Bharadwaj, D. Ghose, T. G. Robertazzi, Divisible load
theory: A new paradigm for load scheduling in distributed
systems, Cluster Computing, 2003, 6(1), pp.7-18.
[4] M. Moges, T. G. Robertazzi, Wireless sensor networks:
scheduling for measurement and data reporting, IEEE
Transactions on Aerospace and Electronic Systems, 2006,
42(1), 327-340.
[5] H. Liu, X. Yuan, M. Moges, An Efficient Task
Scheduling Method for Improved Network Delay in
Distributed Sensor Networks, In Proceedings of
TridentCom, 2007, (pp.1-8). Orlando, FL, US: IEEE.
[6] H. Liu, J . Shen, X. Yuan, M. Moges, Performance
Analysis of Data Aggregation in Wireless Sensor Mesh
Networks, In Proceedings of Earth & Space 2008, (pp.1-
8), Akron, OH, USA: IEEE.
[7] C. Kijeung, T. G. Robertazzi, Divisible Load Scheduling
in Wireless Sensor Networks with Information Utility
Performance, In Proceedings of IPCCC, 2008, (pp.9-17),
Austin, Texas, USA: IEEE.
[8] Z. Zeng, A. Liu, D. Li, A Highly Efficient DAG Task
Scheduling Algorithm for Wireless Sensor Networks, In
Proceedings of ICYCS, 2008, (pp.570575). Zhang J ia J ie ,
Hunan , China: IEEE.
[9] J . Lin, W. Xiao, F. L. Lewis, et al, Energy-Efficient
Distributed Adaptive Multisensor Scheduling for Target
Tracking in Wireless Sensor Networks, IEEE
Transactions on Instrumentation and Measurement, 2009,
58(6), pp.1886 1896.
[10] W. Heinzelman, A. Chandrakasan, An application-
specifid protocol architecture for wireless microsensor
networks, IEEE Transaction on Wireless Communications,
2002, 1(4), pp. 660-670.
[11] C. Kijeung, T. G. Robertazzi, Divisible Load Scheduling
in Clustered Wireless Sensor Networks, Stony brook
university, 2009.
[12] X. Li, H. Kang and J. Cao, Coordinated Workload
Scheduling in Hierarchical Sensor Networks for Data
Fusion Applications, J ournal of Computer Science and
Technology, vol. 23, 2008, pp. 355-364.
[13] X. Li, X. Liu and H. Kang, Sensing Workload Scheduling
in Sensor Networks Using Divisible Load Theory, The
50th Annual IEEE Global Telecommunications Conference,
Washington DC, 2007, pp. 785-789.
[14] X. Tang, J . Xu, Optimizing Lifetime for Continuous Data
Aggregation With Precision Guarantees in Wireless Sensor
Networks, IEEE/ACM Transactions on Networking,
2008, 16(4), pp. 904 917.
Liang Dai was born in 1981. He was graduated from Xidian
University with Ph. D in Communication and Information
System in 2011. He is with Changan University from 2011.
His research interests include wireless sensor networks and
digital signal processing in mobile communication.
Hongke Xu received the B.Sc. degree in traffic control and
management from Chang'an University, Xian, China, in 1985,
the M.Sc. degrees in computer engineering from Xidian
University, Xian, in 1993, and the Ph.D. in traffic control and
management from Chang'an University.
He is currently a Professor with the School of Electronic and
Control Engineering, Chang'an University. His major research
interests are in the fields of traffic control and management, and
ITS.
Ting Chen was born in 1982. She was graduated from
Xidian University with Ph. D in Communication and
Information System in 2011. She is with Changan University
from 2011. Her research interests include wireless
communication networks, corss-layer design, QoS guarantee
mechanism, and etc..
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1593
2011 ACADEMY PUBLISHER
Visual Important-Driven Interactive Rendering of
3D Geometry Model over Lossy WLAN
Bailin Yang
Department of Computer and Information Engineering, Zhejiang Gongshang University, Hangzhou, China
Email: {ybl}@mail.zjgsu.edu.cn
Zhiyong Zhang and Xun Wang
Department of Computer and Information Engineering, Zhejiang Gongshang University, Hangzhou, China
Email: { zzy, wx}@mail.zjgsu.edu.cn
AbstractIn this paper, we propose a visual important-
driven interactive rendering method for 3D model over
802.11 WLAN for overcoming the shortcomings of wireless
networks narrow bandwidth, high transmission error rates
and mobile devices low power supply. This paper first
proposes an efficient simplification method based on an
improved visual important region detection technique.
Then, we develop an efficient hybrid FEC and MAC-Lite
model transmission protocol which will transmit the model
data by their importance respectively. Finally, we propose a
real-time interactive rendering method by an efficient model
coding. Experimental results demonstrate that we can
obtain better rendering result among lossy environment and
gain real-time interactive rendering result.
Index TermsVisual detection, Model Simplification, FEC,
MAC_Lite, interactive rendering
I. INTRODUCTION
With both the mature of the mobile network
infrastructure and wide use of mobile handheld devices,
3D applications based on mobile devices among wireless
network have got rapid development. However, the
shortcomings of narrow bandwidth, high transmission
error rates among wireless network and the limitations of
limited power supply and low computing performance for
mobile devices cannot meet the requirements for the real-
time interactive rendering of the 3D model on mobile
devices.
Recently, the technologies for model transmission and
interactive rendering have been received more attentions
from researchers. The typical technique is the progressive
coding and transmission method [1] for 3D model that
can transmit the model data on demand of the users
quality requirement for 3D model. Usually, the model
should be simplified into progressive model. Thus, how
to simplify this model efficient is more important. Now,
researchers are paid more attention to simplify the model
from the aspect of visual optimization.
As we knew, lossy wireless network is considerably
different from wired networks. The transmission over
lossy wireless links stays challenging due to narrow
bandwidth, fading and obstacles which result in high
transmission error rates.
To address these problems, retransmission is scheduled.
Evidently, this retransmission mechanism affects the
networks throughput and end-to-end delay badly.
Factually, most dropped packets are caused by bit-errors
in the frame during transmission in the wireless network
and a packet with this kind of errors can still be utilized
for some error-tolerant transmission such as audio, video
and graphics. In this regard, a new transport layer
protocol called UDP Lite[2] which is tailored for real-
time applications over error-prone networks has been
proposed. Unlike UDP, UDP Lite allows for partial
checksums that only covers part of a datagram, and will
therefore deliver packets that have been partially
corrupted. However, in WLAN, lots of corrupted packets
are discarded in the MAC layer without reaching UDP
layer. The CRC in the MAC layer also should be altered
to allow corrupted frames being passed to higher layers,
which is known as Mac-Lite [3]. Similar to UDP-Lite, the
coverage of Mac-Lite's checksum can be set freely.
Whats more, the key problem of interactive rendering
for 3D model on mobile device is how to decrease the
transmitted data during the transmission and lower the
computing and rendering task in mobile device.
In this paper, we propose a simplification method
based on an improved salient detection method. In the
implementation, we present some optimization
techniques to accelerate the progressive mode
reconstruction. Then, we propose an efficient hybrid FEC
and MAC-Lite model transmission protocol which will
transmit the important graphics data and less important
data by modified protocol based on FEC and sensitive
data dependent Mac-Lite respectively. Finally, a real time
transmission and interactive rendering method by an
effective model coding method is proposed. In this
method, the mobile client only executes the low-lever
rendering operation such that shortens the waiting time
before the rendering.
The rest of the paper is organized as follows. Section II
is the related work. Section III describes perceptually-
based progressive model construction method. Section IV
Corresponding author: Wang Xun
1594 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
doi:10.4304/jnw.6.11.1594-1601
designs a hybrid transmission protocol. Section V
explains the progressive transmission and interactive real
time rendering method. The experimental result is shown
in section VI. Finally, we summarize our work in section
VII.
II. RELATED WORK
A. Simplification and Visual important Computation
Researchers have presented many simplified methods
which aim to decrease the visual quality differences of
the simplified model and source model. The typical
method are QEM[4] and improved QEM[5]. However,
those methods focus only on the geometry features and
the visually important area cannot be preserved longer
while simplifying.
In order to solve this problem, the simplified methods
from the aspect of visual optimization are proposed.
Lindstrom [6] proposed the simplified method based on
the CSF model. Luebke and Hallen[7] proposed a method
that employed the visual psychology model to control the
3D model simplification procedure. Qu [8] etc. al
proposed a visual mask computing method, which would
direct the simplifying for the textured 3D model.
Unfortunately, above methods did not consider the
topology information of the mode itself.
As we knew, the idea of salient region has been
developed to help identify distinct spatial regions from
their surrounding neighborhoods. Also, the saliency
technique has been applied to 3D models. In general, the
detection of 3D salient regions can be treated as an
extension of identifying salient regions on a 2D image.
Based on the model developed for 2D image [9], Lee et al.
[10] proposed the idea of mesh saliency as a measure of
regional importance for 3D models based on the center-
surround mechanism and feature integration theory [11].
This method focuses only on the curvature of each vertex.
Gal et al. [12], however, computed a salient region based
not only on the curvature, but also the variance of
curvature, the number of curvature changes and the size
relative to the whole object. By these saliency methods,
they achieved better simplification result.
B. UDP_Lite and MAC_Lite Transmission Protocol
Recently, there are lots of literatures about UDP Lite
or Mac-Lite applications. In Ref.[13-17], UDP Lite is
deployed to transmit multimedia data. Errors in the
sensitive part of a multimedia packet should result in
dropped packets, while errors in the insensitive part are
forwarded to application layer. To allow packets
containing errors to be forwarded to the UDP layer, the
802.11 MAC level errors checking feature is completely
disabled. Regarding to WLAN, however, the MAC level
checksum cannot be completely disabled due to the high
bit error rates during transmission. Moreover, the MAC
layer plays much important role than UDP layer because
the data can be forwarded to the destination by the MAC
protocol even without UDP protocol in WLAN.
Mac-Lite is used to transmit voice in WLAN [18].
The checksum only covers headers data such as MAC
header, IP header and UDP header, but for voice data, no
checksum is applied on it. The experiments results show
that compared with the original CRC checking scheme,
better performance of networks is achieved. In [19], the
authors use different coverage of MAC layer's checksum
to transmit speech and compare their experimental results.
For video transmission, video coding technology is
adopted to divide the video data into different parts
according to their different importance and then use Mac-
Lite to transmit it [20]. In order to transmit data correctly
and fast by Mac-Lite, the forward error-correcting (FEC)
technology is used [21]. If the partial checksum detects
errors in important data such MAC header data, no
retransmission but correcting it instead.
However, there is no discussion in literatures about
adopting Mac-Lite or its modified version to transmit
mesh of 3D model in WLAN.
C. Streaming and Interactive Rendering
Different from the desktop pc device over wired
network, the main shortcoming for the mobile device is
the limited power supply and computing ability.
Therefore, the key problem of 3D model rendering on
mobile device is how to decrease the transmitted data
from the server to client and lower the computing and
rendering task in mobile device.
Luo[22] proposed the progressive transmission and
model simplification methods for mobile device.
However, these methods needs local reconstruction
operations which will take up lots of computing costs and
cause the rendering delay at the client. Actually, this
method is not good for mobile device. Thus, we can
translate the 3D model into image or video and adopt the
successful image or video coding technique to transmit
the model data. For example, reference [23] and [24]
respectively proposed the MPEG-4 coding and J PEG
2000 coding methods to transmit the 3D model.
Unfortunately, these methods are not suitable for the
application of 3D model representation in mobile e-
commerce because these method can not obtain the whole
3D model data but the static images.
III. PERCETUALLY-BASED PROGRESSIVE MODEL
CONSTRUCTION
Loosely speaking, a salient region of a model is the
area that is distinct from its surroundings. In this paper,
we propose a saliency computation method to effectively
obtain salient regions of a model. Similar to [10], the
saliency map is created by center-surround mechanism.
Usually, center-surround differences are calculated as an
across-scale difference between coarse and fine scales.
For each scale, a filter window to include neighbouring
vertices samples should be designed.
The implementation of our saliency computation
method is depicted as follows:
Step 1: Compute the mean curvature MC at each vertex
i
v ( 1... i n = , n is the number of vertices of the mesh).
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1595
2011 ACADEMY PUBLISHER
Step 2: Define the local filter window for vertex
i
v and
choose its neighboring vertex set ( )
i
NS v .
Step 3: According to ( )
i
NS v , calculate the Gaussian
weighted average ( ) GW v
i
at different scales.
Step 4: Get the difference DGW ( )
i
v
m,n
between the two
scales m and n for
i
v and then compute the geometry
feature map ( , )
l
G m n .
Step 5: Make use of the non-linear suppression operator
to combine the feature maps ( , )
l
G m n into the final
geometry saliency map GF .
In step 1, we use the method proposed in [15] to get
the MC . Then, we utilize the local filter design method to
acquire ( )
i
NS v .
Given the ( ) GW V
i
of each vertex and radiusr , its
Gaussian-weighted average is
2
2
2
2
( )exp / (2 )
( , )
exp / (2 )
r
r
i
x W
i
i
x W
MC x x V r
GW V r
x V r
(1)
Then, each feature map
( , )
l
G m n
is calculated as:
( , ) ( , ) ( , )
l i m i n
G m n GW V r GW V r =
(2)
Finally, those feature maps will be combined into one
geometry map by the nonlinear suppression operator. We
improve the method proposed in [10] by not only
acquiring the block salient region but also the details,
such as the exact boundary of the salient region. In our
case, we take the mean curvature of each vertex into
consideration while combining the above four scales into
the final salient region. We also adjust the weight and
l
to get the final geometry feature map using the
following formula.
4
1
( ) ( ) ( )
i l l
l
GF V MC G
=
= +
(3)
To preserve the visually important vertices longer, we
will adopt above salient detection method. By this
method, we can get the salient importance values ( )
i
S V
of each vertex. We have modified the QSlim algorithm
[2] by weighting the quadrics with mesh saliency.
After the creation of simplification metric, the
collapsed queue(CQ) is initialized and the collapsed
operations are executed for importing the new vertices
and edge pairs(EP). Thus, we can build the full collapsed
queue namely the vertex split list. Meanwhiletwo data
structures of the collapsed record stack and split record
stack will be introduced to meet the needs of interactive
rendering. In the following, we explain the optimization
tactics in the implementation.
1) Initialization of the CQ. According to ( )
i
w v , the
suitable EPs are chosen and the CQ is built. Usually, the
CQ adopts the heap data structure. This structure is
simple from logical. However, experimental result
demonstrates this method is slowly while lots of EPs are
appeared. In this paper, we will adopt the dynamical array
structure. Different from the heap data structure, the
sorting operation is executed after all the insertion
operations. By this method, the whole sorting time is
saved.
2) Executing the collapsed operation and building the
final CQ. Fromthe initial CQ, we can find the collapsed
edge(CE) with the smallest value of ( )
i
w v and generate
the new vertices and EP. Clearly, the new vertices and EP
will effect the sorting operation of this dynamical array.
In our implementation, the CQ will not be sorted immediately.
Factually, these new vertices and EP will effects the sorting
of dynamic array. Thus, we do not carry out the sorting for
the CQ immediately but pushes these collapsed vertices
into CQ and sorts the dynamic array again. Experimental
result shows that this method will improve the collapse
speed 30%-40% and does not affect the models
simplification result.
3) Introducing of collapsed record stack (CRS) and
split record stack (SRS). In order to achieve the
interactive rendering, the server should provide the
function of switch between different resolutions of model
quickly. However, the existed simplification method [1]
will consume a great deal of collapse and split operations
while switching from one resolution to another resolution.
Therefore, our method presents the CRS and SRS data
structure, which will record each collapsed and split
operation and push them into the CRS and SRS while
executing the simplification operation. While the model
needs the switch between different resolutions, we just
fetch these records from the CRS or SRS and execute the
corresponding rendering operation.
IV. TRANSMISSION PROTOCOL DESIGN
The basic idea of our modified protocol is to
formulate the transmission protocol according to the
different importance of the 3D model.
Progressive Mesh, as a good solution to the
transmission of 3D models over network, is represented
by a base mesh M0 followed by an ordered list of vertex
split (VSplit), which is in the form of {M0, {VSplit
1,VSplit 2, . . . ,VSplitn}}. There exists dependency
relationship among these VSplit operations. In practice,
these VSplits will be packed into packets for transmission
over networks. Hence, these packets also have
dependency relationship. Consequently, VSplits could not
be rendered unless their dependent VSplits arrived at the
client. If some of the received packets are dependent on
the lost packet, the client will endure a rendering delay
since the lost packet retransmission will be invoked. On
the contrary, if no or just a small number of the received
1596 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
packets are dependent on the lost packets, the client could
render more vertices at a given period of time and the
delay will be reduced.
Thus, in our past work [29], we have presented a novel
packetization scheme that is to decrease the dependencies
among packets. In this packetization method, two steps
will be performed. First, a Non-Redundant Directed
Acyclic Graph (NR-DAG) will be constructed to encode
all the necessary dependencies among the VSplit
operations. Second, a Global Graph Equipartition
Packing Algorithm (GGEPA) is applied to minimizing
the dependencies among different partitions while
separating the whole dependency DAG into k equal size
partitions.
Though this method can decrease the dependencies
among these packets, the dependencies are still existed. If
the dependencies between one packet with other packets
are higher, more VSplits, which are included in the
dependent packets, should wait this packet be arrived at
the client side. Thus, we here regard this packet are
rendering-importance packet. As we knew, if the VSplits
belongs to the base mesh or upper levels, the packets that
contain these VSplits are also rendering important
packets. Unfortunately, these packets maybe not have
many packets that dependent on them. To assign this kind
of packets those have many dependent packets and the
packets those have in the upper level of our model, we
will deal with them in a unified way.
In our GGEPA, we will record each packets
dependencies noted as PD. As we knew, the NR-DAG we
built is a graph. We will translate them into a tree
structure thus all nodes will have been arranged as level
by level. Manifestly, the nodes in the upper level are the
parents of the lower levels nodes. To assign each packet
with a rendering important (RI) value, we browse this
tree level by level with depth-first visiting method and
calculate each packets RI value. While finishing this
depth-first visiting, we can obtain each packets RI.
However, this method only records the RI between
neighboring levels. To obtain the RI among all the levels,
we should add all the childrens RI into their parents.
Now, we can give each node with an accurate RI value.
While the packet is packed into frames, we will
assign them with the perceptually importance value RI. It
means if the frames RI is high, it is important data.
According to the different importance of the frame, the
MAC layer uses two different ways, MAC-FEC and
MAC-Lite protocol, to transmit them respectively. The
details of both methods are as follows.
a) MAC-FEC. For visual important data, to ensure the
data transmitted correctly, the forward error-correcting
(FEC) technology is employed in the MAC layer, as
shown in Fig.1. When a frame has arrived, checksum
mechanism is used to check it. If the checksum failure,
retransmission is not used but FEC for error correction.
While using FEC, the actual data transmitted is larger
than the original base mesh data because the additional
redundancy data is added. However, the ratio of amount
of base mesh data in the entire model is much low, so
using this method to transmit the base model does not
affect the speed of the entire model transmission
obviously. By this method, it can guarantee that a base
model is transmitted correctly.
b) Mac-Lite. For less important model data, we adopt
the Mac-Lite rather than traditional MAC.
However, the key of the Mac-Lite is to set the
coverage of checksum for a frame. Usually, all headers
information should be covered because of the following
reasons:
(1) If there are bit-errors in MAC header, the frame
may be sent to other destinations because of source and
destination address information in it.
(2) If there are bit-errors in IP header, the packet will
be discarded when it is forwarded in the IP layer, because
the IP layer also has checksum mechanism which covers
the IP header.
(3) If there are bit-errors in the UDP header, the
packet may be transmitted to other applications because
the UDP header contains the source and destination port
information.
Nevertheless, the checksum just covering the headers
data is not enough. Factually, the data can be divided into
topology data and geometry data. While the topology
information is lost during transmission, visual errors to
the rendered model, such as the surface self-intersection,
will be incurred. Thus the topology data should be
transmitted as safely as the frame headers information.
Therefore, the coverage of Mac-Lite checksum is the
summary of MAC header (28bytes), IP header (20bytes)
and UDP header (8bytes) and topology data as shown in
Fig. 2.
Fig. 1. MAC frame with FEC
Fig. 2. MAC frame with VSplits
V. PROGRESSIVE TRANSMISSION AND INTERACTIVE
RENDERING
In this section, we presented a progressive
transmission and interactive rendering method on mobile
devices based on above progressive model. Different
from the typical interactive rendering method, we will
design a tactics for computing task allocated on the server
side and client side respectively.
A. Computing Task Assign and Rendering
To reduce the mobile client's computing and storage
burden, we will save and run the multi-resolution model
in the server side as client does. For the client, it would
only execute the rendering operation. First of all, we
construct the multi-resolution model with the method in
section 2, and the server will run and save this model. At
the same time, using the CRS and SRS appeared in the
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1597
2011 ACADEMY PUBLISHER
multi-resolution modeling to achieve the rapid switch of
different resolution for the model. Secondly, when the
client needs certain resolution model, the server makes
use of the CRS and SRS to obtain the collapsed edge and
split vertex, which will be formed into vertex index array
and sent to the client. Finally, the client will execute the
rendering procedure after obtaining this vertex index
array and the data stored at the client already.
In the following, we will describe the real-time
interactive rendering procedure from the aspect of the
server and mobile client.
1) Server store the multi-resolution model data and
response the clients request.
a) Responses the clients request and sends the
models geometry information to mobile client.
b) Server run and store the multi-resolution model
the client needed.
c) Response the request for the certain resolution
model from the client. Making use of the CRS and SRS
to compute the multi-resolution model and get the vertex
index array at this certain resolution.
d) Sends the vertex index array to mobile client.
e) Return to Step c and waits the request for another
resolution model of the client.
2) Mobile client make requests for the server to
obtain the certain resolution model and rendering locally
according to the returned model.
a) Client makes a rendering request to server.
b) Receives the model geometry information and
stores them locally.
c) According to the users request, sends the
rendering request for certain resolution.
d) Receiving the vertex index array from the server
side and rendering them without any reconstruction
operation locally.
e) Return to Step c
During the interactive rendering procedure, it can be
seen that our method just transmits a few vertex index
array such that the total data over the network is
decreased manifestly. More importantly, our method need
not perform the local reconstruction operation thus that
the waiting time for rendering of the model is decreased
aggressively (see the experimental result as shown in
Table 2). By this method, we can achieve interactive
rendering for 3D model at the mobile client side.
VI. EXPERIMENTAL RESULT AND DISCUSSION
A. TestBed Design
We adopt the C/S model to validate our method. The
PC server will transmit the data to mobile client through
the D-Link wireless router among the Wireless 802.11.b
network. The average network bandwidth is 0.5MB/s.
The network layout is shown in Figure 2. At the server
side, the multi-resolution model is created by our
simplified method. In the mobile, we will adopt the
rendering library M3D [25]we developed before which
conforms the OpenGL ES specification. As we knew, the
wireless network is varied so that the experimental result
for the transmission time are measured as the average of
10 times.
To verify our transmission performance, whats more,
we use the ns-2 to build the simulation testbed. In our
simulation model, the nodes have no mobility. This is
primarily because our interest in his paper is to focus on
the effectiveness of the modification of the MAC layer.
Three nodes A, B and C are used as an ad hoc network
and the topology is shown in Fig 2. In order to set the
different loss rate in the PHY layer, the Gilbert Error
model [27,28]should be added. The bit errors generated
by this model are introduced to MAC frame.
The test models in our experiment that are stored by
PLY format is Lauran, Bunny, and Horse as shown in
Fig 4. The number of total vertices and corresponding
storage space of each model, the ratio for the geometry
information and the ratio for the topology information is
shown in Table I.
Fig. 3. Topology in NS2.33
B. Transmission Protocol Performance Analysis
Before the transmission of model, we will adopt the
presented method in this paper to encode the 3D model
with base mesh and a sequence of VSplit . Each VSplit
including only the basic topology and geometry
information is 30-byte quantity. Thus, one packet whose
max size is 512-byte defined in this paper will contain
roughly 17 VSplit .
(a) Laurana
(b) Bunny
1598 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
(c) Horse
Figure 4. Test models
We will report the experimental result for Horse
model, which will be divided into 15233 VSplit . Whats
more, the size of Horse models base mesh and details
VSplit is about 0.31M Bytes and 2.76M Bytes
respectively.
In this paper, we only compare model quality received
at the client side while the packet loss rate is 5%. Fig. 5
demonstrates the close-up wireframe view of horse
model. In order to show comparative results obviously,
we have to control the location of the packet loss rate.
Then packets losses happen from the same area on the
model. In these simulations, when part of a packet is lost
and unrecovered, the packet is discarded. However, in
order to show how many packets are lost at different
packet-loss rates, we used a visualization trick by
discarding part of the received packets in case the packets
were lost. It can be seen that our transmission method can
achieve the rendering result than the traditional MAC
with retransmission mechanism.
C. Comparsions for Interactive Rendering
This subsection compares the proposed interactive
rendering method with the typical method in reference [1].
We will compare the sum of transmission time from
server to client and the rendering time on mobile device
while transmitting the base model, which occupied 20%
of the total model data, 60% and 100% of the model data.
In the typical method, the whole time includes the
transmission time, local reconstruction time and
rendering time. Fortunately, our methods whole time just
includes the transmission time and rendering time. For
the certain resolution model, the rendering time is assured
so that the rendering time is not listed in Table II.
Since our method adopts the smart coding technique
and computing task assignment, the transmitted data over
network can be decreased and the local reconstruction
time would be cut off. Thus, our method can achieve
interactive rendering result. For example, while the full
model are transmitted from the server to client and
display, the time our method consumed are 41%21%
and 37% in contrast to the typical method.
(a) MAC with retransmission
bOurs
Fig. 5. Close-up wireframe view of horse model when loss rate is 5%.
VII. CONCLUSION AND FUTURE WORK
This paper proposes a visual important interactive
rendering method for 3D model over 802.11 WLAN.
This paper first proposes an efficient simplification
method based on an improved saliency detection
technique. By the introducing of the data structure
including collapsed record stack and split record stack,
we can finish the construction of multi-resolution model.
Then, we develop an efficient hybrid FEC and MAC-Lite
model transmission protocol which will transmit the
model data by their importance respectively. Finally, we
propose a real-time interactive rendering method by an
efficient model coding. For decreasing the transmitted
model data over the wireless network, we proposed an
efficient model coding method and computing task assign
method. By this method, we can transmit the model from
the server to client quickly. Whats more, the mobile
client can save the local reconstruction operation which
would consume lots of CPU resource.
In the future work, we will adopt the geometry
compression technique which will decrease the model
data aggressively. Also, the dynamical transmission
mechanism that just transmits the part of model user can
see will also reduce the model data transmitted over the
wireless network.
TABLE I. The Ratio for Geometry and Topology Information of Each Model
3D
Models
Vertices number/total dataKB Ratio for Geometry Inforation Ration for topology information
Laurana 14499/1334 32.5% 67.5%
Bunny 20376/2963 32% 68%
Horse 16029/1382 33% 67%
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1599
2011 ACADEMY PUBLISHER
TABLE II. The Comparsions for Our Method and Typical Method of Tranmission Time and Rendering Time
Typical Method
Full model transmission time and
model reconstrucion time
Our method
Full model transmision time
the ratio comparing for the
typical method
Laurana 13889 5768 41
Bunny 18892 6891 37
ACKNOWLEDGMENT
This work is supported by in part Zhejiang natural
science key foundation (Grant No. Z1080232,
Z1101340), national natural science foundation of China
(Grant No. 60873218), and the Scientific Research Fund
of Zhejiang Provincial Education Department
(Z201018041).
REFERENCES
[1] H. Hoppe, Progressive meshes, In: SIGGRAPH 96
Proceedings, pp.99108, 1996.
[2] http://www.ietf.org/rfc/rfc3828.txt.
[3] S. A. Khayam, S. Karande, M. Krappel, and H. Radha.
Cross-layer protocol design for real-time multimedia
applications over 802.11b networks. Proc. IEEE
International Conference on Multimedia and Expo, J uly
2003, vol.2, pp. II- 425-8.
[4] M. Garland and P. Heckbert, Surface simplification
using quadric error metric, In: Proceedings of ACM
SIGGRAPH97, pp.209-215, 1997.
[5] P. Lindstrom, G. Turk, Fast and memory efficient
polygonal simplification, In: Proceedings of the
IEEEVisualization98, pp.279-284,1998.
[6] P. Lindstrom, Model simplification using image and
geometry-based metrics, PhD Thesis, Georgia Inst. of
Technology, 2000.
[7] D.P. Luebke and B. Hallen, Perceptually-driven
simplification for interactive rendering, In: Proceedings
of 12th Eurographics Workshop Rendering Techniques
(EGRW 01), pp.223-234, 2001.
[8] L.J . Qu, Gary W. Meyer, Perceptually guided polygon
reduction, IEEE Transactions on Visualization and
Computer Graphics, vol.14(5), pp. 015-1029, 2008.
[9] L. Itti, C. Koch, E. Niebur A model of saliency-based
visual attention for rapid scene analysis, In: IEEE
Transactions on Pattern Analysis and Machine
Intelligence, vol. 20(11), pp.1254-1259, 1998.
[10] C.H. Lee, A. Varshney, and D.W. J acobs, Mesh
saliency, In: Proceedings of ACM SIGGRAPH 05, pp.
659-666, 2005.
[11] A. M. Treisman, G. Gelade A feature-integration theory
of attention, Cognitive Psychology, vol. 12(1), pp. 97-
136, 1980.
[12] R. Gal, D. Cohen-OR, Salient geometric features for
partial shape matching and similarity, ACM Transaction
on Graphics, vol 25(1), pp.130-150, 2006.
[13] L. Larzon, M. Degermark, and S. Pink. UDP lite for real
time multimedia applications. in Proc. IEEE
International Conference on Communications, June 1999.
[14] A. Singh, A. Konrad, and A. D. J oseph. Performance
evaluation of UDP lite for cellular video. Proc. ACM
International Workshop on Network and Operating
Systems Support for Digital Audio and Video, J une 2001.
[15] H. Zheng and J . Boyce. An improved UDP protocol for
video transmission over Internet-to-Wireless networks.
IEEE Transactions on Multimedia, Sep. 2001, vol. 3(3),
pp. 356-365.
[16] H. Zheng. Optimizing wireless multimedia transmissions
through cross layer design. Proc. IEEE International
Conference on Multimedia and Expo, J uly 2003.
[17] S. A. Khayam, S. Karande, H. Radha, and D.
Loguinov. Performance analysis and modeling of errors
and losses over 802.11b LANs for high bit rate real-time
multimedia. Signal Processing: Image Communication,
August 2003, vol 18(7), pp. 575-595.
[18] A. Servetti and J . D. Martin. 802.11 MAC protocol with
selective error detection for speech transmission. Proc.
3rd International Workshop on QoS in Multiservice IP
Networks, Catania, Italy, February 2005, pp. 509-519.
[19] I.Chakeres, H.Dong, E.M.Belding-Royer, A.Gersho, and
J . D.Gibson. Allowing errors in speech over wireless
lans. In Proceedings of the 4th Workshop on
Applications and Services in Wireless Networks (ASWN),
Boston, MA, August 2004, 1643-1657.
[20] E.Masala, M.Bottero, and J.De Martin. MAC-level
partial checksum for H.264 video transmission over
802.11 ad hoc wireless networks. Proc. IEEE 61st
Vehicular Technology Conference, May 2005, pp. 2864 -
2868.
[21] S. A. Khayam, S. S. Karande, M. U. Ilyas and H. Radha.
Header detection to improve multimedia quality over
wireless networks. IEEE Transactions on Multimedia,
2007, vol. 9 (2), pp. 377-385.
[22] X. N. Luo, G. F. Zheng. Progressive Meshes
Transmission over a Wired-to-Wireless Network.
Wireless Networks, vol 14(1), pp. 47-53, 2008.
[23] L. Cheng, A. Bhushan, R. Pajarola, and M. E. Zarki,
Real-time 3d graphics streaming using mpeg-4, In:
Proceedings of IEEE/ACM Workshop On Broadband
Wireless Services and Application, 2004.
[24] N.-S. Lin, T.-H. Huang, and B.-Y. Chen, 3D model
streaming based on jpeg 2000, IEEE TCE, , vol. 53(1),
pp.182-190, 2007.
[25] Bailin Yang, Lu Ye, Zhigeng Pan, Guilin Xu. An
optimized soft 3D mobile graphics library based on J IT
backend compiler, In: Proceedings of the 4th
International Conference on Entertainment Computing
ICEC, Sanda, J apan, pp.67-75, 2005.
[26] G. Taubin Estimating the tensor of curvature of a surface
from a polyhedral approximation, In: Proceedings of
IEEE International Conference on Computer Vision,
pp.902-907, 1995.
[27] E. N. Gilbert. Capacity of a burst-noise channel. Bell
Systems Technical Journal, 1960, vol. 39, pp. 1253-1265.
[28] E. O. Elliot. Estimates of errors rates for codes on burst-
noise channels. Bell Systems Technical Journal, 1963,
vol. 42, pp. 1977-1997.
[29] Bailin Yang, Frederick W.B. Li, Zhigen Pan, Xun Wang.
1600 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
An Effective Error Resilient Transmission Scheme for
Progressive Mesh Transmission over Unreliable
Networks. Journal of Computer Science and Technology.
2008, 23(6): 1015-1025.
Bailin Yang received received the
Doctors degree in department of
computer science from Zhejiang Unive-
rsity in 2007. He is a associate professor
in the department of computer and
electronic engineering of Zhejiang Gong-
shang University. His researchinterests
are in moblie graphics, realtime
rendering and mbile game.
Zhiyong Zhang received the masters
degree in department of Mechanical and
Energy Engineering from Zhejiang
University, Hangzhou, P.R. China in
2001. In 2005, He received the Doctors
degree in department of computer science
from Zhejiang University. Now, He is a
associate professor in the department of
computer and electronic engineering of
Zhejiang Gongshang University. His
research interests are in information retrieval, pattern
recognition, and statistical machine learning.
Xun Wang received the Doctors
degree in department of computer
science from Zhejiang University. He
is a professor in the department of
computer and elect-ronic engineering
of Zhejiang Gongshang University. His
research interests are in multimedia
information retrieval, pattern
recognition, and mobile networks. and
statistical machine learning.
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1601
2011 ACADEMY PUBLISHER
Secure Identity-based Threshold Broadcast
Encryption in the Standard Model
Leyou Zhang
Department of Mathematical Science, Xidian University, Xian, 710071, China
Email: [email protected]
Qing Wu
School of Automation, Xian Institute of Posts and Telecommunications, Xian, China
Email: [email protected]
Yupu Hu
Key Laboratory of Computer Networks and Information
Security, Ministry of Education, Xidian University, Xi'an, 710071, China
Email: [email protected]
AbstractThe general threshold broadcast encryption is
not suitable for the networks with the constraints of
computation and energy. In this paper, two constructions of
the proper threshold broadcast encryption to these
networks are proposed. In the proposed schemes, any user
can dynamically join the system as a possible recipient, and
the sender can dynamically choose the set of recipients S
and the threshold value t. The new schemes achieve constant
size private keys and O(n-t)-size ciphertexts. In addition,
these schemes achieve full security in the standard model.
Finally, we also show that they are provable security under
n+1-Weak Decision Bilinear Diffie-Hellman Exponent (n+1-
wDBDHE) assumption and the static assumptions
respectively.
Index TermsBroadcast Encryption, Identity-based
Threshold broadcast encryption, Dual encryption technique,
Provable security, Standard model
I. INTRODUCTION
Broadcast Encryption (BE) was introduced by Fiat and
Naor in [1]. In a broadcast encryption scheme a
broadcaster encrypts a message for some subset S of users
who are listening on a broadcast channel. Any user in S
can use his private keys to decrypt the broadcasts. Any
user outside the privileged set should not be able to
recover the message. The threshold broadcast
encryption(TBE) problem is generalization of the concept
of broadcast encryption. It was first introduced by
Ghodosi et al. [2]. TBE has some advantages over
traditional threshold encryptions. It is specified as follows:
S
(1) The trusted party is eliminated and the system
can be set up by individual users independently;
(2) The broadcaster can choose the privileged set
and the threshold value at the time of encryption which
allows a certain dynamism in the system.
Identity-Based encryption is originally proposed by
Shamir[3], which a major advantage is that it allows one
to encrypt a message by using recipients identifiers such
as an email address. Now it has been an active area. The
first practical identity-based encryption (IBE) scheme
was proposed in 2001 by Boneh and Franklin [4], which
was provably secure against adaptive chosen ciphertext
attack in random oracle model. Then, many other kinds of
identity-based encryption were proposed [5-9]. Identity-
based cryptography significantly reduces the system
complexity and the cost for establishing and managing
the public key authentication framework known as PKI
(Public Key Infrastructure). As a result, we focus on the
construction of identity-based threshold broadcast
encryption (IBTHBE) in this paper. To the best of our
knowledge, very few works have dealt with this problem.
In [10], Chai and Cao et al propose a scheme based on
identity. But the length of the ciphertexts is n +1 and the
security relies on the random oracles. Vanesa Daza et al
propose another scheme [11]. However, its security is
still relying on the random oracles. The recent work [12]
has short ciphertexts, but the security of their scheme
based on the identity (IBTBE) is also relying on the
random oracles. In [13], authors also proposed an
efficient scheme in the standard model. But this scheme
only achieves a weak security -selective-identity security.
As a natural extension of the efforts to improve
schemes in the standard model, we propose two new
efficient identity-based threshold broadcast encryption
schemes in this paper. The proposed schemes are
constructed in the standard model. In our schemes, the
broadcaster can choose the privileged set and the
threshold value at the time of encryption. In addition,
under the full security model, the security of the first
scheme is reduced to the n+1-Weak Decision Bilinear
Diffie-Hellman Exponent (n+1-wDBDHE) assumption
and the security of the second scheme is reduced to the
static assumptions.
Manuscript received J anuary 1, 2011; revised J une 1, 2011; accepted
J uly 1, 2011.
Corresponding author: Leyou Zhang, Email: leyouzhang77@
yahoo.com.cn
II. PRELIMINARIES
1602 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
doi:10.4304/jnw.6.11.1602-1609
In this section, some definitions are given as follows:
A. Bilinear Groups
We briefly review bilinear maps and use the
following notations: LetG and be two (multiplicative)
cyclic groups of prime order p. A bilinear map is a
map : with the properties:
1
G
e
1
G G G
1. Bilinearity: for all , , u v G e , we have
.
,
p
a b Z e
( , ) ( , )
a b ab
e u v e u v =
2. Non-degeneracy: . ( , ) 1 e g g =
3. Computability: There is an efficient algorithm to
compute for all . ( , ) e u v , u v G e
B. Decisional bilinear Diffie-Hellman Exponent
assumption (BDHE)
The decisional bilinear Diffie-Hellman Exponent (BDHE)
problem is defined as follows. Algorithm B is given as
input a random tuple
(g , h
0
, y
1
,
,
n
y , y
n+2
,
, y
2n+2
, T),
where y
i
=
i
g
o
. Algorithm B's goal is to output 1 when
T =e(g,h
0
and 0 otherwise. Let TU =(g , h
0
, y
1
,
, y
n
, y
n+2
,
, y
2n+2
). Algorithm B that outputs
b has advantage
1
)
n
o
+
e
{0,1} c in solving decision BDHE in
G if
|Pr[B(TU, e(g,h
0
) =0]-Pr[B(TU, T) =0]|
1
)
n
o
+
s c .
Definition 1 The (t, c ) decisional BDHE
assumption holds if no t-time algorithm has a non-
negligible advantage c in solving the above game.
C Identity-based Threshold Broadcast Encryption
(IBTBE)
More formally, a threshold broadcast encryption scheme
consists of five algorithms.
Setup The randomized Setup algorithm takes as
input a security parameter k and outputs some public
parameters params, which will be common to all the
users of the system.
Extract The key generation algorithm is run by each
user ID
i
. It takes as input some public parameters params
and returns a correspondence private key
i
ID
d .
Threshold Encryption The encryption algorithm
takes as input a set of public keys corresponding to a set
P of n receivers, a threshold t satisfying 1 t s n, and a
message M. The output is a ciphertext C, which contains
the description of P and t.
s
Partial Decryption Partial Decryption algorithm
takes as input a ciphertext C for the pair (P, t) and a
secret key
i
ID
d of a receiver. The output is a partial
decryption value or a special symbol .
i
k
Decryption The deterministic final decryption
algorithm takes as input a ciphertext C for the pair (P, t)
and t partial decryptions corresponding k
i
to receivers in
some subset S P. The output is a message m or a
special symbol .
c
D Security Model
Concerning the security of the identity-based
cryptography, there are mainly two definitions:
Full security, which means that the attacker can
choose adaptively the identity he wants to attack (after
having seen the parameters);
Selective-ID security, which means that the
attacker must choose the identity he wants to attack at the
beginning, before seeing the parameters. The Selective-
ID security is thus weaker than full security.
To define the notion of chosen ciphertext secure
identity-based broadcast threshold decryption scheme
(IND-fullID-CCA) in the full security model, let us
consider the following game between an adversary A and
a challenger:
Setup The challenger runs Setup. Then challenger
gives the resulting common parameter to A, and keeps
master key secret. A issues the threshold parameters (n, t).
Phase 1 A issues private key extraction and
decryption queries adaptively. The adversary A
adaptively issues queries q
1
,, q
s0
, where q
i
is one of
the following:
On a private key extraction query upon ID
i
, the
challenger runs Extract to generate the private key
associated to ID
i
, then sends it to A.
, q, where q
i
is one of the following:
Extraction query (ID
i
), as in phase 1;
Decryption query, as in phase 1, but with the
constraint that Hdr = Hdr
*
. The challenger responds as in
phase 1.
Guess Finally, the adversary A outputs a guess
{0,1} b' e and wins the game if b = . b'
We say that if the above indistinguishability game
allow no decryption oracle query, then the IBTBE
scheme is only chosen plaintext(IND-fullID-CPA) secure.
There have been many methods to convert an IND-
fullID-CPA scheme to an IND-fullID-CCA scheme.
Therefore, we only focus on constructing the IND-fullID-
CPA scheme in this paper.
III NEW CONSTRUCTIONS (I)
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1603
2011 ACADEMY PUBLISHER
A. Our Construction
Let S =
1
{ , ,
n
} ID ID
, }
il
v
be n users, where
is an n-bit string and is an
1
{ ,
i i
ID v =
ij
v
n
l
-bit
string. Our construction works as follows:
Setup To generate the system parameters, the PKG
picks randomly generators in and an element
2
, g g G
o from
p
Z . Note that any user
i
ID will be associated to
a different element . This can be done by defining =
i
t
i
t
f (
i
ID ) for some n-1 degree polynomial function ( ) f x ,
where (0) f o = . PKG sets for 1 and
i
t
g
i
T = i s s n
1
g g
o
= . Then it chooses randomly h-length vector
3 31
( ,
3
,
n
) g g
, ,
=
=
g
i
U
1 i
and vectors ( ) in G , where
(U ). The public parameters PK are
1
, ,
n
U U
il
U
PK =( )
1 2 1 3 1
, , , , , , , , ,
n n
g g g T T g U U
and o is master key.
Extract(ID
i
) To generate a private key for a user
i
ID
p
Z e , the PKG first defines ( )
j
F x =
3
1
i
l
x
j ji
i
g u
=
[
for and 1 j n s s
1
( , , )
l
x x = x . Then it picks
random , and outputs the private key:
i
Z e
p
i
r
r
=
1 2
( , )
i
ID i i
d d d =
2
1
( ( ) , )
i i
n
t r
j
i
g F g
=
[
,
where = ( )
i i
F F ID =
3
1
ij
l
v
i i
j
j
g u
=
[
.
Threshold Encryption To encrypt a message M for
a set S = of n players, with threshold
for the decryption, the idea is to set up an ( )-
threshold secret sharing scheme, where . The
n public keys ( ) of users implicitly define a
degree polynomial. The idea is to compute the
values of this polynomial in the points (This will
lead to obtain the value of ).Then a sender acts as
follows:
1
{ , ,
n
ID ID
1
, ,
n
T T
}
t n s
1 n
, n
n t
N
2 N =
0 x =
1
g
Select a random element and compute
*
p
s Z e
1
s
C g = , ,
2 1 2
( , )
s
C e g g M =
3
1
( )
n
s
j
i
C F
=
=
[
.
Choose a set S of n t dummy players, such
that S S | = . For each user
i
ID S
'
e , compute
ij
i
i i
ID S
T T
e
'=
[
and
2
1
( , )
i s
i
e T g '
K = , where
ij
denotes
the Lagrange coefficients.
The ciphertexts are
1 2 3
( , , ,{ } )
i
i ID S
C C C K
'e
.
Note:
2 2
1 1
( , ) ( , )
i
i t s s
i
K
e T g e g g
'
= =
'
by using Lagrange
interpolation where . ( )
i i
t f ID ' ' =
Partial Decryption Given the ciphertexts
1 2 3
( , , ,{ } )
i
i ID S
C C C K
'e
, the receiver
i
ID S e with his
corresponding private computes as follows:
i
ID
d
3 1
0 1
( , )
( , )
i
i
i
e C d
K
e d C
= =
2
1
( , )
i
t s
e g g
.
Decryption Given the valid ciphertexts
1 2 3
( , , ,{ } )
i
i ID S
C C C K
'e
, a subset with
1
S c S
1
| | S t =
and corresponding t partial decryption
j
K , the
algorithm computes with the whole set
1
S S S = ' as
follows:
0 i
i
i
ID S
K K
' e
=
[
=
1 2
1
( , )
s
e g g
,
2
M K C = .
Efficiency In our scheme, the size of ciphertexts is
O( n t ) and the size of private key is constant as it
consists of two group elements. This is the first efficient
construction which has full security in the standard model
for the identity-based threshold broadcast encryption. In
addition, if the values and can be
precomputed and cached, so no pairing computations are
needed at the phase of Threshold Encryption. Table 1
gives the efficiency comparison between ours and the
others IBTBEs.
1 2
( , ) e g g
2
( , )
i
e T g
Note: R.O. denotes the random oracles. C-Size is the
size of ciphertext and pk is the private keys. SM denotes
the security model. Full and s-ID are full security and
selective-identity model.
B Security Analysis
Theorem 1 Suppose the n+1-wDBDHE assumption
holds. Then the proposed scheme above is semantically
secure against selective identity, chosen plaintext
attacks(IND-fullID-CPA).
Proof Suppose an adversary A has advantage c in
attacking our scheme. Using A, we build an algorithm B
that solves the decision n+1-wDBDHE problem in G with
the advantagec . For a generator g G e and
p
Z o e ,
set
i
i
Y g
o
=
( , g g
. Algorithm B is given as input a random
tuple where . Algorithm B's
goal is to output 1 when e g and 0 otherwise.
Algorithm B works by interacting with A in a threshold
full security game as follows:
0 1
, , T , , )
n
T T
0
g
0
g
G e
1
)
n
o
+
( ,
Initial A outputs a set S of identities that he wants
to corrupt, where| |
1 S n s
.
Setup B sets 2max{2 ,2 }
n
l
m q = = where is
the maximum query time for private query.
4q q
First, B selects random
integers
1 n
1 2
, , ,
n 1
o o o
e
p
Z . Let ( ) f x be the degree
1 n polynomial implicitly defined to satisfy (0) f o =
and ( )
i i
f ID o = for
i
ID S e
, note that B does not know
f since it does not know o . For , B computes
i
e ID S
1604 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
i
i
T g
o
= . Otherwise, B computes
i
o = ( )
i
f ID =
1
1
n
0 j j
j
o +
=
o with the Lagrange coefficients
j
.
Note that these Lagrange coefficients are easily
calculated since they do not depend on f . Then B
sets
0
1
j
j
ID S
i j
T T g
e
[
= .
Next, B chooses random l-length vectors
, ,
m
Z e
1 n
X X , where X =
i 1
{ , , }
i il
x x for 1 i n s s
m
.
Furthermore, B chooses random u from
1
, ,
n
u Z and
from
1 n
z , , z
p
Z , l -bit vectors from , ,
1
Y
n
Y
p
Z
where ={ ,
i
Y
1
,
i i
}
l
y y for 1 . Then it selects
for
i n s s
j
k 1 j n s s randomly from {0,1, ,
l
} , where
(2 1)
n
l
l
l = + . Some functions are defined as follows:
1
( )
l
,
j j j j
i
F ID p mk u v x
=
= +
ji ji
1
( )
l
j j j ji
i
J ID z v y
=
= +
ji
,
1
0 0 mod
( )
1
l
j ji ji
i
j j
if u v x m
K ID
otherwise
=
+ =
, 1 j l s s .
Then B constructs a set of public parameters for
the scheme by making the following assignments. B takes
as input a tuple TU = where
are random generators of G and
0 1
( , , , , , )
n
g g T T T
i
Y
0
, g g
i
g
o
= for some
random
p
Z o e . Then B chooses a random b
p
Z e and
assigns:
1 1
g Y g
o
= = , ,
2
n
b
g g g
o
=
3 1
j j j
u z p mk
j j n
g Y g
+
+
=
,1 j n i l s s s s
,
. It provides A the
1
,
ji ji
x y
j
g
+
= 1
ji n
u Y
PK =(
1 2 1 3 1
, , , , , , , , ,
n n
g g g T T g U U ).
Furthermore, this assignment means that the master secret
will be g g
2
(
n
b
g )
o o
=
1
o
which is unknown to B. Using
the definition of the public parameters, it shows that
( ) ( )
3 1
ji j
l
v F
j j ji
i
u Y
+
=
= =
[
j j
ID J ID
n j
F g g
j
0
.
Phase 1 A issues private key extraction and
decryption queries adaptively. The adversary A
adaptively issues queries
1
, ,
s
q , where is one of
the followings: Suppose the adversary A issues a query
for an identity
q
1
{ ,
j
q
}
j j
v v =
j
ID
0
l
. B checks
whether . It aborts if there is no such j. ( )
j j
K ID =
Otherwise, it answers the query as follows:(by using a
similar procedure as in case of the scheme [6]) B first
computes the Lagrange coefficients
0 1 1
, , ,
n
such
that t =
j
( )
j
f ID =
1
1
n
0 j j
j
o o
=
+
. Let
= { }
j
D S I =
S
1
{ , }
n
ID ID ' ' where
j j
ID ID ' = . B
selects a random
p
r Z ' e and generates the private key:
,
0 1 2
1
( , ) ( ( ) ,
j
j
n
t
r r
ID j j i
i
d d d g F g
=
= =
[
)
where
0
( )
j
j j
F ID
r r
o
' = . One can obtain that
j
ID
d is a
properly simulated private key for the identity
j
ID . In
fact,
0 j
d =
2
1
( )
j
n
t
r
i
i
g F
=
[
=
0
( )
n
n
j j
j b r
1
(
n
i
)
i
g g F
=
[
o
o =
= , (1)
1 1
1 1 0 0
1 1
1,
(
n n
j j j j
j j
n
b
b r r
n n j i
i i j
Y Y g Y F F
o o
= =
+
= =
[
)
Where
r
j
F =
0
( ) ( ) ( ) ( ) ( )
1 1
( ) ( )
j
j j j j j j j j j j
r
F ID J ID F ID J ID F ID
r
n j n j
Y g Y g
o
'
+ +
=
=
0
( ) ( ) ( ) ( ) ( )
1 1
( ) ( )
j
j j j j j j j j j j
F ID J ID F ID J ID F ID
r
n j n j
Y g Y g
o
'
+ +
=
0
0
( )
( ) ( ) ( )
1 1
( )
j j
j j j j j j
J ID
F ID F ID J ID r
n j n j
Y Y , Y g
'
+ +
r
i
F =
0
( ) ( ) ( ) ( ) ( )
1 1
( ) ( )
j
j j i i i i i i i i
r
F ID F ID J ID F ID J ID r
n i n i
Y g Y g
o
'
' ' ' '
+ +
=
=
0
( ) ( ) ( ) ( ) ( )
1 1
( ) ( )
j
j j i i i i i i i i
F ID F ID J ID F ID J ID r
n i n i
Y g Y g
o
o o
= =
+ +
'
+
= =
=
[
TABLE I. THE COMPARISON OF THE EFFICIENCY WITH THE OTHERS IDTHBE
Schemes Assumption C-Size pk Size Parings WithoutR.O. S.M
[10] DBDH n +1 1 1 +2 t NO Full
[11] DBDH n t 3 +2 1 t t NO Full
[12] DMBDH n 2 (0+ )+1 t NO Full
[13] n+1-DBDHE O( n t ) 2 2 YES s-ID
Ours n+1-wDBDHE O( n t ) 2 0+ t YES Full
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1605
2011 ACADEMY PUBLISHER
=
0
1 1
1 1 0 0 0
( )
( ) ( ) ( )
1 1 1 1
( )
j j
n n
j j j j
j j j j j j j j
J ID
b
F ID F ID J ID b r
n n n j n j
Y Y g Y Y Y Y g
o o
= =
'
+ + +
0 0
( ) ( )
( ) ( ) ( ) ( )
1 1
1,
(( ) )( )
i i i i
j j j j i i i i
F ID J ID
n
F ID F ID F ID J ID r
n j i j n i
i i j
Y Y Y Y g
' '
' ' '
+ +
= =
[
=
0
1 1
1 1 0
( )
( ) ( ) ( )
1 1
( )
j j
n n
j j j j
j j j j j j j j
J ID
b
F ID F ID J ID b r
n j n j
Y g Y Y Y g
o o
= =
'
+
0 0
( ) ( )
( ) ( ) ( ) ( )
1 1
1,
(( ) )( )
i i i i
j j j j i i i i
F ID J ID
n
F ID F ID F ID J ID r
n j i j n i
i i j
Y Y Y Y g
' '
' ' '
+ +
= =
[
.
Since cancels out, all the terms in this expression are
known to B. Thus, B can compute the first private key
component . B computes
1 n
Y
+
0 j
d
0
( )
j j
F ID r
j
Y g
'
=
r
g . Then the
second private key component is obtained.
Finally,
0 1
( , )
j
ID j j
d d d = is given to A.
Challenge A outputs two same-length
messages
0
,
1
M M
* *
S I =
, a threshold value t and a set of
identities on which it wishes to
be challenged. Note that . B first checks
whether there exists a
*
1 2
( , , ,
n
D ID ID
*
|S S
{1, , }
*
)
1 | t s
j n e such that
* *
( )
j j
F ID 0 = ,
then B will abort. Otherwise, B picks a random {0,1} e
and constructs the challenge ciphertexts as follows: =
*
C
* * *
1 2
(C C C
3
, , ,{ }
i
K )
= .
*
1
0
( )
0 1 0 0
( , ( , ) , ,{ } )
n
i i
i
i
J ID
b
i ID S
g M e g g T g K
=
e
where S
0
is a set of n - t dummy users. In addition, K
i
is
computed in the following manner: B first chooses a set
of dummy users such that
0
S n t
*
0
S S | = . For
each dummy user
0 i
ID S e , B computes the Lagrange
coefficients
ji
with 1 j n s s such that
* *
i
( )
j j j
ID S
i i
t f ID o
e
' '
i
= =
i
i
g T
o'
= . Then B computes
* *
ji
i
j
ID S
T
i
T
e
' =
[
.
Finally, B computes
1
2 0
1
( ,
n
ji i i
i
e g g
K
o'
=
=
)
.
Let
0
g g
. In fact, =
*
1
C g
, =
*
2
C
1 0
( ,
b
) Me g g
T
= =
1
1 0 0
( , ) ( , )
n
b
M e g g e g g
o
+ 1
1
( ( , ) ( , ))
n
b
M e g g e g g
o
+
=
1
( ( , ))
b
n
M e g g Y
1 2
( , ) M e g g
= ,
*
3
C =
*
*
1
( )
( )
0 0
1
n
i i
i i
n J ID
J ID
i
g g
=
=
=
[
i
*
( )
1
( )
i i
n
J ID
i
g
=
=
[
=
* *
( ) ( )
1
1
( )
i i i i
n
F ID J ID
n i
i
Y g
+
=
[
1
( )
n
i
i
F
=
=
[
.
and
1
0 2
1
( , )
n
ji i
i
i
K
e g g
o
=
'
=
=
1
2
1
( ,
n
ji i
i
e g g
o
)
=
'
=
2
2
1
1 1
( , )
( , )
ji
n
i
i
i
e T g
e T g
=
=
'
[
2
1
( , )
i
e T g
=
'
.
IfT is a random element of , gives no information
about B's choice of
1
G
*
C
.
Phase 2 The adversary continues to issue queries
and B responds as in phase 1.
Guess A outputs a guess {0,1} ' e and wins the
game if ' = . If ' = , B will output 1 to indicate that
B solves the n+1-wDBDHE problem, otherwise it outputs
0 to mean that it learns nothing from . When A
outputs 1, it means
*
C
1
2
|Pr( ) | ' c = > . Otherwise
1
2
Pr( ) ' = = . Therefore, we have
1
0
|Pr( ( , ( , ) ) 0) Pr( ( , ) 0)|
1 1
| | .
2 2
n
B TU e g g B TU T
o
c c
+
= =
> =
IV NEW CONSTRUCTIONS (II)
The first construction achieves full security in the
standard model. But the size of public keys is too large
and the computation cost of the private key is expensive.
In addition, the hardness assumption in our scheme is
strong.
As a natural extension to improve the first scheme,
we propose anther scheme in this section. It is based on
the dual encryption technique[14-17]. In addition, the
security of the proposed scheme is reduced to three static
(i.e. non q-based) assumptions.
A. Dual encryption technique
Recently, a new technique is applied to IBE. It is
called Dual Encryption Technique. In a dual
system[14,15], ciphertexts and keys can take on two
forms: normal or semi-functional. Semi-functional
ciphertexts and keys are not used in the real system, they
are only used in the security proof. A normal key can
decrypt normal or semi-functional ciphertexts, and a
normal ciphertext can be decrypted by normal or semi-
functional keys. However, when a semi-functional key is
used to decrypt a semi-functional ciphertext, decryption
will fail. More specifically, the semi-functional
components of the key and ciphertext will interact to
mask the blinding factor by an additional random term.
Security for dual systems is proved using a sequence of
games which are shown to be indistinguishable. The first
game is the real security game (with normal ciphertext
and keys). In the next game, the ciphertext is semi-
functional, while all the keys are normal. For an attacker
that makes q key requests, games 1 through q follow. In
game k, the first k keys are semi-functional while the
remaining keys are normal. In game q, all the keys and
the challenge ciphertext given to the attacker are semi-
functional. Hence none of the given keys are useful for
1606 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
decrypting the challenge ciphertext. At this point, proving
security becomes relatively easy. Waters[14] first
proposed a broadcast encryption scheme based on this
new technique. However, the proposed scheme is not
based on identity and also inefficient since its cost of
decryption is dependent on depth of users set.
B. Composite Order Bilinear Groups
Composite order bilinear groups were used in [14-16].
In this paper, the outputs is (N=p
1
p
2
p
3
, G , ,e), where
p
1
, p
2
, p
3
are distinct primes, G and are cyclic groups
of order N. A bilinear map e is a map e G
with the following properties:
1
G
1
G
1
G G :
(i) Bilinearity: for all , u v eG , a, be
N
Z , we have
= ; ( , )
a b
e u v ( , )
ab
e u v
(ii) Non-degeneracy: - g G e such that has
order N in .
( , ) e g g
1
G
(iii) Computability: there is an efficient algorithm to
compute for all . ( , ) e u v , u v eG
C. Static Hardness Assumption
In this section, we give our complex assumption.
These assumptions have been used in [14,15].
Assumption 1(Subgroup decision problem for 3
primes) Given (N=p
1
p
2
p
3
, , ,e), select randomly G
1
G
1
p
g G e ,
3
3 p
X G e
1
G
, , and set D=(N=
p
1
p
2
p
3
, , ,e, g,
1 2
p p 1
T G e
3
1
p
G
2
T e
G X ). It is hard to distinguish
fromT . The advantage of an algorithm is defined as
1
T
2
1 1 2
|Pr[ ( , ) 1] Pr[ ( , ) 1]| Adv A D T A D T = = = .
Definition 2 Assumption 1 holds if is
negligible.
1
Adv
Assumption 2 Given (N=p
1
p
2
p
3
, , ,e), pick
randomly
G
1
G
1
1
,
p
g X G e ,
2
, X
2
2 p
Y G e ,
3
p 3 3
, X Y G e , set
D=(N=p
1
p
2
p
3
, , ,e, g, ). Then select
, at random. It is hard to distinguish
fromT . The advantage of an algorithm is defined as
.
G
1
G
1 3
2 p p
T G e
2
2 1
Pr[ ( , ) 1 A D T = =
1 2 3
, , X X X
( , ) A D T
2 3
Y Y
2
1]|
1
T G e
1
T
| Adv ] Pr[ =
Definition 3 Assumption 2 holds if is
negligible.
2
Adv
Assumption 3 Given (N=p
1
p
2
p
3
, , ,e), pick
randomly
G
1
G
1
p
g G e ,
2
, X
2
p
G
2 2
, Y Z e ,
3
p 3
X G e ,
,
N
s Z o e
2 3
, set D=(N=p
1
p
2
p
3
, , , e, g, G
1
G
2
, , ,
s
2
g X X
o
g Y Z ). Then compute
1
( , )
s
T e g g
o
= and
pick randomly . It is hard to distinguish
from . The advantage of an algorithm is defined as
2
e
1
G T
1
T
2
T
3 1 2
|Pr[ ( , ) 1] Pr[ ( , ) 1]| Adv A D T A D T = = = .
Definition 4 Assumption 3 holds if is
negligible.
3
Adv
C. Construction
We give an initial construction at first. It works as
follows:
Let S = { } be n players where
1
, ,
n
ID ID
i
ID e
p
Z . These users want to form an ad hoc network.
Our construction works as follows:
Setup: To generate the system parameters, the PKG
picks randomly generators { , } in G
and an element
2
, g g
1
, , ,
n
h h h
o from
p
Z . Note that any user
i
ID will
be associated to a different element . This can be done
by defining =
i
t
i
t f (
i
ID ) for some n-1 degree polynomial
function ( ) f x , where (0) f o = . PKG sets for
i
t
g
i
T =
1 i n s s and
1
g g
o
= . The public parameters PK are
PK =(
1 2 1 1
, , , , , , , , ,
n n
g g g T T h h h )
and o is master key.
Extract(ID
i
) : To generate a private key for a user
i
ID
p
Z e , the PKG picks randomly and also
chooses random elements
i
r Z e
0 0 i i
R R
p
,
1
, , ,
i
R '
( 1 i i ) ( i i
R R
+1)
, , ,
in
R
3
p
G e . Then it computes private
keys as follows:
0 1
', , ,
, 1 ,
,
1
, , ) ( ,
i
ID i
d d =
i
d d
i i i
d d
+ i in
d
2 0 0 1 1 1 ( 1) 1 ( 1)
( ( ) , , , , , , , ).
i i i i i i i
ID r r r r r r
i i i i i i i i i i n in
g hu R g R u R u R u R u R
o
+ +
' =
Threshold Encryption: To encrypt a message M for
a set S =
1
{ , , }
n
ID ID
t n
of n players, with threshold
s for the decryption, the idea is to set up an ( )-
threshold secret sharing scheme, where . The
n public keys ( ) of users implicitly define a
, n N
2 N n t =
1
, ,
n
T T
1 n degree polynomial. The idea is to compute the
values of this polynomial in the points (This will
lead to obtain the value of ).Then a sender acts as
follows:
0 x =
1
g
Select a random element and compute
*
p
s Z e
1
s
C g = , and
2 1 2
( , )
s
C e g g M =
3
1
i
n
ID
( )
s
i
i
C
=
=
[
h h .
Choose a set S of dummy players, such
that
n t
S S | = . For each user
i
ID S
'
e , compute
ij
i
i i
ID S
T T
e
' =
[
and
2
1
( , )
i s
i
e T g '
K = , where
ij
denotes
the Lagrange coefficients.
The ciphertexts are
1 2 3
( , , ,{ } )
i
i ID S
C C C K
'e
.
Note:
2 2
1 1
( , ) ( , )
i
i t s s
i
K
e T g e g g
'
= =
'
by using Lagrange
interpolation where ( )
i i
t f ID ' ' = .
Partial Decryption: Given the ciphertexts
1 2 3
( , , ,{ } )
i
i ID S
C C C K
'e
, the receiver
i
ID S e with his
corresponding private computes as follows:
i
ID
d
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1607
2011 ACADEMY PUBLISHER
3 0
2
0 1
1,
( , ) 1
( , )
( , )
i
j
i
i n t s ID
i ij
j j i
e C d
K
e g g
e d d C
= =
'
= =
[
.
Decryption: Given the valid ciphertexts
1 2 3
( , , ,{ } )
i
i ID S
C C C K
'e
, a subset with
1
S S c
1
| | S t = and
corresponding partial decryption t
j
K , the algorithm
computes with the whole set
1
S S S ' = as
follows:
0 i
i
ID
K K
i
S
'
=
e
[
=
1 2
)
1
( ,
s
e g g
and
2
M K C = .
Correctness:
In fact, if the ciphertexts is valid, then
one can obtain the following equation holds.
0 1 2
( , , ) C C C C =
1
0 2
1,
( , ')
( ,
j
n
ID
i ij
j j i
e C d
e d d C
= =
[
)
=
0
1
2 0
1 1,
(( ) , )
( ( ) ( ), )
i i
j i i i
n
ID r s
i i
i
n n
ID t ID r s
i i ij
i j j i
e h u g R
e g h u R R g
=
= = =
'
[
[ [
0
1 1
2 0
1 1,
(( ) , ) (( ) , )
( , ) (( ) , ) ( , ) (( ), )
i i i
j i i i
s n
ID r ID s s
i i i
i i
n n ID t ID r s s s
i i ij
i j
e h u g e h u R
e g g e h u g e R g e R g
= =
= =
'
=
[ [
[ [
n
j i =
=
2
1
( , )
i
t s
e g g
.
Note: In the previous equation, the orthogonality property
of is used. It is described simply as follows.
1 2
, ,
p p p
G G G
3
p
Lemma[14] When h G for ,
i j
i p j
h G e e i j = ,
is the identity element in . ( , )
i j
e h h
1
G
By using this lemma, one can obtain
0 0
1 1,
(( ) , ) ( , ) (( ), ) 1.
j i
n n
ID ID s s
i i i ij
i j j
e h u R e R g e R g
= =
' = =
[ [
s
i =
=
Efficiency analysis:
Our construction achieves O(1)-size ciphertexts. The
private key of construction private key is linear in the
maximal size of S. In addition, e g and can
be precomputed, so there is no pair computations at the
phase of Encryption. Furthermore, the security of the
proposed scheme is reduced to the static assumptions.
These assumptions are more natural than those in the
existing schemes. However, the size of private keys relies
on the number of set S. Based on the proposed scheme,
we can give the main construction (II).
1 2
( , ) g
2
( , )
i
e T g
Setup, Encryption and Decryption are similar to the
first scheme.
Extract(ID
i
) : To generate a private key for a
user
i
ID
p
Z e
in
R R e
i
, the PKG picks randomly r and
also chooses random elements
. Then it computes private keys as
follows:
i
Z e
1
, ,
i i
R
Partial Decryption: Given the ciphertexts
1 2 3
( , , ,{ } )
i
i ID S
C C C K
'e
, the receiver
i
ID S e with his
corresponding private d computes as follows:
i
ID
3 1
0 1 2
( , ) 1
( , ) ( , )
i
i
i t s
i
e C d
K
e d C e g g
= = .
Correctness can be easily obtained. I
Table 2 give the comparisons of efficiency with our
two schemes.
p
0 0
, ,
i i i
R R R '
( 1)
,
( 1)
, ,
i i +
3
p
G
( ,
0 1
)
ID i
d d
i
d =
2
(
i i
n
t I
j j i
0
)
r
i 0
i
1,
( ( ) ,
i
D r
i j
g hu
= =
R g R' =
[
.
TABLE II THE COMPARISON OF THE EFFICIENCY BETWEEN OUR TWO
SCHEMES
Schemes
ASSUMPTION
C-Size
pk
Size
D. Security analysis
In this section, we will prove the security of the proposed
scheme. We first define semi-functional keys and semi-
functional ciphertexts. Let denote a generator of .
2
g
2
p
G
Semi-functional keys: At first, a normal key
0 1 1 1
( , ', , , , , , )
i i n
d d d d d d
+
is obtained using the
Extract algorithm. Then some random elements
0 0
, ,
j
' for 1, , j n = and are chosen in j i =
N
Z . The
semi-functional keys are set as follows.
0
0 0 2
d d g
= ,
2
d d g
' ' = ,
2
j
j j
d d g
= , , 1, , j = n j i = .
Semi-functional ciphertexts: At first, a normal semi-
functional ciphertext ( ,
0 1 2
, ) C C C ' ' ' is obtained using the
Encrypt algorithm. Then two random elements
1 2
, are
chosen in
N
Z . The semi-functional ciphertexts are set as
follows. C C
0 0
' = , C C
1 2
1 1 2
g
'
= ,
2
2 2 2
C C g
' = .
We organize our proof as a sequence of games. The
first game defined will be the real identity-based
encryption game and the last one will be one in which the
adversary has no advantage unconditionally. We will
show that each game is indistinguishable from the next
(under three complexity assumptions). We first define the
games as:
Parings
Construction
(I)
n+1-wDBDHE O( ) n t t 2 0+
Construction
(II)
Static
assumptions
O(1) 2 2
Game
real
: This is a real IBTBE security game.
For 0 i q s s , the Game
i
is defined as follows.
Game
i
: Let O denote the set of private keys which
the adversary queries during the games. This game is a
real IBTBE security game with the two exceptions: (1)
The challenge ciphertext will be a semi-functional
ciphertext on the challenge set . (2) The first i keys will
be semi-functional private keys. The rest of keys in
*
S
O
will be normal.
Note: In game
0
, the challenge ciphertext is semi-
functional. In game
q
, the challenge ciphertexts and all
keys are semi-functional.
Game
final
: This game is same with Game
q
except that
the challenge ciphertext is a semi-functional encryption
1608 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
[13] L.Y. Zhang, Y.P. Hu and Qing Wu. "Identity-based
threshold broadcast encryption in the standard model".
KSII Trans. on internet and information systems .Vol. 4,
NO. 3, J une 2010pp.400-410.
of random group element of G .
1
It can be easily shown that these games are
indistinguishable in a set of Lemmas. For the pages
limited, we omit them here(It can be obtained from [16-
18]). Then we have the following theorem.
[14] B. Waters. "Dual system encryption: realizing fully secure
ibe and hibe under simple assumptions". Proceeding of
Advances in Cryptology-Crypto, LNCS 5677, Berlin:
Springer-Verlag press, 2009, pp. 619-636.(The full paper
appeared Cryptology ePrint Archive Report 2009/385 )
Theorem 2 If Assumption 1,2 and 3 hold, then our
IBTBE is IND-ID-CPA secure.
[15] A. Lewko and B. Waters. "New Techniques for Dual
System Encryptio n and Fully Secure HIBE with Short
Ciphertexts". Proceeding of the 7th Theory of
Cryptography Conference 2010, LNCS 5978, Berlin:
Springer-Verlag press, 2010, pp. 455479.
ACKNOWLEDGEMENT
This work is supported in part by the Nature Science
Foundation of China under grant (60970119, 60803149),
the National Basic Research Program of China(973)
under grant 2007CB311201 and the Fundamental
Research Funds for the Central Universities(Public key
broadcast encryption based on new mathematical
hardness assumptions).
[16] L.Y. Zhang, Y.P. Hu and Q. Wu. "Fully Secure Identity-
based Broadcast Encryption in the Subgroups". China
Communications, 2011, Vol. 8, No. 2, 152-158.
[17] L.Y. Zhang, Y.P. Hu and Q. Wu. "Adaptively Secure
Identity-based Broadcast Encryption with constant size
private keys and ciphertexts from the Subgroups".
Mathematical and Computer Modelling(In press). Online
press, 2011, http:// dx.doi.ort/10.1016/j.mcm.2011.01.004.
REFERENCES
[1] A. Fiat, M. Naor. "Broadcast encryption". In: Proceedings
of CRYPTO, Berlin: Springer-Verlag, LNCS 773, 1994,
pp. 480-491.
[18] L.Y. Zhang, Q. Wu and Y.P. Hu. Adaptively Secure
Identity-based Encryption in the Anonymous
Communications. ICIC Express Letters, Vol. 5. No. 9(A),
2011, pp. 3209-3216. [2] H. Ghodosi, J . Pieprzyk and R. Safavi-Naini. "Dynamic
threshold cryptosystems: a new scheme in group oriented
cryptography". In: Proceedings of Pragocrypt 96, CTU
Publishing House, 1996 , pp. 370-379.
Leyou Zhang: male. He received his M.E.
and Ph.D. degrees in Applied Mathematics
form Xidian University, Xian, China in
2002 and 2009, respectively. Now he is an
Associate Professor in the department of
Mathematical science of Xidian University.
His current research interests include
network security, computer security, and
cryptography. He has published more than thirty papers in
international and domestic journals a
[3] A. Shamir, "Identity-based Cryptosystems and Signature
Schemes", In: Proceedings of CRYPTO, Berlin: Springer-
Verlag, LNCS 196, 1984, pp. 4753.
[4] D. Boneh and M. Franklin. "Identity-based encryption
from the well pairing". In: Proceedings of CRYPTO,
Berlin: Springer-Verlag , LNCS 2193, 2001, pp. 213-229.
[5] D. Boneh and X. Boyen. "Efficient selective-id secure
identity based encryption without random oracles". In:
Proceedings of EuroCryp, Berlin: Springer-Verlag, LNCS
3027, 2004 , pp. 223-238,.
nd conferences.
Qing Wu: female. She received her Ph.D.
from the Xidian University in 2009. Now she
is an Associate Professor in the school of
automation of Xian institute of posts and
telecommunication. Her current research
interests include information security and
applied mathematics. She has published more
than twenty papers in international and
domestic journals and conferences.
[6] C. Cocks. "An identity based encryption scheme based on
quadratic residues". In: Proceedings of Cryptography and
coding, Berlin: Springer-Verlag, LNCS 2260, 2001, pp.
360-363.
[7] D. Boneh and J . Katz. "Improved Efficiency for CCA-
Secure Cryptosystems Built Using Identity-Based
Encryption". In: Proceedings of CT-RSA, Berlin: Springer-
Verlag, LNCS 3376, 2005, pp. 87-103.
[8] R. Canetti, S. Halevi, and J . Katz. "Chosen-ciphertext
security from identity-based encryption". In: Proceedings
of EuroCrypt, Berlin: Springer-Verlag, LNCS 3027, 2004,
pp. 207-222.
Yupu Hu: male. He received his Ph.D.
from the Xidian University in 1999. Now
he is a Professor in the School of
Telecommunications Engineering of
Xidian University. His current research
interests include information security and
cryptography. He has published more
than a hundred papers in international
and domestic journals and conferences. He is a Member
of China Institute of Communications and a Director of
Chinese Association for Cryptologic Research.
[9] S. Chattterjee and P. Sarkar. "Generalization of the
Selectve-ID Security Model for HIBE Protocols". In:
Proceedings of PKC, Berlin: Springer-Verlag, LNCS 3958,
2006, pp. 241-256.
[10] Z. Chai, Z. Cao and Y. Zhou. "Efficient ID-based
Broadcast Threshold Decryption in Ad Hoc Network". In:
Proceedings of IMSCCS 06, IEEE Computer Society,
Volume 2, 2006, pp. 148-154.
[11] V. Daza, J . Herranz and P. Morillo etal. "CCA2-Secure
Threshold Broadcast Encryption with Shorter Ciphertexts".
In: Proceedings of ProvSec 2007, Berlin: Springer-Verlag,
LNCS 4784, 2007, pp. 35-50.
[12] C. Delerablee and D. Pointcheval. "Dynamic Threshold
Public-Key Encryption". In: Proceedings of CRYPTO,
Berlin: Springer-Verlag, LNCS 5157, 2008, pp. 317-334.
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1609
2011 ACADEMY PUBLISHER
A Power Allocation Algorithm Based on
Cooperative Game Theory in Multi-cell OFDM
Systems
Ping Wang
1, 2
1 Broadband Wireless communications and Multimedia laboratory, Tongji University, Shanghai, China.
2 Shanghai Key Laboratory of Digital Media Processing and Transmission, Shanghai, China
Email: [email protected]
J ing Han
1
, Fuqiang Liu
1
, Yang Liu
1
, J ing Xu
3
1 Broadband Wireless communications and Multimedia laboratory, Tongji University, Shanghai, China.
3 Shanghai Research Center for Wireless Communications, Shanghai, China
Email: {han_han0307, liufuqiang, yeyunxuana}@163.com
AbstractA centralized resource allocation algorithm in
multi-cell OFDM systems is studied, which aims at
improving the performance of wireless communication
systems and enhancing users spectral efficiency on the edge
of the cell. The proposed resource allocation algorithm can
be divided into two steps. The first step is sub-carrier
allocation based on matrix searching in single cell and the
second one is joint power allocation based on cooperative
game theory in multi-cell. By comparing with traditional
resource allocation algorithms in multi-cell scenario, we find
that the proposed algorithm has lower computational
complexity and good fairness performance.
Index TermsOFDM, resource allocation, game theory,
multi-cell, cooperation
I. INTRODUCTION
In multi-cell systems, it is a great challenge to use the
limited radio resources efficiently. Resource allocation is
an important means to improve spectrum efficiency in
interference limited wireless networks. In distributed
systems, a user usually has no knowledge of other users,
so a non-cooperative game model is built. In such model,
SIR (signal-to-interference ratio) is used to measure
system utility and create a utility function. Each
unauthorized user allocates resource independently only
to maximize its own utility to reach Nash equilibrium.
However, for the whole system, system utility is probably
not the best.
Therefore, when non-cooperative game theory is
applied in resource allocation, there is always a conflict
between individual benefit and system benefit [1].
Though some methods, such as the use of the price
function, have been proposed to solve this problem, they
are difficult in practice. D. Goodman firstly applied non-
cooperative game theory to power allocation in CDMA
systems [10-12]. In [13-14], the authors studied multi-cell
power control in different aspects. The algorithms first
chose the optimal cell and then implemented power
control among users in single cell. In [15-16], the authors
studied static non-cooperative game. The algorithm in [15]
implemented more severe punishment to users with better
channel condition, so that it effectively kept good fairness
among different users. [17] used dynamic game model. It
proposed a distributed power control algorithm based on
potential game model. In [18], the power allocation
among cells was carried out by non-cooperative game,
but it did not give the solving process. All these work are
mainly based on non-cooperative game theory, which
may not maximize the whole system utility.
In centralized wireless networks, since resource
allocation and scheduling are performed by a central base
station, a cooperative game theory model can be built for
resource allocation. In such model, users can cooperate
and consult with each other and the system utility is
theoretically optimal [6]. Hence, this paper focuses on
centralized resource allocation in multi-cell systems. The
best resource allocation scheme can only be obtained by
jointly allocating subcarriers and power among cells. But
the computational complexity is too high to realize. Most
practical resource allocation algorithms generally consist
of two steps. The first step is to allocate sub-carriers in
single cell. The second one is to jointly allocate power in
multi-cell.
This paper proposes an algorithm for multi-cell
resource allocation in broadband multi-carrier system,
which includes:
(1) Sub-carrier allocation
A new sub-carrier allocation algorithm based on matrix
searching in single cell is proposed. Firstly, the initial
power allocation is finished based on channel
environment and rate ratio constraint of different users.
Then sub-carriers are allocated after taking both the
maximal sum-rate capacity and users fairness into
account, which guarantees the benefit of users located on
the cell edge and makes users with poor channel
condition obtain sub-carriers as well. Whats more, the
complexity is reduced compared with the algorithm
1610 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
doi:10.4304/jnw.6.11.1610-1617
which allocates sub-carriers first and then exchanges sub-
carriers [3].
(2) Power allocation
The main problem in multi-cell systems is co-channel
interference among adjacent cells. After using statistical
channel state information, this algorithm introduces an
idea of cooperative game theory. Aiming at maximize the
net utility of system (i.e., QoS-satisfied function based on
sum-rate capacity), the proposed algorithm models
resource allocation process like a cooperative game
among users in different cells, and the Nash bargaining
solution (namely the assignment result of sub-carriers and
power) can be obtained through a power allocation
algorithm whose complexity is controllable. The
simulation shows that this algorithm can approximate the
maximal sum-rate capacity of a multi-cell system while
meet users QoS fairness as well.
The reminder of this paper is organized as follows.
Section 2 gives the system model in single cell and multi-
cell. And then the proposed resource allocation algorithm
is presented in detail in section 3. At last, the
effectiveness and rationality of the proposed algorithm
are verified by comparing with other traditional
algorithms in section 4. Finally, a conclusion is made in
section 5.
II. SYSTEM MODEL
Figure 1. Resource allocation model in multi-cell
The system model is shown in Fig. 1. Assume that the
total band of the system is B and the number of sub-
carriers is C. The multi-access mode is orthogonal
frequency division multiple access (OFDMA). The
degree of fast fading in adjacent sub-carriers is similar so
that a group of S consecutive sub-carriers with similar
fading characteristics can be seen as a sub-channel.
Therefore, the number of sub-channels (denoted by N) is
C/S and the labels of them are denoted from 1 to N.
Considering I adjacent cells with co-frequency
interference, the number of active users in each cell is K.
Assume that the CSI (channel state information) detected
by a mobile station can be fed back to the base station
through control channel without error. The base stations
among adjacent cells are connected by optical fiber and
control information is real time transmission.
A. Single Cell System Model
For a single cell, the downlink resource allocation
model is described as follows:
{ }
, ,
2
, , ,
2
,
1 1 0
1 ,
1 1
2 ,
3 ,
4 ,
1
5 1 2 1 2
max log 1
/
:
: 0,
: 0,1
: 1,
: : : : :
k n k n
K N
k n k n k n
p
k n
K N
k n total
k n
k n
k n
K
k n
k
K K
p h
U
N N B N
C p p
C p
C
C
C R R R r r r
= =
= =
=
= +
=
=
=
,
(1)
where
2
, k n
h
is channel gain of user k on sub-channel n,
and
, k n
p is the power assigned to user k on sub-channel
n. Each sub-channel can be considered as an additive
white Gaussian noise (AWGN) channel, and
0
N
is the
power spectral density of such channel.
total
P
presents the
total transmission power .
, k n
can only be 0 or 1. If it is
equal to 1, it means that sub-channel n is assigned to user
k. Otherwise, it is 0. Define the signal-to-noise ratio
(SNR) of user k on sub-channel n as
2
, , 0
/ ( / )
k n k n
S h N B N =
and the corresponding
receiving SNR as
, , k n k n
p S .
1
C restricts that the sum of
transmission power of all users does not exceed the
maximum transmission power of base station.
2
C
restricts that the power assigned to a sub-carrier is not
negative.
3
C
restricts that a sub-channel only stays in
two states, assigned or unassigned.
4
C
restricts that a
sub-channel can only be assigned to one user.
5
C specifies that the rates the users obtained must meet
the requirements of ratio constraints, in which
1 2
: :
K
R R R
are the rates obtained by users and
1 2
: :
K
r r r are the requirements of ratio constraints
which should be satisfied.
In this model we assume that users experience
independent multipath Rayleigh fading. A base station
can obtain the entire CSI. Sub-channel is a basic unit
during allocation. Generally, users are located in different
places of a cell, so the transmission loss and shadow
fading are different. Therefore, the channel gain can be
further expressed as follows:
, , , , k n k n k n k n
h l s g =
(2)
where
, k n
l ,
, k n
s and
, k n
g
represent transmission loss,
shadow fading and multipath fading of user k on sub-
channel n, respectively. The mean gains of them are
assumed to be 0dB. If the time scale of resource
allocation is a transmission time interval (TTI) and the
unit is millisecond, shadow fading and transmission
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1611
2011 ACADEMY PUBLISHER
fading will only depend on users location. Therefore, the
mean SNR can be expressed as:
2
, , 0
/ ( / )
k k n k n
S l s N B N =
(3)
Define the sum-rate capacity of user k as follows:
2
, ,
0
1
log(1 )
/
k
k n k n
k
n
p h
R
N N B N
= +
(4)
where
k
is the set of sub-channels which user k uses.
B. Multicell System Model
Given I adjacent cells with co-frequency interference,
the number of active users in each cell isk .
, , i k m
p and
, , i k m
h represent the transmission power and channel gain
of userk
on sub-channel m in base stationi , respectively.
, ,
i
j k m
h is the channel gain of this user on sub-channel m
in co-frequency cell j .
The SIR of this user can be expressed as:
, , , ,
, ,
2
, , , ,
1
i k m i k m
i k m K
i
j k m j k m
j i k
h p
h p
=
=
+
(5)
In multi-cell OFDMA systems, sub-channel allocation
is assumed to be finished in each cell.
, i m
k
represents the
user who is assigned sub-channel m in cell i . Then the
set of users who need power allocation on co-frequency
sub-channel m are
1, ,
{ ... }
m I m
M k k = . Since a sub-channel
in one cell can only be assigned to one user during one
TTI, the number of users in co-frequency channel equals
the number of co-frequency cells. Each co-frequency
channel is independent. Therefore, the power allocation
in I adjacent cells is equivalent to the power allocation
among I users on co-frequency channels, and the
maximizing of system throughput is equivalent to the
maximizing of capacity sum on each co-frequency
channel in each cell. This can be achieved through
cooperation in multi-cell.
. RESOURCE ALLOCATION SCHEME
A. Sub-carrier Allocation
Users channel gain is determined by transmission loss,
shadow fading and multipath fading. Besides,
transmission loss, shadow fading, together with users
rate ratio are only relative to users. Hence, we can assume
that the sub-channels assigned to each user have the same
initial power. In a view of average, users sum-rate
capacity also needs to satisfy the requirement of rate ratio
constraints. Therefore, how to optimize the assignment of
sub-carriers is presented as follows.
,
1
1
1
2 ,
3
1
4
5 1 2 1 2
1
max log(1 )
:
: 0,
:
:0
: : : : :
k k
K
k k k
x p
k
K
k k total
k
k n
K
k
k
k
K K
U x p S
N
C x p P
C p
C x N
C x N
C R R R r r r
=
=
=
= +
=
=
=
(6)
where
k
x is a positive integer, representing the number of
sub-channels assigned to user k.
k
p represents the initial
transmission power of user k on each sub-channel.
K
R
represents the sum-rate capacity of user k on average.
To solve (6), Lagrange multiplier method is used. The
cost function L is written as follows:
1
1 1
1
1 1 1
2
1
log(1 )
( ) ( )
1
( log(1 ) log(1 ))
K
k k k
k
K K
k k total k
k k
K
k k k k
k k
L x p S
N
x p P x N
r
x p S x p S
N r
=
= =
=
= +
+ +
+ + +
(7)
where
2
{ }
K
k k
=
are Lagrange multiplier. Find
the partial derivative of L in relation to
k
x and
k
p . The
following equation is derived after setting both
derivatives to 0:
1
1 1
1 1
1 1
1 1
ln(1 )
1 1
ln(1 )
1 1
k k
k k
k k
k
k k
p S S
p S
p S p S
S p S
p S
p S p S
+
+ +
= +
+ +
(8)
(8) is correct for 2,3,..., k K = . Combining (8) with (6),
the optimal initial power allocation on average can be
achieved through Newton-Raphson method [2].
After the initial power is determined, the sub-carrier can
be assigned in single cell. The proposed sub-carrier
allocation is based on matrix-searching and has three
steps. First, the matrix of sum-rate capacity of K users on
N sub-channels is figured out. Thus the question becomes
how to find the corresponding user for each sub-channel
in a KN matrix. Second, sort all the users according to
their rate requirements and assign sub-carriers for the first
time, ensuring that each user is assigned a sub-channel at
least. Define the fairness function as:
k
k
k
R
r
= (9)
where
k
R represents the obtained sum-rate capacity.
k
r
represents the required ratio of rate capacity. Finally,
finish allocating sub-carriers based on the fairness
1612 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
function . The algorithm is described in detail as shown
in Fig. 2:
Figure 2. The matrix-searching based sub-carrier allocation algorithm
in single cell
B. Power Allocation Strategy Based on Game Theory
Power allocation on co-frequency sub-channel is
realized via a cooperative game process. To make a user
not simply pursue the utility maximization in power
allocation, the interference to other users should also be
considered. Thus the pricing function of transmission
power obtained by each user is introduced into the
cooperative game theory, which represents the cost that
the user has to pay for using system resource. The system
utility can reach the optimal state when each user arrives
at a tradeoff between the obtained utility and the
produced interference.
In a multi-cell system, let { , , , , } G P A S I U =
present a
gaming process, whose parameters are described below:
(1) P represents participants, who are a set of users
experiencing co-frequency interference on the same sub-
channel of each cell.
(2) A represents strategy set, which include
1
{ ,..., }
I
A p p = , where
,1,1 , ,
[ ,..., ]
i i i k n
p p p = .
(3) S represents the gaming order, whose default value
is conducting strategy choosing at the same time.
(4) I represents information. Every participant in game
knows the strategy choices of all participants.
(5) U represents the income, that is, utility function. Let
vector P represent the set of transmission power obtained
by all users after the game, and ( , )
c
k k k
u p p
represent
the net utility obtained by user k in the end, which can be
expressed as follows:
( , ) ( , ) ( , )
c
k k k k k k k k k
u p p u p p c p p
= (10)
where
k
p and
k
p
is the utility function of user k ignoring
pricing factor. ( , )
k k k
c p p
= =
=
(11)
In this paper, the sum-rate capacity is used as the utility
function, and on any co-frequency sub-channel, the utility
function of user k is:
, , , ,
2
, , , ,
( , )
log(1 )
k k k
i k m i k m
i
j k m j k m
j i
u p p
p h
h p
= +
+
(12)
where
, , k i k m
p p = and the pricing function increases
linearly with the transmission power as follows:
( , )
k k k k k
c p p p
= (13)
where
k
represents the pricing factor, which gives the
price of power per unit. If the priorities of users are
identical, so is the pricing factor. The function of net
utility is derived below:
, , , ,
2
, , , ,
( , )
log(1 )
c
k k k
i k m i k m
k k i
j k m j k m
j i
u p p
p h
p
h p
= +
+
(14)
After sub-carriers are finished allocation in single cell,
multi-cell power allocation based on cooperative game
theory are conducted, which is shown in Fig. 3. The
gaming goal in a multi-cell system is to obtain Nash
bargaining solution, which maximizes the net utility of
system. To guarantee the fairness among users, the
system strategy set is updated until the following
conditions are met: the number of cells with power gain
is larger than that with power loss and the whole system
must obtain power gain. If the net utility does not
increase after the strategy set is changed, Nash bargaining
solution is obtained.
(1) Initialize 0
k
R = ,
k
= ,
,
0
k n
= ,
k =1,2,K , A={1,2,N }
(2) Figure out the matrix of rate capacity of k users on
n sub-carriers, denoted by
, 2 ,
1
log (1 )
k n k n k
R p S
N
= +
(3) Sort users according to their required rate in
descending order and repeat K times to find n in row
vector of matrix R, which should satisfy
, , k n k j
R R and j A . Then set
,
1
k n
= ,
{ }
k k
n = , { } A A n = ,
, k k k n
R R R = +
(4) WhenA
,
find k which satisfies
k i
and
1 i K
.
If k is obtained, find n which satisfies
, , k n k j
R R
and j A . Then set
,
1
k n
= ,
{ }
k k
n = , { } A A n = ,
, k k k n
R R R = +
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1613
2011 ACADEMY PUBLISHER
Figure 3. The multi-cell power allocation algorithmbased on
cooperative game theory
. SIMULATION ANALYSIS
A. Simulation environment
The power allocation algorithm in multi-cell OFDM
systems is simulated by MATLAB. Frequency selective
channels contain six independent Rayleigh multipath and
the maximum delay spread is 5us. Other system
parameters are shown in Table I. This simulation uses
discrete event-driven mechanism for dynamic simulation.
In order to obtain stable and reliable performance, the
results are obtained from the average of 10000
implementations on random channel.
TABLE I.
SIMULATION PARAMETERS
Parameters Value
Number of co-frequency cells 2
Number of sub-carriers 1024
Total transmission power 1W
Total systembandwidth 1M
Number of users 2-10
AWGN power spectral density -80dBw/Hz
Average channel gain 0-30dB
The proposed algorithm is compared with other three
traditional resource allocation algorithms, namely:
Algorithm 1: represents the proposed algorithm,
which includes sub-carrier allocation based on matrix
searching and power allocation based on cooperative
game theory.
Algorithm 2: consists of direct sub-carrier allocation
[7-9] and equal power allocation.
Algorithm 3: consists of direct sub-carrier allocation
[7-9] and Water-filling power allocation [4][5].
Algorithm 4: consists of the proposed sub-carrier
allocation in the paper and equal power allocation.
B. Results and discussion
Fig. 4 and Fig. 5 compare the four resources allocation
algorithms from the perspective of system capacity,
which plot the normalized system throughput. The users
rate ratios in Fig. 4 and Fig. 5 are 421:1 and
equal, respectively. As shown in Fig. 4 and 5,
algorithm 3 has the highest throughput and the
proposed algorithm has better throughput than
algorithm 2 and 4. This is because in algorithm 3, sub-
carrier allocation and power allocation are based on
throughput, which can achieve the highest system
throughput. Although sub-carrier allocation based on
matrix searching in algorithm 1 will lose part of
system throughput for its paying attention to fairness,
the result of total resource allocation in algorithm 1 is
still better than those in algorithm 2 and 4. This is
because algorithm 1 allocates power based on
cooperative game theory, which is to maximize the net
utility of system and thus can approximate the maximal
sum-rate capacity in multi-cell systems. Furthermore, its
superiority is enhanced as the number of users is
increased and the system capacity is more approximate
to that of algorithm 3.
Figure 4. Systemthroughput when the requirements of rates are
unequal
(1) Allocate sub-carriers in every cell. The result is
taken as the initial value of multi-cell power
allocation.
(2) For each co-frequency sub-channel, calculate the
net utility function of each user in single cell
without or with the cooperation, which are denoted
by
( , )
k k k
u p p
and
( , )
c
k k k
u p p
, respectively.
Let
( , ) ( , )
c
k k k k k k k
u u p p u p p
=
.
(3) Starting from the user with the smallest
k
u ,
change the choice of strategy set and re-calculate the
net utility function. If the net utility increases and
the number of cells with power gain is larger than
that with power loss, update the strategy set.
Otherwise, return to step (2).
(4) Loop all the co-frequency users in sequence till
the net utility of system does not increase.
(5) Loop all the sub-channels in sequence till the
system strategy set does not change. To this point,
the strategy set is the result of Nash bargaining
solution.
1614 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
Figure 5. Systemthroughput when the requirements of rates are equal
Next the proposed algorithm is compared with
algorithm 3 and 4 from the perspective of system
fairness. Figs. 6, 7 and 8 plot the sum-rate capacity of
all users in a cell with users rate ratio of 4:2:1: ...: 1
when the number of users in each cell is 3, 5 and 8,
respectively. We can see that the proposed algorithm is
the best in terms of fairness and it can most approximate
the requirement of users' rate ratio. Algorithm 4 is better
than algorithm 3 in terms of fairness because it adopts the
proposed subcarrier allocation algorithm in this paper.
However, algorithm 4 has smaller sum-rate capacity than
algorithm 1 and its fairness is also slightly weaker. This is
because equal power allocation does not differentiate
channel gain for users in different locations. Though
algorithm 3 maximizes the sum-rate capacity, it does not
nearly meet any requirements of users rate ratio.
Algorithm 2 is not taken into consideration in comparison
because of its worst fairness. Therefore, the proposed
algorithm improves system fairness greatly, which can
improve the spectral efficiency on the cell edge and
guarantee the rate demands of all users.
Figure 6. Systemfairness when the number of users is 3
Figure 7. Systemfairness when the number of users is 5
Figure 8. Systemfairness when the number of users is 8
Figure 9. Relative bit latency when the number of users is 3, 5, 8
From Fig. 9, we can see that when the number of
users increases, the proposed algorithm has the slowest
growth in relative bit latency. This is because our
algorithm considers the fairness among users and gives
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1615
2011 ACADEMY PUBLISHER
each user a relatively fair chance to obtain the time-
frequency resource. On the contrary, we can find that
algorithm 3 and algorithm 4 only pursue high sum
rates, and give all the resources to the users with good
channel condition. Thus, they fail to meet the
requirements of other users.
From the perspective of computational complexity,
the proposed algorithm requires numerical iteration.
During initial power allocation before subcarrier
allocation, there is a numerical iteration. During multi-
cell power allocation based on cooperative game theory,
there is also a numerical iteration. Thus it has higher
algorithm complexity compared with equal power
allocation.
As sub-carrier allocation is considered, compared with
the algorithm which allocates sub-carriers first and then
exchanges sub-carriers, the proposed matrix-searching
algorithm reduces complexity actually. However, the
initial power allocation requires numerical iteration,
which is more complex than sub-carrier allocation itself.
Thus compared with the sub-carrier allocation algorithm
with equal initial power, the proposed algorithm is a little
more complex. But its performance is improved greatly
since users in different location of a cell have different
channel gain.
As power allocation is considered, although equal
power allocation is the simplest, it is not used in reality
for its inability to meet the requirements of users rate.
Compared with algorithm 3, the proposed algorithm
indicates that sub-carrier allocation in single cell has
approximated the system optimal solution to some extent.
And in multi-cell cooperation, the strategy set of game
theory needs to be changed only when the system utility
is increased and the number of cells with power gain is
bigger than that with power loss. Thus, the complexity
caused by the proposed resource allocation is lower than
that by Water-filling power allocation in algorithm 3.
In general, the initial power allocation in sub-carrier
allocation and the multi-cell power allocation based on
cooperative game theory both have the linear complexity
( ) O k
(where k is the number of users). Although this
algorithms complexity is higher when compared with
that of ideal resource allocation algorithms such as equal
power allocation, it is actually decreased when compared
with some currently applied algorithms such as Water-
filling. Furthermore, the proposed algorithm is only
slightly worse than algorithm 3 in terms of system
capacity. And it is the best in terms of system fairness.
. CONCLUSION
This paper proposes a resource allocation algorithm
based on game theory in a centralized multi-cell OFDM
system, including the matrix-searching based subcarrier
allocation algorithm in single cell and the joint power
allocation algorithm in multi-cell based on cooperative
game theory.
The proposed algorithm is compared with other three
traditional resource allocation algorithms from the
perspective of system capacity, fairness and complexity.
The results show that the proposed algorithm achieves a
good tradeoff between system throughput and fairness.
And its complexity is reduced compared with the multi-
cell water-filling algorithm which achieves highest
throughput. Furthermore, it can nearly satisfy the
requirements of users rate ratio and the users on the cell
edge can get a significant spectral efficiency gain.
ACKNOWLEDGMENT
This work was supported by the National Science and
Technology Major Project of China under Grant
2010ZX03002-007, Sino-Finland International
Cooperation Project under Grant 2010DFB10410,
Shanghai Science and Technology Committee under
Grant 09511501100, the Opening Project of Shanghai
Key Laboratory of Digital Media Processing and
Transmission and National Natural Science Foundation
of China under Grant 61073153.
REFERENCES
[1] Han Tao. Spectrum Allocation Technology Based on
Game Theory in Cognitive Radio Networks, Doctor thesis:
Beijing University of Post and Telecommunications, 2009,
pp. 48-67.
[2] Wong C. Y., Tsui C. Y., Cheng R. S., et al. A real-time
subcarrier allocation scheme for multiple access downlink
OFDM transmission, Proceedings of IEEE VTC.
Amsterdam, Netherlands, 1999: 11241128.
[3] Kim Keunyoung, Kim Hoon and Han Youngnam, et a1.
Iterative and greedy resource allocation in an uplink
OFDMA system, Proceedings of IEEE PIMRC.
Barcelona, Spain, 2004: 23772381.
[4] Kim Keunyoung, Kim Hoon and Han Youngnam.
Subcarrier and power allocation in OFDMA systems,
Proceedings of IEEE VTC 2004. Los Angeles, California,
USA, 2004: 10581062.
[5] Choe Kwang Don, Lim Yeon J u and Park Sang Kyu.
Subcarrier allocation with low complexity in multiuser
OFDM systems, Proceedings of IEEE MILCOM 2008.
Monterey, CA, United States, 2004: 822826.
[6] Zhang Guopeng. Research on Resource Allocation and
Cooperative Mechanism in Wireless Networks Based on
Game Theory, Doctor thesis: Xidian University, 2009, pp.
62-84.
[7] Xu Wenjun. Resource Allocation Strategies Study in
Broadband Wireless Communication System, Doctor
thesis: Beijing University of Post and Telecommunications,
2008, pp. 54-90.
[8] Shen Zukang, Andrews J . G., Evens B. L.. Adaptive
resource allocation in multiuser OFDM systems with
proportional rate constraints, IEEE Transactions on
Wireless Communications, 2005, 4(6): 2726~2737.
[9] Hu Yahui. Researches on Radio Resource Management in
MIMO OFDM Systems, Doctor thesis: Beijing University
of Post and Telecommunications, 2009, pp. 45-78.
[10] Goodman D., Mandayam N. Power control for wireless
data, IEEE Wireless Communications, 2000, 7(2): 48-54.
[11] Saraydar C U, Mandayam N B, Goodman D J . Efficient
power control via pricing in wireless data networks, IEEE
Transactions on Communications, 2002, 50(2):291-303.
[12] Saraydar C U, Mandayam N B, Goodman D J. Pricing
and power control in a multicell wireless data network,
1616 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
IEEE Journal on Selected Areas in Communications, 2001,
19(10): 1883-1892.
[13] Alpcan T, Basar T, Dey S. A power control game based
on outage probabilities for multicell wireless data
networks, IEEE Transactions on Wireless
Communications, 2006, 5(4):890-899.
[14] Sarma G, Paganini F. Game theoretic approach to power
control in cellular CDMA, IEEE VTC. ORlando, FL,
United States, 2003. pp. 6-9.
[15] Zhong Chong-xian, Li Chun-guo, Yang Lu-xi. Dynamic
Resource Allocation Algorithm for Multi-cell OFDMA
Systems Based on Noncooperative Game Theory, Journal
of Electronics & Information Technology, 2009,
8(31):1935-1940.
[16] H. Kwon, B. G. LEE. Distributed resource allocation
through noncooperative game approach in multi-cell
OFDMA systems, IEEE ICC 2006, Istanbul, J une 2006.
[17] Qiu J ing, Zhou Zheng. Distributed Rescource Allocation
Based on Game Theroy in Multi-cell OFDMA Systems,
International Journal of Wireless Information Networks,
2009, 1(16):44-50.
[18] Wang L, Xue Y S, Schulz E. Resource allocation in
multicell OFDM systems based on noncooperative game,
The 17th Annual IEEE International Symposium on
Personal Indoor and Mobile Radio Communications.
2006.1-5
Ping Wang, born in China, 1978-2-28. He graduated from the
department of computer science and engineering at Shanghai
J iaotong University, China and received Ph. D. degree in 2007.
His major field of study is wireless communication. He joined
the college of electronic and information engineering at Tongji
University in 2007 and now is a lecturer. His current and
previous interests include routing algorithms and resource
management in wireless networks, vehicular ad hoc network
and streaming media transmission.
Jing Han, born in China, 1987-3-7. She graduated from the
department of information and communication engineering at
Tongji University and received B.S. degree in 2009. Her major
field of study is wireless communication. Now she is a graduate
in the department of information and communication
engineering at Tongji University. Her main research interests
are in enhanced MIMO and radio resource management for the
next generation mobile communications.
Fuqiang Liu, born in China, 1963-3-7. He graduated from the
department of automation at China University of Mining and
received Ph. D. degree in 1996. His major field of study is
signal processing. Now he is a professor in the department of
information and communication engineering at Tongji
University. His main research interests are in routing algorithms
in wireless broadband access and image manipulation.
Yang Liu, born in China, 1987-8-10. He graduated from the
department of information and communication engineering at
Tongji University and received B.S. degree in 2010. His major
field of study is wireless communication. Now he is a graduate
in the department of information and communication
engineering at Tongji University. His main research interests
are in relay technologies and radio resource management for
LTE systems.
Jing Xu, born in China, 1975-5-6. Now he is a researcher in the
Shanghai Research Center for Wireless Communications. His
main research interests are in system architecture, networking
and resource allocation in B3G/4G systems.
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1617
2011 ACADEMY PUBLISHER
Expectation Value Calculation of Grid QoS
Parameters Based on Algorithm Prim
Kaijian Liang
School of Application & Technology, Hunan Institute of Engineering, Xiangtan, China
Email: [email protected]
Linfeng Bai
School of Information Engineering, Henan Institute of Science and Technology, Xinxiang, China
Xilong Qu
School of computer and communication, Hunan Institute of Engineering, Xiangtan, China
AbstractFrom the perspective of selecting service by QoS
attributes, a computation method of QoS expectation value,
which is based on Algorithm Prim, was presented to provide
support for selection of service. On the basis of the ability of
service providers, by Algorithm Prim, this method succeeded in
calculating a set of balanced expectation values of QoS.
Selection of service based on these QoS values would be
beneficial to optimization of system resources and protection of
the users of those services. An example with analysis has been
provided to demonstrate the feasibility and effectiveness of the
method.
Index Terms grid computing, service matchmaking, QoS
parameters, algorithm prim
I INTRODUCTION
To network technique as the core, new generation
network computing environment is a hotspot and leading
edge domain of current international research. The goal of
network system construction combining with network
technique is integrating computing facility; storage device;
service facility and instrument fromdifferent place, building
large computing and data processing shoring of foundation
construction and achieving wide range sharing; effective
aggregating and full releasing abased on computing resource;
data resource and service resource on the internet.
Traditional internet achieved the connection of computer
hardware, Web achieved the connection of web page, when
grid try to achieve the connection of all resource, including
computing resource; storage resource; communication
resource; software resource; information resource and
knowledge resource. In Service-Oriented network
environment, discovering and selection is a very important
link, in the process, we need not only guarantee the veracity
of service location, but also consider the need of user, so
foundation and selection abased on QOS service arise.
Service foundation is a process which could meet the
need of specific service of user in the network, and achieve
automation and intellectualization. There is not strict divide
between service foundation and service selection, in some
research work, service foundation includes service selection.
Generally speaking, service foundation emphasizes the
process in founding candidate service collection, namely the
way on gaining candidate service, but service selection
emphasizes selecting a suitable service for user from
candidate service collection. In this sense, service foundation
is the preorder step of service selection, as a roughing
process, the result collection is the object of service selection
operating. The size of result collection; the way of gaining;
and veracity have direct effect on service selection strategy.
If it adopts a very strict standard for the need of all users in
service foundation, service selection has to do nothing, and
vice versa.
Fromthe view of user, they always want to find an
optimal service, but they cant, owing to price; service; times;
factors. Practical application, users take two sides into
consideration: one side, meeting the need of QOS with a
better cost performance; on the other side, different user has
different attention on the attributive of QOS except satisfy
basic QOS needs, some have specific requests on service
price, others may pay attention to the response time of
service or creditworthiness. So you say, when taking the two
side into consideration, aiming at specific user, it is user to
measure the satisfaction of QOS needs at last. User has
different attention on the attributive of QOS; it is the QOS
needs predilection of user. But in practical application, it is
very important to study the QOS needs predilection to
satisfy QOS needs of specific user.
It has important significance to realize the potential value
of gridding service resource. In microeconomics, price is the
effective lever of adjusting supply-demand relations between
consumer and commodity. In foreseeable gridding
technology application domain, there are many similarities
between behaviors of gridding service with user and
consuming behavior of commodity in market with consumer.
Combining microeconomics theory, it can introduce market
mechanism into paid service of gridding service. In fact,
paid service itself reflect an effective mechanism that
configures gridding service resource, and it is beneficial
for the whole operation order of gridding environment.
Because lever of price can have an effect guiding impact on
1618 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
doi:10.4304/jnw.6.11.1618-1624
network using behavior in different user, as a adjusting user
incentive mechanismin using gridding service resource,
despite paid service cannot distribute resource and restrain
its behavior explicitly when facing limitless condition at
present network environment. Therefore, this paid use in
gridding service has benefit property in commercial
activity.
Because of basic mechanismsustained by QOS attribute,
it can configure, discover, select, distributes resource on the
basic of QOS attribute. In current many system, not only
grid system, but also distributed systemand Peer-to-Peer
system, all its introduce SLA mechanism, which can
describe QOS information resource and bind specific
application. Some researcher introduces Service data into
grid service, which can be used to describe a kind of grid
service information including QOS information. G-QOSM
base on OGSA, provides a QOS management model facing
service, and expand grid service description on the
foundation of service data. It sustains resource based on
QOS attribute and service foundation, also the latest GGF
standard, and match OGSA latest standard. QGS in
G-QOSM frame exist in every domain, keeping in touch
with user application program, and catch service request
constrained by QOS parameter. According to the given
parameter, it can find the best matching service and consult
SLA; Base on foundation sign a contract to guarantee user
service quality.
The discovery and selection of service based on QoS
attributes can facilitate the optimization of systemresources
and guarantee the quality of customer service, which has
been a hot research topic in grid computing. Moreover, it is
also an issue to be sorted out for the application and
commercialization of grid computing. In the commercialized
environment of service-oriented grid application, the users
will consider their own benefit and efficiency while using
the service. Whereas among a number of candidate services,
the way users determine the equilibriumrequirements of
QoS appears critical as equilibriumrequirements of QoS
have a direct impact on QoS matchmaking parameters and
the selection of services. Therefore, it is essential for users to
present the expected value of QoS parameters and method of
computing [1].
Up to date, similar researches have focused on models of
service selection and algorithm, esp. establishing effective
and applicable models combining closely the system
structure so as to improve the systemefficiency [2,3]. As for
Algorithm, the main interest lies in how to improve the
precision and accuracy of algorithm and stress the
effectiveness of computing [4, 5]. About the estimation of
parameter, some researchers have been carried out in related
fields. For example, for the estimation of network
performance, the reference literature no.6 [6] as listed has
proposed a method of computing which can be used to
estimate path capacity, on the basis of algorithmwhich can
deal with the estimation of capacity of end-to-end single
congestion path and available bandwidth. In literature no.7
as listed [7] the estimation method of discrete wavelet
transformis applied in the research of synthesized business
flow of high-speed Internet. The related work in parameter
estimation has received adequate attention and plays a great
role in the corresponding fields. These achievements have
provided useful theoretical basis and method for selection of
service and protection of quality of user service, although the
function of QoS requirements was ignored and no specific
result was achieved. For the optimization of system
resources and protection of the user service efficiency, this
paper will study how to calculate the value of expected QoS
parameters.
Source reservation technology in service-oriented
computing environment is a basic technology for service
quality control, but there are still great difficulties. On one
hand, factors such as network environment heterogeneity,
the breadth of distributed independent nodes, node
management and complexity of security strategies, etc,
increase the difficulty of resource reservation; on the other
hand, reservation has a lot of key issues to be resolved,
including reservation technical fault tolerance, the validity of
reserved resources, resource sharing to be faced by
reservation, etc, which require valid, reliable and robust
reservation technology; while not increase too much system
expense and ensure not affect the overall system
performance.
With the development of computer science, graph theory
progress at an alarming rate, and it is a major embranchment
in applied mathematics. On one hand, computer science
provides computing equipment for graph theory; on the
other hand, it needs graph theory to describe and solve many
problems in modern sciences practical application. Graph
theory was applied to many domains as a method or tool in
describing the relation of affairs at present, such as computer
science, physics, chemical, operational research, information
theory, cybernetics, network communication, social science,
economic management, military, national defense, and
agriculture and industry production. Primis an important
method to solve the weighted graph shortest or the optimal
path problemin graph theory, and then it can be used to
project decision described by graph theory.
II. QOS PARAMETERS OF GRID SERVICES AND SERVICE
MATCHMAKING
A Efficiency Type Qos Parameters
"Efficiency" is a term used in the field of economic
management and means "income", "interest" originally
[8].Network application should also follow the principles of
"market economy" and commercialized "efficiency" also
exists. As economic grid environment is concerned, owing
to the existing "commodity market, economic laws also
function. Users of service expect not only the basic function
but also others such as the most convenient and safe service
at the minimumcost. Consequently, the users requirement
of QoS is also accompanied by pursuit of "efficiency" and
the QoS attributes of service also include the consideration
of "benefit. Both parties of supply and demand of the
service follow the rules of market economy for QoS
matching parameters. On the users side, economic benefits
constitute the prior consideration, of which service the price,
response time may be included in the cost efficiency type
QoS parameters which the smaller, the better. But, for other
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1619
2011 ACADEMY PUBLISHER
QoS parameters such as credit and reliability which can be
listed in economic and social efficiency type parameters, the
bigger value is preferred..
Classification of grid QoS parameters on the basis of
efficiency has it practical value in application. Based on
efficiency, users can carry out their calculation of QoS
parameters matchmaking by means of certain effective
algorithmwhen they implement resources discovery and
selection of service so as to decide the most appropriate
service for themselves and get the best efficiency and
provide groundings for the specific service finally. On the
other hand, it also helps keep the balance between supply
and consumption of the resources and improves the level of
optimization of the systemand operational effectiveness of
resources.
B Service Matchmaking based on Qos Attributes
In the service-oriented computation, a unified port can be
abstracted from service for designated access to various
resources including computation, storage and network. In
practical implementation a unified service port can be
formulated hidden to users. For example, a computation
service can be done by a single or multiple processing
machines, of which details need not directly be expressed in
the service contract. In other words, the granularity of
service function is changeable and the function can be
realized by a single host or distributed system[9]. It thus
provides a possibility that the QoS attributes are made as a
part of the port so that the systemcan select among services
based on QoS attributes, which makes it easier to ensure the
QoS requirements of users.
To make service discovery and selection based on the
attributes of QoS, it is required to establish the QoS attribute
set for each service and determine the corresponding QoS
parameters. When the user applies for service, firstly they
are supposed to declare their QoS requirements, then the
systemcan make matchmaking calculations according to the
candidate QoS service attributes to discover the service to
satisfy the requirements. To be specific, it is to match the
QoS parameters of the service with the required parameter of
the user. In this way the quality of customer service is
ensured [9].
One of the ways to establish QoS attributes is to extend
WSDL&UDDI. The purpose of extending WSDL is to
better describe service, add QoS attributes to the description
of WSDL and expand the service attributes. For instance, a
new genre of service QoS can be added to WSDL [10] to
describe the various QoS attributes of service. Meanwhile,
corresponding extend is also supposed to be implemented in
UDDI so that when the service in the UDDI is published,
users can discover and select service according to QoS
attributes. With the support of the service discovery and QoS
attributes, we guarantee the demand of users for QoS more
closely.
There are three functions in pretreatment of data named
standardization: firstly, comparing size by different type
attribute value. If QOS attribute data is different, weight
comparison would not express easily. Secondly, the not
dimension, if the QOS attribute dimension is different,
attribute would not common measure. Even the same
attribute, it may use different prickle, then the different
numerical value. In various kinds multiple target assessment
method, assessing require remove the effect of dimensional
selection on assessment result, this is the not dimension. It
tries to eliminate dimension, reflecting the good or bad of
attribute value with only the size of numeric. Thirdly, the
normalization, different type attribute value numerical value
size is different in the primary attribute value table, putting it
into the interval between 0 to1.
Besides, it also could solve the incomplete compensatory
by nonlinearity transformation or other methods in
pretreatment of data. There are many data preprocessing
method, including linear transformation, standard 0-1
transformation, vector standardization, and so on. This text
adopts the following method to dispose.
III ESTIMATION OF QOS PARAMETER EXPECTATION VALUE
BASED ON PRIM ALGORITHM
A The Prim Algorithm of Minimum Spanning Tree
(MST)
The minimumspanning tree of the graph can be obtained
by means of primalgorithmin an undirected connected
graph. This algorithm, like Kruscal algorithm, is also widely
used in multitudinous domains such as network, civil
engineering and so on to solve many practical problems
[11].
Kruscal is a very mature arithmetic in graph theory, it can
evaluate shortest path tree in a weighted undirected
connected graph. According to limbic weight number
compositor from a small beginning into a force, it
investigates each side of graph G side collection T. If the
been investigated two peaks belong to two different
connected component, then putting this side into the selected
side collection TE, meanwhile, connecting two connected
component to one connection component; else rounding this
side. And so on, if the connected component number of T is
1, this connected component is one of G minimum
spanning tree.
PrimAlgorithmsuppose an undirected connected graph is
( , ) G V E = , the two tuples represents the set of points and
edge set respectively, then the minimumspanning tree of
G is ( , ) T U TE = . The basic idea of primalgorithmis: the
initial status is
0
{ } U v = , {} TE = and then repeat
execution of the following operations: among all sides of
u U ,v V U , find a side of minimumcost( , ) u v and
merge it into the collection TE and at the same time merge
v
into U until U V = . Then in TE there must
be 1 n sides. T makes the MinimumSpanning Tree. The
specific algorithmpseudo-code is described as :
1. Initialization:
0
{ } U v = (
0
v means any vertex in V );
{} TE = ;
2. The cycle stops until
U V =
(1)Among all sides of
u U
,
v V U
find a side
of minimumcost
( , ) u v
;
(2) {( , )} TE TE u v = + ;
1620 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
(3) { } U U v = +
Obviously, the key in Prim algorithm is to find the
shortest side to connect
U
and
V U
to expand the
spanning tree
T
. The spanning tree selected in this way
bears the minimum overall weights. With regard to the
efficiency type QoS parameters of users in grid computing
environment, by means of proper method of modeling, these
parameters can be converted to the form of undirected
connected graph, by primalgorithm, a spanning tree of
minimumoverall weights can be produced. Namely, a QoS
parameter expectation value which keeps equilibrium
between both parties of demand and supply can be obtained.
B Efficiency Type QoS Parameter Modeling
The description of candidate services shows the ability of
the QoS grid services to provide users service. For the
modeling of QoS parameters by graph structure, the QoS
parameters of the values in same or different types should be
unified by a common measure standard. Owing to the
difference in the capacity of each candidate service, there
can be a huge discrepancy in the values so that
standardization is necessary for the unification of the
measure.
Definition 1.The standardization of numeric QoS
parameters. It includes QoS Parameters such as service price,
response time etc. If a numeric QoS parameter is
i
q ,i Z ,
and the corresponding stanardized one is
'
i
q then
'
1
n
i i i
i
q q q
=
= , , i n Z
Definition 2. Standardization of ratio type QoS
parameters. It includes QoS Parameters such as reliability,
credit etc. Suppose a ratio type QoS parameter is
i
q ,i Z
and the corresponding stanardized one is
'
i
q , then
'
1
(1 )
n
i i i
i
q q q
=
= , , i n Z
The following theoremwill show many a side will require
at least n vertexes to forma single connected complete graph
or dense graph.
Theorem1. Suppose there are e sides and n vertexes are
required to construct a single connected complete graph or
dense graph. And n satisfies:
1 8 2 1 n e = + +
Proof: Mathematical induction is applied. When 1,2 e =
and by calculation with the theorem, we get
2 n = , 3 n = Obviously it is true. Suppose e k = and at
least
1 8 2 1 n k = + +
vertexes is required; when
1 e k = +
( k Z ),
'
1 8( 1) 2 1 n k = + + + ,
Obviously, there is:
1 8( 1) 2 1 1 8 2 1 k k + + + + +
within, and
1 8( 1) 1 8 8 9 8 1 2 k k k k + + + = + +
when the equality establishes, a single noncomplete
connected graph is formed,
'
n n =
, only one side is to be
added to the original graph; if the equality doesnt establish,
then definitely
'
1 n n = +
, when one side and one vertex are
added to the original graph, a single connected dense graph
is formed. The above two situations conformto the reality.
Q.E.D
Definition 3. Weighted-edge of QoS. It represents QoS
parameter and the weight of the side is the standardized
value of such QoS parameters.
Definition 4. Single connected graph of QoS attributes.
The single connected graph made with certain type of QoS
( , )
QoSType
G V E =
, E stands for the collection of QoS
weighted edges, v stands for the collection of vertexes
related to the QoS weighted side. If , E e e Z = , then
1 8 2 1 V e = + + .
Among the multiple candidate services, definition 4
establishes an association model for the related QoS
attributes and each QoS parameters are closely related to
each other via the model which thus provided basis and
groundings for examining the relationships between those
QoS parameters.
C Estimation of QoS parameter expectation value
By establishing the QoS attributes single connected graph
of each candidate service by Definition 4, with Prim
algorithm we can get a spanning tree of minimum
dissipation value, namely, the QoS parameter expectation
value of minimumdissipation value, which can be used for
next stage of selection of service .
Suppose the candidate service collection is
{ }
i
S s i Z
+
=
and the corresponding QoS attributes collection is:
{ [1, ], [1, ]}
s ij
QoS q i n j m =
in which j actually stands for the type of QoS attributes.
Here follows the actual method of calculation:
While
j m
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1621
2011 ACADEMY PUBLISHER
do
{
1. Establish the QoS attributes single connected graph
j
G
of E n = ;
2. Initialize j
G
:
0
{ } U v = , {} TE = ;
3. The cycle goes on until U V =
(1)Among all sides of u U ,v V U find a
side at minimumcost( , ) u v ;
(2) {( , )} TE TE u v = + ;
(3) { } U U v = + ;
4. To get the expectation value of j
q
,
1 1
( )
T E
n
j ij k
i k
E q q e T E
= =
=
Or
1
( ) 1
TE
j k
k
E q e TE
=
= ,
in which
k
e TE .
}
Then the targe QoS parameter expectation value is:
( ) ( ( ) [1, ])
s j
E Q o S E q j m =
As far as the density of probability is concerned, a random
collection of candidate services can reflect the equilibrium
distribution of the service provider. Consequently, with such
capacity of service provision, users can estimate the
reasonable QoS expectation parameter value by means of the
above mentioned method and use it as the groundings for
discovery and selection of service so that then they can
maximize the efficiency while using the paid service.
IV THE EXAMPLE AND ANALYSIS
Hereby lets demonstrate the process of the specific
method of calculation and compare it with Kruscal algorithm
[11] used for solving the same problem. Suppose
{ [1,5]}
i
S s i = ,
{ [1,5], [1,3]}
s ij
QoS q i j =
,
in which
1 i
q
2 i
q
3 i
q stand for service price
(currency unit), response time (millisecond) and reliability
(percentage). Specific value will be offered in the following
matrix:
100 10 82
120 20 90
150 16 80
80 30 92
200 6 88
s
QoS
=
According to the above-mentioned method of QoS
parameter expectation value, we firstly process the data by
standard and get the following matrix of numbers:
'
0.15 0.12 0.26
0.18 0.24 0.15
0.23 0.20 0.29
0.12 0.37 0.12
0.31 0.07 0.18
s
QoS
=
Following that the QoS attributes single connected graph
1 1 1
( , ) G V E = of
1 i
q and weighted edge sets
1
{0.15, 0.18, 0.23, 0.12, 0.31} E = are established.
By Theorem1 we can get that
1 1
1 8 2 1 4 V E = + + = .
If the initial status is
1
{} TE = then the status of the
selected set of points U and edge set
1
TE are
0 3
{ , } U v v = ,
1
{0.12} T E = ;
0 3 1
{ , , } U v v v = ,
1
{0.12, 0.15} T E = ;
0 3 1 2
{ , , , } U v v v v = ,
1
{0.12, 0.15, 0.18} T E = .
Finally the first QoS expectation value
1
1 1
1 1
( ) 98
T E
n
i i j k
i k
E q q e T E
= =
=
is
obtained. Similarly, we can get
2
2 2
1 1
( ) 11
TE
n
i ij k
i k
E q q e TE
= =
=
,
3
3 3
1
( ) 1 0.85
T E
i k
k
E q e T E
=
= .
Thus we can get QoS expectation value
( ) (98,11, 0.85)
s
E Q o S =
Fromthe above example, it is obvious that the calculation
method has disposed of some QoS parameters and provided
equilibriumcombination value of low dissipation. It actually
has reduced the range of service selection, the load of
calculation in matchmaking and clarified the target of
matchmaking, which makes the process of service selection
more precise and accurate. The time complexity of Prim
algorithmis
2
( ) n , whereas that of Kruscal algorithmis
1622 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
2
( (log )) e e . The time complexity of the former seems
worse than the latter but each has its own advantage.
Although the calculation of QoS parameter based on Prim
algorithmhas no advantage in time complexity, it is more
appropriate for the calculation of minimumspanning tree of
dense graph, which is very similar to the QoS parameter
model. In contrast, the calculation of QoS expectation
parameter value based on Kruscal algorithmis more suitable
for spares graph. In fact, the results of the two methods of
calculation are very close to each other, although it has
strengths in time complexity.
V CONCLUSION
From the perspective of selecting service by QoS
attributes, a computation method of QoS expectation
parameter value based on AlgorithmPrimis presented in
order to provide support for selection of service, which is
beneficial to the optimization of resource consumption and
the protection of customers efficiency in use. The
achievement expressed this article provides a useful
perspective and method for selction of service and QoS
gurantee and therefore bears significant value in both theory
and practice.At the next stage our research will concentrate
on combining the method of computation of QoS
expectation parameters value with the effective selection of
service, testing and assessing its efficiency correctly.
Service foundation is a process which could meet the
need of specific service of user in the network, and achieve
automation and intellectualization. There is not strict divide
between service foundation and service selection, in some
research work, service foundation includes service selection.
Generally speaking, service foundation emphasizes the
process in founding candidate service collection, namely the
way on gaining candidate service, but service selection
emphasizes selecting a suitable service for user from
candidate service collection. In this sense, service foundation
is the preorder step of service selection, as a roughing
process, the result collection is the object of service selection
operating. The size of result collection; the way of gaining;
and veracity have direct effect on service selection strategy.
If it adopts a very strict standard for the need of all users in
service foundation, service selection has to do nothing, and
vice versa.
Because of basic mechanismsustained by QOS attribute,
it can configure, discover, select, distributes resource on the
basic of QOS attribute. In current many system, not only
grid system, but also distributed systemand Peer-to-Peer
system, all its introduce SLA mechanism, which can
describe QOS information resource and bind specific
application. Some researcher introduces Service data into
grid service, which can be used to describe a kind of grid
service information including QOS information. G-QOSM
base on OGSA, provides a QOS management model facing
service, and expand grid service description on the
foundation of service data. It sustains resource based on
QOS attribute and service foundation, also the latest GGF
standard, and match OGSA latest standard. QGS in
G-QOSM frame exist in every domain, keeping in touch
with user application program, and catch service request
constrained by QOS parameter. According to the given
parameter, it can find the best matching service and consult
SLA; Base on foundation sign a contract to guarantee user
service quality.
With the development of computer science, graph theory
progress at an alarming rate, and it is a major embranchment
in applied mathematics. On one hand, computer science
provides computing equipment for graph theory; on the
other hand, it needs graph theory to describe and solve many
problems in modern sciences practical application. Graph
theory was applied to many domains as a method or tool in
describing the relation of affairs at present, such as computer
science, physics, chemical, operational research, information
theory, cybernetics, network communication, social science,
economic management, military, national defense, and
agriculture and industry production. Primis an important
method to solve the weighted graph shortest or the optimal
path problemin graph theory, and then it can be used to
project decision described by graph theory.
The discovery and selection of service based on QoS
attributes can facilitate the optimization of systemresources
and guarantee the quality of customer service, which has
been a hot research topic in grid computing. Moreover, it is
also an issue to be sorted out for the application and
commercialization of grid computing. In the commercialized
environment of service-oriented grid application, the users
will consider their own benefit and efficiency while using
the service. Whereas among a number of candidate services,
the way users determine the equilibriumrequirements of
QoS appears critical as equilibriumrequirements of QoS
have a direct impact on QoS matchmaking parameters and
the selection of services. Therefore, it is essential for users to
present the expected value of QoS parameters and method of
computing.
ACKNOWLEDGEMENT
Authors gratefully acknowledge the Projects Supported
by Scientific Research Fund of Hunan Provincial Education
Department(09C271, 08A009 and 08B015 ) for supporting
this research.
Project supported by Provincial Natural Science
Foundation of Hunan(10JJ6099)supports the research.
Project supported by Provincial Science & Technology
plan project of Hunan (2010GK3048) supports the research.
This research is supported by the construct programof the
key discipline in Hunan province.
This work was supported by the National Natural Science
Foundation of China (51075138)
REFERENCES
[1] LIANG Quan YANG Yang LIANG Kai-jian. Guarantee and
control of quality of service on grid system: A survey. Control
and Decision, 2007, 22(2): 121~126.
[2] Charles Kubicek. Applying a Stochastic Model to a Dynamic,
QoS Enabled Web Services Hosting Environment. Electronic
Notes in Theoretical Computer Science, 2006, 151(3): 77-95.
[3] Junseok Hwang, Martin B.H. Weiss. Service differentiation
economic models and analysis of market-based QoS
interconnections. Telematics and Informatics, 2008, 25(4):
262-279.
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1623
2011 ACADEMY PUBLISHER
[4] Sanya Tangpongprasit, Takahiro Katagiri, Kenji Kise, et al. A
time-to-live based reservation algorithmon fully decentralized
resource discovery in Grid computing. Parallel Computing,
2005, 31(6): 529-543.
[5] Haibin Cai, Xiaohui Hu, Qingchong L, et al. A novel
intelligent service selection algorithm and application for
ubiquitous web services environment. Expert Systems with
Applications, 2009, 36(2): 2200-2212.
[6] Liu Shi-dong Zhang Shun-yi Qiu Gong-an Sun Yan-fei, An
Improved Path Performance Parameter Estimation Technique
Based on End-to-End Measurements, Journal of Electronics
and Information Technology, 2007, 29(7): 1618-1621.
[7] WAN Jun DOU Wen-Hua LUO Jian-Shu CHEN Ying-Wu,
Discrete Wavelet Spectrum's Characherization and Its
Parameter Estimating for Multifractal Network Traffic,
Chinese Journa of Computers , 2007, 30(1): 18-26.
[8] ZHU Jianming, WANG Yuhong, SUN Baowen,
Development Requirement and Efficiency of E-Government,
an Analysis, Economic Science Press, Beijing, 2009.
[9] Rashid J. Al-Ali, Omer F. Rana, David W. Walker. G-QoSM:
Grid Service Discovery Using QoS Properties. Computing and
Informatics Journal, 2002, 21(4): 363382.
[10] Shaikh Ali A, Rana O, Al-Ali R, et al. UDDIe: An extended
registry for web services. Proceedings of Workshop on
Service-Oriented Computing: Models, Architectures and
Applications SAINT2003, Olando, USA: IEEE CS Press,
2003:1623~1632.
[11] WANG Xiaodong, Design and Analysis of Algorithm. Qing
Hua University Press, Beijing, 2008
[12] Al-Ali R, Laszewski G, Amin K, et al. QoS support for
high-performance scientific applications. In: Proceedings of
the IEEE/ACM 4th International Symposium on Cluster
Computing and the Grid, Chicago IL, 2004. Los Alamitos:
IEEE Computer Society Press, 2004, pp. 134~143.
[13] von Borstel F D, Gordillo J L. Model-based development of
virtual laboratories for robotics over the inter-net. IEEE
Transactions on Systems, Man and Cybernetics, Part A:
Systems and Humans, 2010, 40 (3) :623-634 .
[14] J. MacLaren. Advance reservation: State of art. GGF
GRAAP-WG, See Web Site
at:http://www.fz-juelich.de/zam/RD/coop/ggf/fraap/graap-wg.
html,Last visited: Feb 2005.
[15] Al-Ali R, Hafid A, Rana O, et al.A n Approach for QoS
Adaptation in Service-Oriented Grids [J].Concurrency and
Computation: Practice and Experience
Journal,2004,16(5):401~412.
[16] Liang Quan. Study of Service-Oriented Grid Models,
Strategies and Methods with QoS Guarantee. Beijing: Beijing
University of Technology, 2008
[17] Bruno R. Preiss. Translated by Hu Guangbin, Wang Song, Hui
Min, etc. Data Structures and Algorithms. Beijing: Electronic
Industry Press, 2003.
[18] Awad M K,Xuemin Shen. OFDMAbased two-hop
coopera-tive relay network resources allocation. IEEE ICC 08.
USA: Institute of Electrical and Electronics Engineers, 2008,
pp. 4414-4418.
[19] Shi Jinfa, Jiao Hejun, Sun Jianhui. Research on Collaborative
Design System of small and medium-sized enterprises
forNetworked Manufacturing. Proc.38th International
Conference on Computers and Industrial Engineering. Beijing,
China: 2008, pp. 2146-2153.
[20] F.Y. Zhu. Fractal descriptionA new analysis technique for
information system. Journal of East China University of
Science and Technology, Vol.14, pp.101~103, 1988.
[21] S.X. Qu. The Relation between Fractal Dimenasion and
Entropy. Chinese Journal of High Pressure Physics, Vol.7
No.2, pp.127~132, 2011.
[22] Z.L. Yan, W.H. Qiu, Z.Q. Chen. Evaluation of SystemOrder
Degree as Viewed from Entropy. Systems
EngineeringTheroy & Practice, No. 6, pp. 46~49, 1997.
[23] N. Cheng. Fractal and MIS. Modern Information, No.2,
pp.37~39, 2003.
[24] B. Cheng, H. H. Hu, Z. Wu. Fractal Knowledge Chain
Research in Knowledge Management. Modern Management
Science, No. 9, pp. 58~60, 2005.
[25] J. Wu, S.F. Liu.Entropy Model of Enterprise Knowledge
Metastasis. Statistics & Decision, No. 2, pp.141~143, 2007.
[26] X. B. Li. Entropy-Information Theory and an Analysis of the
Effectiveness of Systems Engineering's Methodology.
Systems Engineering Theory & Practice, No. 2, pp. 38~44,
2010.
Liang Kaijian, Male, Born in August 1965, in Dongkou,
Hunan, PhD, the professor of Hunan Insitute of Engineering.
research direction: intelligent technology, funded by Hunan
Natural Science Joint Fund leader and current vice president
of application technology, manufacturing information in
Xiangtan City Group experts. Research Interests: knowledge
discovery and intelligent technology. In recent years, chaired
the participating countries from the provincial education
department Corky gold key projects and scientific research 4;
published more than 20 academic papers, which were
retrieved included six three.
Linfeng Bai School of Information Engineering, Henan
Institute of Science and Technology, Xinxiang, China
Xilong Qu (1978-), PhD., the associate professor of
Hunan Insitute of Engineering. He graduated from
Southwest Jiaotong University in 2006 and earned the
Doctor degree. His research interesting are networked
manufacturing, agile supply chain, and papers with high
quality, and more than 20 papers are indexed by ISTP and
EI.
1624 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
Web Page Classification using an Ensemble of
Support Vector Machine Classifiers
Shaobo Zhong*
College of Elementary Education, Chongqing Normal University, Chongqing 400700, China
Email: [email protected]
Dongsheng Zou
College of Computer Science, Chongqing University, Chongqing, 400044, China
Email: [email protected]
Abstract-Web Page Classification (WPC) is both an
important and challenging topic in data mining. The
knowledge of WPC can help users to obtain useable
information from the huge internet dataset automatically
and efficiently. Many efforts have been made to WPC.
However, there is still room for improvement of current
approaches. One particular challenge in training classifiers
comes from the fact that the available dataset is usually
unbalanced. Standard machine learning algorithms tend to
be overwhelmed by the major class and ignore the minor
one and thus lead to high false negative rate. In this paper, a
novel approach for Web page classification was proposed to
address this problem by using an ensemble of support
vector machine classifiers to perform this work. Principal
Component Analysis (PCA) is used for feature reduction
and Independent Component Analysis (ICA) for feature
selection. The experimental results indicate that the
proposed approach outperforms other existing classifiers
widely used in WPC.
Index-Terms Web Page Classification, Support Vector
Machine, Ensemble Classifier.
I. INTRODUCTION
With the rapid development of the World Wide Web,
the mass of online text data has grown at very fast speed
in recent years. Information retrieval is facing great
challenge due to the explosion of the network scales.
How to obtain useable information from the huge internet
raw data automatically and efficiently becomes more and
more important than any time before. Researchers have
been actively studying on web mining with various data
in the World Wide Web. They study various fields such
as focused crawler, information extraction, opining
mining, usage mining, information integration, social
network analysis and so on. Search engines and Web
directories are the essential attempts. Actually in each
field, classification is one of the methods that organize
the subject. Classification is a supervised method of
grouping data in a way, that more similar elements come
together in the same group, but clustering is an
unsupervised method that can find hidden relations
among data, which can be used to divide members of a
class to even more related clusters. Usually classification
is done according to some rules such as latent or obvious
analogies among things which are studied. Finding
existent pattern is a complicated procedure because these
patterns are usually hidden and can not be seen obviously.
Therefore, machine learning algorithms are needed for
classification. This makes many researchers focus on the
issue of WPC technology. WPC can deal with the
unorganized data on the web. The purpose of WPC is to
classify the Internet web pages into a certain number of
pre-defined categories.
During the past two decades, many methods have been
proposed for WPC, such as Naive Bayes (NB) classifier
[1], self-organization neural networks [2], Support
Vector Machine [3], etc. Recently some methods attempt
to use some hybrid approach for WPC. For example,
Weimin and Aixin [4] used body, title, heading and meta
text as feature by using SVM and Naive Bayesian
classifier. The result shows that combination of these
features with SVM classifier gives higher efficiency for
web page classification system. Xin J in et al. [5] used
ReliefF, Information Gain, Gain ratio and Chi Square as
feature selection technique for improving the web page
classification performance. Rung-Ching and Chung-Hsun
[6] proposed a web page classification method by using
two types of features as inputs to SVM classification.
The output of two SVM is used as inputs of voting
schema to determine the category of the web page. The
voting improves the performance when compares with
the traditional methods. Fang et al. [7] proposed a web
page classification by using five classification methods.
The output of these SVMs is used as inputs of voting
Manuscript received J uly 1, 2010; revised J anuary 1, 2011; accepted
J anuary 22, 2011.
This work was funded by the Key Project of Chinese Ministry of
Science and Technology (No. 2008ZX07315-001), Major scientific and
technological special project of Chongqing (No.2008AB5038).
*corresponding author: Shaobo Zhong.
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1625
2011 ACADEMY PUBLISHER
doi:10.4304/jnw.6.11.1625-1630
method and picks the class with the most votes as the
final classification result. This method improves the
performance when compared with the individual
classifiers. Zhang et al. [8] presented a web page
categorization based on a least square support vector
machine (LS-SVM) with latent semantic analysis (LSA).
LSA uses Singular Value Decom-postion (SVD) to
obtain latent semantic structure of original
term-document matrix solving the polysemous and
synonymous keywords problem. LS-SVM is an effective
method for learning the classification knowledge from
massive data, especially on condition of high cost in
getting labeled classical examples. The F-value is 98.2%
by using LS-SVM method. Moayed et al. [9] used a
swarm intelligence algorithm in the filed of WPC by
focusing on Persian web pages. Ant Miner II is the used
algorithm. The highest accuracy for News site 1 is 89%.
Hossaini et al. [10] used Genetic Algorithm (GA) for
classification and clustering. The algorithm works on
variable size vectors. At the GA part they combined
standard crossover and mutation operators with K-means
algorithm for improving diversity and correctness of
results. By means of this method they achieved more
accurate classes and defined subclasses as clusters. Their
method shows more accurate results than fixed size
methods. The accuracy rate is about 90.7% and also
overload of unnecessary elements in vectors is bypassed.
He et al. [11] used an approach using Naive Bayes
(NB) classifier based on Independent Component
Analysis (ICA) for WPC. Some other researchers also
addressed this problem [13-22]. However, there is
significant room for improvement of current approaches.
One particular challenge in training classifiers comes
from the fact that the dataset used for WPC is unbalanced
[12] to some extent. The number of one kind of web
pages can be much smaller or greater than another.
Standard machine learning algorithms without
considering class-imbalance tend to be overwhelmed by
the major class and ignore the minor one and lead to high
false negative rate by predicting the positive point as the
negative one[23]. However, the accurate classification of
web page from the minority class is equivalently
important as others. In order to overcome this
disadvantage, a common approach is to change the
distribution of positive and negative sites during training
by randomly selecting a subset of the training data for the
majority class. But this approach fails to utilize all of the
information available in the training data extracted from
the original web pages.
In this paper, a novel approach for WPC is proposed.
Our approach uses an ensemble classifier to deal with
WPC. The novel approach implements an ensemble of
SVM classifiers trained on the natural distribution of
the data extracted from the original web pages. The
ensemble classifier can reduce the variance caused by the
peculiarities of a single training set and hence be able to
learn a more expressive concept in classification than a
single classifier. In addition, PCA algorithm is used for
feature reduction and ICA algorithm for feature selection.
The experimental results indicate that the proposed
approach was indeed providing satisfactory accuracy in
web page classification.
This paper is organized as follows. Section II focuses
on the method. Section III describes the experiments.
The conclusion and future work are discussed in Section
IV.
II. METHODS
The process of WPC consists of web page retrieval
processing, stemming, stop-word filtering, the weight of
regular words calculating, feature reduction and selection,
and finally the document classification using ensemble
classifier. In web page retrieval phase, we will also
retrieval the latest news web pages category form the
Yahoo.com, and store them in our local databases
according to Ref. [4]. In this way out research work can
be compared with previous efforts.
A. Web Page Representation
It is difficult to carry on the WPC directly because the
words in web documents are huge and complex. In this
paper, we extract character words constitutes eigenvector
with Vector Space Model (VSM), which is considered as
one of most popular model for representing the feature of
text contents. In this model, each document is tokenized
with a stop-word remover and Porter stemming [24] in
order to get feature words used as Eigen values. Finally
the documents are projected to an eigenvector, as follow:
)) ( , , ); ( , ); ( , ( ) (
2 2 1 1
d w t d w t d w t d V
n n
L = , (1)
Where
i
t denotes the
i
-th keyword and ) (d w
i
is the
weight of
i
t in document d .
B. Weight calculation
One obvious feature that appears in HTML documents
but not in plain text documents is HTML tags. The
information derived from different tags bear different
importance. For example, a word present in the TITLE
element is generally more representative of the
documents content than a word present in the BODY
element. So, according to the HTML tags in which the
terms are included in, we defined a new method of
weight calculation as follows:
( ) ( ) [ ] ) , ( )
~
, (
2
1
) (
i j j j
d t d t W d W + = (2)
where )
~
, ( d t W is the weight of t in document
d
~
according to frequency of words appeared in the
HTML documents.
1626 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
+
+
=
d N
t
t
j
n
N
d t tf
n
N
d t tf
d t W
2
) 01 . 0 log( )
~
, (
) 01 . 0 log( )
~
, (
)
~
, (
(3)
where, )
~
, ( d t tf is the frequency of
t
in
document d
~
. N is the number of total documents.
And
t
n
is the number as documents in which
i
-th
keyword appears. ) , (
i j
d t is the location of the words
appeared in the HMTL document as following functions
( )
=
k
e
i k j k i j
d e t TF e d t ) , , ( ) ( ) , ( (4)
Where
k
e is an HTML element, ) (
k
e denotes the
weight assigned to the element
k
e and
) , , (
i k j
d e t TF denotes the number of times term
j
t is
present in the element
k
e of HTML page
i
d . We
define the function ) (
k
e as:
=
elsewhere
TITLE or META is e if
e
. 1
,
) (
(5)
where, 6 , 5 , 4 , 3 , 2 = were tested and compared with
standard ) , (
i j
d t TF .The experimental results showed
that using ensemble classifier can obtain the best results
while the value of
equals 6.
C. Feature reduction
The method presenting feature words will generally
create multidimensional datasets. PCA is certainly the
most widely used method for multivariate statistical
analysis. It reduces data dimensionality by performing a
covariance analysis between factors. As such, it is
suitable for datasets in multiple dimensions. The
efficiency of the filter approach of PCA is relatively high.
According to the different processing manners, PCA can
be divided into data method and matrix method. We
choose matrix method, and represent the training sample
in the form of document-lemma matrix
n m ij
w
= ) ( ,
where covariance is the weight of terms existing in the
set of documents. All data which calculated the variance
and covariance are represented in matrix. Then, get the
eigenvectors of the covariance matrix, which are
corresponding to the main component of the original data.
We selected the first-used eigenvectors m ,
the herein, as eigenvectors is 100, 200, 400, etc. The
principal components set is n matrix
=
n ij
M ) (l ,
where
ij
l is the eigenvectors being extracted out of the
reduced state from original data size
n m
to data
size n . The complete analysis of the PCA method
used in this paper is given in Ref. 25 and Ref. 26.
D. Feature selection
Independent Component Analysis (ICA) [27] is a
novel statistical signal and data analysis method. The
purpose of ICA is to linearly transform the original data
into components which are as much as statistically
independent [28]. The task of ICA is to find Separation
matrix W to make Wx y = where
T
N
y y y y ) , , , (
2 1
L = is called output variable, and
T
N
x x x x ) , , , (
2 1
L = is an observed random variable. If
i
y is mutually independent, then
i
y is the estimated
value of an independent random
variable
T
N
s s s s ) , , , (
2 1
L = . It can be seen as an
extension of PCA towards higher order dependencies.
E. An ensemble of SVM classifiers
Support vector machine
Support vector machine (SVM)classifier, motivated by
results of statistical learning theory[29][30], is one of the
most effective machine learning algorithms for many
complex binary classification problems .Given the
training set
{ }
l
l l
Y X y x y x y x T ) ( ) , ( , ), , ( ), , (
2 2 1 1
= L
when
the penalty factor
C
and kernel function
(.,.) K
are
selected properly, we can construct a function
b x x K x x K x g
X x i
i i
X x i
i i
+ =
+
: :
) , ( ) , ( ) ( , (6)
where the non-negative weights
i
and b are
computed during training by solving a convex quadratic
programming. In order to estimate the probability of an
unlabeled input
x
belonging to the positive
class, ) | 1 ( x y P = , we map the value ) (x g to the
probability by (Platt, 1999)
)] ) ( * exp( 1 /[ 1 )) ( ( ) | 1 Pr(
,
B x g A x g P x y
B A
+ + = = = (7)
Where A andB are then obtained by solving the
optimization problem
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1627
2011 ACADEMY PUBLISHER
{
l i xi g P pi
y if N
y if N N
ti t s
p t p t z F
B A
i
i
l
i
i i i i
B A z
L , 2 , 1 )), ( (
, 1 ) 2 /( 1
, 1 ) 2 /( ) 1 (
. .
)) 1 log( ) 1 ( ) log( ( ) ( min
,
1 ) , (
= =
= +
+ = + +
=
+ = =
+ +
= =
(8)
Where
+
N and
N , respectively, represent the
number of positive and negative points in training set.
Then the label of the new input
x
is assigned to be
positive if the posterior probability is greater than a
threshold, otherwise negative, i.e.
1, Pr( 1| )
( )
1.
if y x threshold
f x
otherwise
= >
=
(9)
where 1 corresponds to positive class, whereas -1
corresponds to negative class.
An ensemble of SVM classifiers
An ensemble of SVM classifiers is a collection of
SVM classifiers, each trained on a subset of the training
set (obtained by sampling from the entire training points)
in order to get better results [31]. The prediction of the
ensemble of SVMs is computed from the prediction of
the individual SVM classifier, that is, during
classification, for a new unlabeled input
test
x ,the j -th
SVM classifier in the collection returns a
probability ) | 1 (
test j
x y P = of
test
x
belonging to the
positive class, where m j L , 2 , 1 = and
m
is the
number of SVM classifiers in the collection. The
ensemble estimated probability, ) | 1 (
test Ens
x y P = , is
obtained by
=
=
= = =
m j
j
test j test Ens
x y P m x y P
1
) | 1 ( ) / 1 ( ) | 1 ( (10)
Fig.1 shows the architecture of the ensemble of SVM
classifiers.
Figure 1. Architecture of the ensemble classifier fusing m SVM
classifiers. Each one is trained on a balanced subsample of the training
data.
III. EXPERIMENTAL RESULTS AND DISCUSSION
For experimental purpose, we build the dataset in the
similar way as He et al. [11]. We choose the web page
dataset from the Yahoo sports news. The dataset includes
six categories of web pages. They are Soccer, NBA, Golf,
Tennis, Boxing and NFL. The whole set include 3,160
web pages, i.e.880 documents of Soccer, 560 documents
of NBA, 320 documents of Golf, 640 documents of
Tennis,280 documents of Boxing, 480 documents of
NFL. Among the dataset, 2500 documents (about 80%)
selected randomly from different classes were used for
training data, and the remaining other document for test
data.
As for performance measure, the standard information
retrieval measures, such as recall ( r ), precision ( p ),
and F1 ( ) /( 2 1 p r rp F + = ) are used to estimate the
performance of our method. To compare with other
approaches, we have done the classification on the same
dataset by using TFIDF, NB classifier and Hes
improved NB ( denoted as NBICA)[11] .
The experimental results of WPC on our dataset are
shown in Table 1. For the category of Soccer, NBA, Golf,
Tennis, Boxing and NFL, the value of F1 are 91.55%,
92.97%, 94.40%, 92.50%, 94.55% and 93.87%,
respectively. Meanwhile, the overall average of F1
measure is 93.31%. Comparing with NBICA, the overall
F1 value is increased modestly from 92.13 to 93.31% by
1628 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
using our approach. In addition, the F1 value for each
category is relatively stable with our approach. However,
the lowest F1 value is 75.85% for Soccer category while
the highest one is 98.81% for rugby category with
NBICA. The F1 value varies evidently because the sizes
for each category of web pages are unbalanced with
NBICA. As observed from Table1, we can summarize
that this problem is solved with our approach by using an
ensemble of SVM classifiers.
For comparison we used some other methods, such as
TFIDF [32], NB and NBICA for WPC on the same
dataset. The experimental results of WPC are shown in
Table 2. By using TFIDF, NB and NBICA methods, the
overall average F1 value are 81.78, 84.04 and 89.63%,
respectively. Our method of ensemble classifier improves
F1 by 3-11%. These results indicate the superior
performance of our approach over that of some existing
methods for WPC.
TABLE 1.
EXPERIMENTAL RESULT USING ENSEMBLE CLASSIFIER.
Class No. Recall (%) Precision (%) F1 (%)
1.Soccer 90.36 92.78 91.55
2.NBA 95.66 90.42 92.97
3.Golf 96.27 92.6 94.40
4.Tennis 94.50 90.58 92.50
5.Boxing 95.68 93.45 94.55
6.NFL 95.45 92.35 93.87
Average 94.65 92.0 93.31
TABLE 2.
F1 VALUE BY USING DIFFERENT APPROACHES
Class No.
TFIDF
(%)
NB (%)
NBICA
(%)
Ensemble
classifier
(%)
1.Soccer 84.32 85.85 90.25 91.55
2.NBA 83.44 93.56 93.68 92.97
3.Golf 74.37 76.30 84.56 94.40
4.Tennis 85.60 85.81 93.56 92.50
5.Boxing 80.16 78.69 83.40 94.55
6.NFL 82.76 84.05 92.30 93.87
Average 81.78 84.04 89.63 93.31
IV. CONCLUSION
Automated web pages classification, which is a
challenging research direction in text mining, plays an
important role to establish the semantic web. Many
efforts have been made for WPC. However, there is
significant room for improvement of current approaches.
One particular challenge in training classifiers comes
from the fact that the dataset used for WPC is unbalanced
to some extent. Consequently, the F1 value of most
existing methods is unstable. In this article, we have
studied the problem of unbalanced dataset in WPC. We
proposed a novel approach using an ensemble of SVM
classifiers to address this problem. The comparison of
performance among four methods, namely TFIDF, NB,
NBICA and our ensemble classifier, has been presented
in this paper. The experimental results indicate that the
proposed approach could solve the problem well.
Moreover, the F1 value is increased modestly with our
approach.
In future research, we should address to increase the
number of categories to a large extent to observe the F1
value with our approach. Moreover, combined with some
existing algorithms, such as Genetic algorithm, our
method of ensemble classifier can be further improved.
ACKNOWLEDGMENTS
The authors thank the editor and referees for their
careful review and valuable critical comments. We also
thank Prof. He for valuable suggestions and comments.
This work is supported by the Key Project of Chinese
Ministry of Science and Technology (No.
2008ZX07315-001), Major scientific and technological
special project of Chongqing (No.2008AB5038)
Education project of Chongqing Normal
University(080201), The Chongqing Key Research
Base of Humanities and Social Sciences: the Financial
Support from Chongqing Research Center of Elementary
Teacher Education. The authors are grateful for
reviewers who made constructive comments.
REFERENCES
[1] Fan Y., Zheng C., Wang Q. Y., Cai Q. S., Liu J . Web Page
Classification Based on Naive Bayes Method (In Chinese),
J ournal of Software, 2001, pp. 1386-1392.
[2] Zhang Y. Z. The Automatic Classification of Web Pages
Based on Neural Networks. Neural information processing,
ICONIP2001 Proceedings, Shanghai, China, 14-18
November 2001, Vol.2, pp. 570- 575.
[3] Xue W. M., Bao H., Huang W. T., Lu Y. C. Web Page
Classification Based on SVM. Intelligent Control and
Automation, 21-23 J une 2006, vol.2, pp. 6111- 6114.
[4] W. Xue, H. Bao, W. Huan, and Y. Lu, Web Page
Classification Based on SVM, 6th World Congress on
Intelligent Control and Automation, Dalian, China, 2006,
pp. 6111-6114,.
[5] J . Xin, L. Rongyan, S. Xian, and B. Rongfang, Automatic
Web Pages Categorization with ReliefF and Hidden Naive
Bayes, Proceedings of the 2007 ACM symposium on
Applied computing, Seoul, Korea, 2007, pp. 617-621.
[6]Chen R., Hsieh C., and Chen H. Web Page Classification
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1629
2011 ACADEMY PUBLISHER
Based On A Support Vector Machine Using A Weighted
Vote Schema. Expert Systems with Applications, 2006, vol.
31, pp. 427-435.
[7] Rui F., Alexander M., and Babis T. A Voting Method for
the classification of Web Pages. IEEE/WIC/ACM
International Conference on Web Intelligence and
Intelligent Agent Technology Workshops, 2006, pp.
610-613.
[8]Zhang Y., Fan B., Xiao L. B. Web Page Classification
Based-on A Least Square Support Vector Machine with
Latent Semantic Analysis. Fifth International Conference
on Fuzzy Systems and Knowledge Discovery, 2008, pp.
528-532.
[9] Moayed M. J .; Sabery, A. H.; Khanteymoory, A. Ant
Colony algorithm for Web Page Classification. 2008
International Symposium on information technology Kuala
Lumpur, Malaysia, 26-29 August 2008, pp. 8-13.
[10]. Hossaini, Z Rahmani, A. M. Setayeshi, S. Web pages
classification and clustering by means of genetic algorithm:
a variable size page representing approach. 2008
International conference on Computational Intelligence for
Modeling Control & Automation (CIMCA 2008), 10-12
December 2008, pp. 436-440.
[11] He Z. L., Liu Z. J . A Novel Approach to Nave Bayes Web
Page Automatic Classification. Fifth International
Conference on Fuzzy Systems and Knowledge Discovery,
2008, pp. 361-365.
[12] J apkowicz N. The class imbalance problem: significance
and strategies. In: IC-AI2000, Special Track on Inductive
Learning Las Vegas, Nevada, 2000.
[13] Xu S.M., Wu B.,Ma C.. Efflcient SVM Chinese Web page
classifier based on pre-classification. Computer
Engineering and Applications, 2010, pp. 125-128.
[14] Araujo L.,Martinez R.J. Web Spam Detection: New
Classification Features Based on Qualified Link Analysis
and Language Models. IEEE Transaction on information
forensics and security, 2010, Vol. 5 (3), pp. 581-590.
[15] Chen T.C., Dick, S., Miller, J . Detecting Visually Similar
Web Pages: Application to Phasing Detection. ACM
transaction on Internet techonology. 2010, Vol.10 (2), pp.
5
[16] Ofuonye E., Beatty P., Dick S.. Prevalence and
classification of web page defects. Online Information
Review, 2010, Vol. 34 (1), pp.160-174.
[17] Golub K., Lykke M. Automated classification of web
pages in hierarchical browsing. J ournal of documentation,
2009, Vol. 65 (6), pp. 901-925.
[18] Hou C.Q., Jiao L.C. Graph based Co-training algorithm for
web page classification. Acta Electronica Sinica, 2009,
pp.2173-80.
[19] Farhoodi, M., Yari A., Mahmoudi M. A Persian Web
Page Classifier Applying a Combination of Content-Based
and Context-Based Features.International J ournal of
Information Studies, 2009,pp.263-71.
[20] Selamat A., Subroto I.M.I., Choon C. Arabic script
web page language identification using hybrid-KNN
method.International J ournal of Computational
Intelligence and Applications,2009,pp.315-43.
[21] Zhu Z.G., Deng C.S., Kong L.P. Algorithm research on
classifying Web users navigation patterns based on
N-gram .J ournal of the China Society for Scientific and
Technical Information,2009,pp.389-394.
[22] Peng X.G., Ming Z., Wang H.T. WordNet based Web page
classification system with category expansion. J ournal of
Shenzhen University Science & Engineering, 2009,
pp.118-122.
[23] Liu, X. Y., Zhou, Z. H.. The influence of class imbalance
on cost-sensitive learning: an empirical study. In: Sixth
IEEE International Conference on Data Mining
(ICDM06), Hong Kong, 2006.
[24] The Porter Stemming algorithm, http://www.
tartarus.org/~martin/PorterStemmer.
[25] Calvo R. A., Partridge M., J abri M.. A comparative study
of principal components analysis techniques. In
Proceedings 9th Australian Conference on Neural
Networks, Brisbane, QLD1998, pp. 276-281.
[26] Selamat, A., Omatu, S. Neural Networks for Web News
Classification Based on PCA. Proceedings of the
International J oint Conference, 20-24 J uly 2003, vol. 3, pp.
1792 - 1797.
[27] Hyvarinen A., Karhunen J., and Oja E., 2001. Independent
Component Analysis, Wiley-Interscience, New York.
[28] Nacim F. C., Bernard R., Nathalie A.G. A Comparison of
Dimensionality Reduction Techniques for Web Structure
Mining. IEEE/WIC/ACM International Conference on, 2-5
Nov. 2007, pp. 116 119.
[29] Vapnik V., 1995. The Nature of Statistical Learning
Theory. Springer, New York.
[30] Vapnik V., 1998. Statistical Learning Theory. Wiley, New
York.
[31] Dietterich, T.G., 2000. Ensemble methods in machine
learning. In: Lecture Notes in Computer Science, vol. 1857,
pp. 115.
[32] Yang J . P., Honavar V., Miler L. Mobile intelligent agents
for document classification and retrieval: a machine
learning approach. Proceeding of the Eurpoean
Symposium on cybemetics and Systems Research, Vienna,
Austria, 1998, pp.707-712.
Shaobo Zhong was born in Sichuan, P.R. China, in J anuary
24, 1973. He obtained the bachelors the master's degree in
Mathematics and Computer Science of the Chongqing Normal
University, China in 1998, and the doctors degree in College
of Computer Science of the Chongqing University, China in
2008.His research interest includes machine learning, data
mining and web page classification.
1630 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
Integration of Unascertained Method with Neural
Networks and Its Application
Huawang Shi
Hebei University of Engineering, Handan, P. R. China
[email protected]
AbstractThis paper presents the adoption of artificial
neural network (ANN) model and Unascertained system to
assist decision-makers in forecasting the early warning of
financial in China. Artificial neural network (ANN) has
outstanding characteristics in machine learning, fault,
tolerant, parallel reasoning and processing nonlinear
problem abilities. Unascertained system that imitates the
human brain's thinking logical is a kind of mathematical
tools used to deal with imprecise and uncertain knowledge.
Integrating unascertained method with neural network
technology, the reasoning process of network coding can be
tracked, and the output of the network can be given a
physical explanation. Application case shows that combines
unascertained systems with feedforward artificial neural
networks can obtain more reasonable and more advantage
of nonlinear mapping that can handle more complete type of
data.
Index Termsartificial neural network, unascertained
system, financial early warning
I. INTRODUCTION
Unascertained system that imitates the human brain's
thinking logical is a kind of mathematical tools used to
deal with imprecise and uncertain knowledge. Artificial
neural network that imitates the function of human
neurons may function as a general estimator, mapping the
relationship between input and output. Combination of
these two methods can take into account the effect of
complementary effect of each other? Our theoretical
analyses are the following aspects: First, the artificial
neural network is a nonlinear mapping from input to
output; it does not rely on any mathematical model.
Unascertained system also as a nonlinear mapping is to
convert input signals x in domain U into signal y in
domainV as output. Second, artificial neural networks
can only deal with explicit data classification, and not
suitable for the expression of a rule-based knowledge.
However unascertained systems can handle abnormal,
incomplete and uncertain data. Third, the artificial neural
networks knowledge representation and treatment are
simple in form, and hard to the introduction of heuristic
knowledge, and the lower efficiency of the network.
Unascertained system can make use of expertise
knowledge, thus be easy to introduce of heuristic
knowledge that making the reasoning process more
reasonable. Finally, artificial neural network's greatest
strength are memory, learning and inductive functions;
Unascertained system does not have the learning function.
So, in theory, combining unascertained systems with
feed forward artificial neural networks can obtain more
reasonable and more advantage of nonlinear mapping that
can handle more complete and comprehensive type of
data.
The rest of this paper is organized as follows:
Unascertained Number and Algorithm are described in
Section2. Section3 describes Unascertained BP Neural
Networks in detail and gives Network Learning Process.
The experimental results on Unascertained BP Neural
Networks and some discussions are presented in
Section4. Finally, Section5 provides the conclusion.
II. MATERIALS AND METHODS
A. Introduction to Unascertained Number:
1) Definition of Unascertained number:
Unascertained mathematics, proposed by Want [1], is a
tool to describe subjective uncertainty quantitatively. It
deals mainly with unascertained information, which
differs from stochastic information, fuzzy information,
and grey information. Unascertained information refers to
the information demanded by decision-making over
which the message itself has no uncertainty but, because
of situation constraints, the decision-make cannot grasp
the whole information needed. Hence, all systems
containing the behavior factors, such as the problem of
clustering have unascertained property.
Definition 1: Suppose
a
is arbitrary real number,
1 0 < , then definite [ ] ( ) [ ] x a a , , is first-order
unascertained number, where
( )
=
=
R x a x
a x
x
, 0
,
(1)
Note that [ ] a a, express the interval of value, and
( ) x = express belief degree of a . When =1, belief
degree of a is 1. Where =0, belief degree of a is zero.
Definition 1: Suppose [ ] b a, is arbitrary closed interval,
b x x x a
n
= < < < =
2 1
, if
( )
( )
= =
=
other
n i x x
x
i i
, 0
, , 2 , 1 ,
(2)
and
1
n
i
i
=
=
= =
=
other
k i x x
x
i i
, 0
, , 2 , 1 ,
(3)
0
i
1,
i
=1,2,,
k
,
1
k
i
i
i
=
=
.
Then first-order unascertained number :
( ) ( )
1 1
1 1
, ,
k k
i i i i
i i
E A x x x
= =
=
,
( )
=
=
=
other
x x
x
k
i
i i
, 0
1
,
1
(4)
It is expected value of unascertained number A .
When 1 = , as ( ) A E , unascertained number A is
discrete type random variable. When 1 < , ( ) A E is first-
order unascertained number. Where
=
k
i
i i
x
1
1
as
expected value of
A
that belief degree is .
2) Algorithm of unascertained number:
Each unascertained number includes two parts of
probable value and belief degree. So, unascertained
number algorithm also includes two parts. Suppose
unascertained numbers are
A
and
B
.Where
( )
( )
= =
= =
other
m i x x
x f A
i i
, 0
, , 2 , 1 ,
,
( )
( )
= =
= =
other
n i y y
x g B
i i
, 0
, , 2 , 1 ,
(5)
B A C =
also is unascertained number. Probable
value and belief degree of
C
is calculated as follows.
(1)Constituted multiply matrix of probable value of
unascertained number
A
and
B
, where individual is
probable value number series
1
x ,
2
x ,,
k
x
and
1
y
,
2
y
,,
m
y
as
A
and
B
, permute from little to
big.
(2)Constituted multiply matrix of belief degree of
unascertained number
A
and
B
, where individual is belief
degree number
series
1
,
2
,,
m
and
1
,
2
,,
n
are
A
,
B
.Suppose
ij
a
and
ij
b
individual is element of multiply matrix of
probable value of
A
and
B
, here
i
is line of matrix,
j
is
array of matrix. We called
ij
a
and
ij
b
as relevant position
element.
(3)
1
x
,
2
x
,,
k
x
result from multiply matrix of probable
value of unascertained number
A
and
B
, which permute
from little to big. And an equal element is one element of
relevant position element in multiply matrix of belief
degree. Suppose
1
r
,
2
r
,,
k
r
is relevant position element
permutation. Where
( )
( )
= =
= =
other
k i x x r
x C
i i
, 0
, 2 , 1 ,
(6)
Suppose
( ) x C =
is arithmetic product of
unascertained number
A
and
B
. Where
( ) ( )
( )
= =
= = =
other
k i x x r
x g x f B A C
i i
, 0
, 2 , 1 ,
(7)
3) Unascertained membership:
Using Unascertained to describe "uncertain" or
"unclear boundary" phenomenon, the key problem is that
a reasonable Unascertained membership function.
Despite the clear definition rules of the construction
unascertained measure, the definition is non-structural in
nature, and did not give a specific construction method. It
still needs to be in accordance with the background
knowledge in specific areas, known to the measured data
and personal experience of decision-makers, etc.
Under normal circumstances, decision-makers do not
know exactly state of membership function. At this point,
the simplest and most reasonable method is by fitting line
shape of membership function. A standard membership
functions is in Figure 1.
Figure 1. A standard membership functions curve
The class
1
I membership function ( ) x
1
was
expressed by the broken line DI AC
1
; The class
2
I
membership function ( ) x
2
expressed by the broken
line EI OCC
2
; The class
3
I membership function
( ) x
3
was expressed by the broken line FI ODC
3
; The
class
4
I of membership function ( ) x
4
was expressed
by the broken line GI OEC
4
; The class
5
I of
membership function ( ) x
5
was expressed by the
broken line B OFC
5
B. Introduction to ANN
Artificial Neural Networks (ANNs) are composed of
simple elements that imitate the biological nervous
systems. In the last few decades, significant research has
been reported in the field of ANNs and the proposed
ANN architectures have proven the inefficiency in
various applications in the field of engineering. The
structure of a neural network of most commonly used
type is schematically shown in Fig.1. It consists of
several layers of processing units (also termed neurons,
nodes). The input values are processed within the
individual neurons of the input layer and then the output
values of these neurons are forwarded to the neurons in
the hidden layer. Each connection has an associated
1632 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
parameter indicating the strength of this connection, these
called weight.
Figure 1. The single layer of feedforward networks.
The NN model frequently used is multilayer
perceptron learning with error back-propagation. In the
present research work, the sequence with which the input
vectors occur for the ANN straining is not taken into
account, thus they are static networks that propagate the
values to the layers in a feed-forward way. The training
of the neural networks is performed through a back-
propagation algorithm. In General, the back-propagation
algorithm is a gradient-descent algorithm in which the
network weights are moved along the negative of the
gradient of the performance function.
Artificial Neural Network (ANN) is basically as
implied model of the biological neuron and uses an
approach similar to human brain to make decisions and to
arrive at conclusions[7]. Every neuron model consists of
a processing element with synaptic input connections and
a single output. The structure of a neural network of most
commonly used type is schematically shown in figure 1.
Figure. 2 Neural model.
The neuron can be defined as
=
= + =
n
i
j i ij j
x w f X W f y
1
) ( ) (
where,
x
is input signals,
ij
is synaptic weights of
neuron,
f
is the activation function and
y
is the output
signal of neuron. The architecture of multi-layered
feedforward neural network is shown in Fig. 2.
j =(1,2,...,J) i=(1,2,...,I)
xM
xm
x2
x2
yP
yp
y2
y1
wmi
wip wij
Figure 3. The model of BP net
It consists of one input layer, one output layer and
hidden layer. It may have one or more hidden layers. All
layers are fully connected and of the feedforward type.
The outputs are nonlinear function of inputs, and are
controlled by weights that are computed during learning
process.
At present, the BP neural network is one of the most
matures, wide spread artificial neural network. Its basic
network is three-layer feed-forward neural network such
as input layer, hidden layer, and output layer. The input
signals must firstly disseminate for- ward into the hidden
nodeThe output information of the concealment node
transmits into output node Via- function actionFinally
the output variable result is obtainedThe BP network
can realize complex non-linear mapping relations will
fully from input to output and has good exuding ability,
which can complete the duty of complex pattem
recognition
ANN has outstanding characteristics in machine
learning, fault, tolerant, parallel reasoning and processing
nonlinear problem abilities. It offers significant support
in terms of organizing, classifying, and summarizing
data. It also helps to discern patterns among input data,
requires few ones, and achieves a high degree of
prediction accuracy. These characteristics make neural
network technology a potentially promising alternative
tool for recognition, classification, and forecasting in the
area of construction, in terms of accuracy, adaptability,
robustness, effectiveness, and efficiency. Therefore, cost
application areas that require prediction could be
implemented by ANN.
C. Unascertained BP Neural Network
1) Description of unascertained BP network:
Assuming there is N known samples, divided intoK
categories,
k
X represents the k th sample space with
the sample size for
k
N , apparently: N N
K
k
k
=
=1
.
k
i
x represents i th sample (
k
N i 1 ),
so { }
T
k
N
k
i
k k
k
x x x X
) ( ) ( ) (
1
, , , , = . Suppose that each
sample
k
i
x has J characteristics (or indicators), the j th
feature (or indicators) is
j
I , J j 1 .
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1633
2011 ACADEMY PUBLISHER
k
ij
x represents the observation value of sample
k
i
x
with reference to the j th characteristic (or indicator).
2) Unascertained BP neural network structure:
Unascertained BP neural network structure and the
structure of BP neural network is basically the same seen
in Fig.1
Figure 4. The unascertained BP network structure.
The first layer is input layer, the number of nodes is
the same of feature space dimension. The second layer is
hidden layer. The third layer is output layer; the layer
number of nodes is equal to the classification number.
3) The desired membership calculating method:
In the usual artificial neural network, training samples
were divided into specific categories, that is, a sample is
determined belonging to a category. Therefore, training in
the network, its corresponding output node of the desired
output as "1", and the rest of the output node of the
desired output for the "0". However, in practice, data are
often sick, and its classification border is not very
specific, and samples are belonging to categories in
certain degree of membership. Therefore, the desired
output is not simply a two-valued logic, need to calculate
exactly, which leads to uncertainty in the network.
As the input data may be numerical value, also
possible be the degree of membership, the corresponding
desired output, there are differences in the calculation.
The following discussion is made under numerical value
input:
Supposing there are
k
N
=
N N
K
k
k
1
samples in
k th category and category center is
k
O :
T k
J
k
j
k
k
O O O O ) , , , , (
1
=
) 1 , 1 ( K k J j
(8)
Where,
=
=
k
N
i
k
ij
k
k
j
x
N
O
1
1
( )
T
J
K
k
k
j j
O O O O
K
O , , ,
1
2 1
1
= =
=
(9)
( )
2
1
2
1
=
=
K
k
j
k
j j
O O
K
(10)
=
=
J
j
j
j
j
w
1
2
2
(11)
Obviously, 1 0
j
w ,and 1
1
=
=
J
j
j
w . Therefore,
j
w is the indicator j classification weight of given
classification.
Set ) , , , , (
1 iJ ij i i
x x x x = ) 1 ( N i as any
training samples.
When the larger
k i
O x , the farther sample
i
x
away from the center of k th category and its
membership belonging to the k th category be smaller.
On the other hand, when
k i
O x the smaller, the
nearer sample
i
x away from the center of k th category
and its membership belonging to the
k
th category be
larger.
When the larger
j
w , the greater the contribution to
classification of indicator
j
I , that is, the more important
to classification of indicator
j
I On the other hand, when
the smaller
j
w , it shows that the smaller the contribution
to classification of indicator
j
I , that is, the less
important for classification of indicators
j
I . From the
above, we can define the weighted distance of sample
i
x
to the k th class center
k
O :
=
=
J
j
k
j ij j ik
O x w
1
2
) ( (12)
=
+ +
=
K
k ik ik
i k
x
1
1 1
) (
(13)
Obviously, 1 ) ( 0
i k
x , 1 ) (
1
=
=
K
k
i k
x .
Therefore, as ) (
i k
x is unascertained membership of
sample belonging to the k th category, that is, it is the
expectations output of membership degree that we have
to calculate: ) (
i k k
x d =
4) Mathematical derivation of amendment
ij
w :
Supposing that
j
O represent output of the j th node,
i
O express the output of i th node of the relative former
layer and
k
O express output of the k th node of the
relative behind layer
ij
w express connection weights of
the upper layer nodei to this node j :
1634 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
=
i
i ij j
O w net
j
net
j j
e
net f O
+
= =
1
1
) ( (14)
Where,
j
net express the net input of nodes j .
When
j
O is the output of output layer nodes, the
actual output
j j
O y = , Set
j
d is the desired
output of node j : ) (
i
k
j
x d = , then squares
sum error of output are as follows:
= =
j
j j
j
j j i
d O d y E
2 2
) (
2
1
) (
2
1
(15)
The total output error is:
=
i
i
E E (16)
Considering amendments to the weights:
i j j j j
ij
j
j
j
j ij
O O O d O
w
net
net
O
O
E
w
E
=
)] 1 ( [ ) (
(17)
) 1 ( ) (
j j j j j
O O d O =
(18)
i j
ij
O
w
E
=
When
j
O is the output of hidden layer nodes,
j
O affects each node of the lower classes.
Output square error:
=
k
k k
d y E
2
) (
2
1
The actual output of k th node of the output layer:
k
net
k k
e
net f y
+
= =
1
1
) (
=
j
j jk k
O w net
j
net
j j
e
net f O
+
= =
1
1
) (
=
i
i ij j
O w net
Considering amendments to the weights:
i j j jk k k
k
k k
ij
j
j
j
j
k
k
k
k ij
O O O w y y d y
w
net
net
O
O
net
net
y
y
E
w
E
=
) 1 ( ) 1 ( ) (
Set ) 1 ( ) (
k k k k k
y y d y =
Then
i j j
k
jk k
ij
O O O w
w
E
) 1 ( =
Set ) 1 (
j j
k
jk k j
O O w =
Then
i j
ij
O
w
E
=
.
5) Network learning process:
Set counter , t and 0 = t , randomly generated initial
values of weights ) (t w
ij
, set learning rate , the system
error
, set the
maximum number of iterations T .
Enter the study samples X , and calculate the desired
output membership
k
d of sample
Calculate the input value ) (t net
j
of each node and
output value ) (t O
j
:
=
i
i ij j
t O t w t net ) ( ) ( ) (
,
) (
1
1
)) ( ( ) (
t net
j j
j
e
t net f t O
+
= =
Calculate error ), (t E
=
k
k k
d y t E
2
) (
2
1
) (
Stop criteria: If ) (t E or T t > , then stop. Otherwise,
turn to (6);
Calculate the Adjustment value ) (t
j
of calculation
errors.
Output layer:
)) ( 1 ( ) ( ) ) ( ( ) ( t O t O d t O t
j j j j j
=
Hidden layer:
)) ( 1 ( ) ( ) ( ) ( ) ( t O t O t w t t
j j
k
jk k j
=
Where,
k
express the lower node number to node
j
Calculate the Adjustment value of weights
) ( ) ( ) 1 ( ) ( t O t t w t w
i j ij ij
+ =
Revise weights:
) ( ) ( ) 1 ( t w t w t w
ij ij ij
+ = +
, 1 + = t t Turn to (3).
6) Network identification:
Suppose x is the sample to be recognized. Input x
into the trained network. Suppose the greatest output is of
the
0
k output node, x belongs to the
0
k th category is
determined.
{ } K k x k
k
k
, , 2 , 1 | ) ( max
0
= = (19)
D. Unascertained RBF Neural Networks
1) Structure of unascertained RBF network:
Unascertained RBF network consists of three layers
such as input layer, hidden layer and output layer, which
neurons in same layers has no connection, and between
the adjacent two-layers has fully connected. Number of
input layer neurons is the sample dimension; hidden layer
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1635
2011 ACADEMY PUBLISHER
and output layer neuron number are the classification
number of samples. Unascertained RBF network is
characterized by only one hidden layer; hidden layer
neurons nodes select the Gaussian function to have a non-
linear mapping of input and output, layer neurons are
linear combine node. Its structure is shown in Fig.2.
Figure 5. The unascertained RBF network structure.
Suppose the input sample was x , then the output of
the i th hidden layer nodes was as follows:
=
2
2
exp
i
i
i
m x
(20)
Where is European norm,
i
m and
i
were the
centers and width of the
i
th hidden layer units of RBF.
The
j
th neuron actual output of output layer is:
=
=
K
i
i ij j
w y
1
(21)
1
0
=
Compared with BP neural network, RBF network
many has quicker convergence speed, because o close to
i
m has a larger output value, far away from
i
m , and its
output decreases rapidly.
2) The desired membership calculating method:
Unascertained RBF neural network in the desired
output method of calculating degree of membership.
Given known n samples, each of the known samples
i
x are point of d dimensional feature space, that is:
( )
T
id i i i
x x x x , ,
2 , 1
=
The n samples are divided into K categories:
K
C C C , , ,
2 1
,
k
m is the category center vector of
k
C ) , , 2 , 1 ( K k = . Considering the same type of
sample point should be in-dimensional feature space with
each other more "close" is reasonable. We have assumed
that the "close" is the Euclidean distance proximity.
Supposing the
i
th training sample is
( )
T
id i i i
x x x x , ,
2 , 1
= , the j th ( ) d j 1 data is
ij
x that is the nominal quantity of data. Supposing
k
m
classified Center Vector of
k
C :
T
kd k k k
m m m m ) , , , (
2 1
=
(22)
Unascertained classification in accordance with the
point of view, give a classification, first of all concerned
are in a given category, the characteristics of the
classification of all make a little contribution, and
contribution to the value of quantitative calculation.
Hereinafter referred to as "normalized" after the
classification of the characteristics of the contribution
value of the characteristics regarding the classification of
the classification weights. And, in the calculation of the
sample about when various types of membership, in
essence, to use a variety of characteristics of the weight
classification.
In order to quantitatively describe the contributions of
d
characteristics to the initial classification.
Let
=
=
C
k
k
m
C
m
1
1
(23)
) , , , (
2 1 d
m m m m =
Let , 1 , ) (
1
2
1
2
d j m m
C
j
C
k
kj j
=
=
(24)
The size of variance
2
j
reflects the extent of discrete
the type of
K
centers as
K
m m m , , ,
2 1
in the first
feature on values.
Let
=
=
d
j
j j j
w
1
2 2
(25)
Obviously,
j
w satisfied : 1 0
j
w and
1
1
=
=
d
j
j
w .
Then,
j
w is called the classification weights of j
characteristics under a given classification conditions.
Let
=
=
d
j
kj ij j ik
m x w
1
2
) (
(26)
Where: is non-negative real number, usually taken
as
01 . 0 ~ 001 . 0 =
.
In (26), if 0 =
j
w , it is illustrated that the
characteristic j has no contribution of distinction
between K categories, so j should not appear in the
calculation of the weight in the distance.
Thus, we can calculate the possibility of some measure
that the sample
i
x belonging to the k th category as
follows:
=
+ +
= =
K
k ik ik
k i ik
C x
1
1 1
) (
(27)
Where, 001 . 0 ~ 01 . 0 = ,
1636 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
Obviously, 1 0
ik
and 1
1
=
=
K
k
ik
, therefore,
ik
known as the unascertained measure of samples
i
x
belonging to the k th category, that is, we want to
calculate the expectations output membership degree
i
k
d ,
then
ik
i
k
d =
3) Unascertained RBF neural Network Learning
Process:
RBF network has been used for the study, it is to
classify the N known samples in K categories, and to
determine the classification of unknown samples
x
.
Unascertained RBF network is not only stress the desired
output 1 or 0, but also required specific calculation, and
the rest are same with BP networks. Unascertained RBF
network parameters need to learn there are three: the
center of basis function and the variance as well as the
weights of hidden layer to output layer connection. The
learning steps are as follows:
Center adjust: Unascertained-means clustering
algorithm.
Given classification number K and the system
accuracy
1
, set counter
0 = t
;
Give the initial classification of
n
samples, get K
cluster center vector
) , , 2 , 1 ( ), ( n k t m
k
=
Calculating the unascertained measure
K k n i t
ik
~ 1 , ~ 1 ), ( = =
of samples
i
x
belonged to
the
k
th category
Determine a new type of center vector
) 1 ( + t m
k
from
ik
as follows:
= =
= +
n
i
ik
n
i
i ik k
x t m
1 1
) 1 (
(28)
Calculate
=
+ =
K
k
k k
t m t m err
1
) ( ) 1 ( , and if
1
err
, so, stop iteration and turn to f); Otherwise, let
1 + = t t
,turn to c)
Recalculate unascertained measure of the sample
i
x
belonging to the k th category,
Determining the varianc: In the center adjustment
process, the variance
k
is determined by (28).
The study of connection weights
Supposing ) ~ 1 ( , K i
i
= is the output of the i th
neuron months in hidden layer, ) ~ 1 ( , K j y
j
= is the
actual output of the j neurons in output layer,
j
d is the
expectations corresponding output. Then,
) ~ 1 ( ,
1
K j w y
K
i
i ij j
= =
=
.
The output layer error is:
=
=
K
j
j j
y d E
1
2
) (
2
1
(29)
i j j
ij
j
j ij
y d
w
y
y
E
w
E
) ( =
(30)
i j j
ij
ij
y d
w
E
w ) ( =
= (31)
Weight correction formula is as follows:
) ( ) ( ) 1 ( t w t w t w
ij ij ij
+ = + (32)
4) Unascertained RBF neural network learning process:
To identify samplesx , input x to the trained network,
supposing the greatest output is of the
0
k th output node,
then x belongs to the
0
k th category.
Recognition Criteria:
{ } K k x k
k
k
, , 2 , 1 | ) ( max
0
= = (33)
Its said that the sample x belongs to the
0
k th
category.
E. Application case
In a market economy, enterprises are faced with a wide
variety of risks. Therefore the establishment of a sound
and effective financial risk early warning system is of
great necessity to the monitoring and control of financial
risk [11, 12]. We put the 45 selected sample data divided
into training samples and test samples (30 as training
samples, 15 as test samples) into unascertained neural
network system. There were 15 nodes which value affect
financial risk[11,12] is to input into neural network , 13
nodes in the hidden layer, and 1 node that indict the
output value (1 represents safety and 0 represents
unsaved) of the risk in the output layer.
The learning rate was 0.01, and expectative error was
0.001.Then the neural network was programmed by
software Matlab7.1. The training results are shown in
Table1. The network structure is 15x13x1. The average
variance EMS was 2.343 1110-5,a nd training time was
54 second. Trained 2386 times, reaching the goal,
training completed, the network convergence, when the
total error is 0.000996. Re-enter the training samples to
the best network training network detection, error rate to
0, and the network fitting fitting rate of 99.8%. Samples
will be entered into the prediction network prediction,
prediction results were shown in table 1.
F. Conclusions
Comparing Table.1 with the sample data, there is only
one sample of mistake. Therefore the misjudgment rate is
6.67%, that is the correct identification rate is 93.33%.
From this example, we can see unascertained neural
network for classification has a high application value. So,
not only in theory but also in practice, combining
unascertained systems with feedforward artificial neural
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1637
2011 ACADEMY PUBLISHER
networks can obtain more reasonable and more advantage
of nonlinear mapping that can handle more complete type
of and comprehensive data.
Comparing Table.2 with the sample data, there were
no samples of mistake. Therefore, the misjudgment rate is
0, that is, the correct identification rate is 100%.
From this example, we can see unascertained RBF
neural network for classification has a high application
value. So, not only in theory but also in practice,
combining unascertained systems with RBF artificial
neural networks can obtain more reasonable and more
advantage of nonlinear mapping that can handle more
complete type of and comprehensive data.
REFERENCES
[1] Wang Guangyuan. Unascertained information and
unascertained process. J ournal of Harbin University of
engineering, 1990(4):1-9. (in Chinese).
[2] Liu K D, Wu H Q, Pang Y J . Process and Application of
Uncertain Information. Beijing: Science Press, 1999 (in
Chinese).
[3] LIU Ya-jing, MAO Shan-jun, LI Mei, YAO J iming. Study
of a Comprehensive Assessment Method for Coal M ine
Safety Based on a Hierarchical Grey Analysis. J China
Univ Mining Technol 2007, 17(1):00060010.
[4] H.W.Shi, W.Q. Li, W.Q. Meng. A New Approach to
Construction Project Risk Assessment Based on Rough Set
and Information Entropy. 2008 International Conference
on Information Management, Innovation Management and
Industrial Engineering. Dec 2008:187-190.
[5] Lee,H.S.(2005).A fuzzy multi-criteria decision making
model for the selection of the distribution center. Lecture
notes in articial intelligence, 3612, 1290-1299.
[6] Liu J une,Wang Haikuai,Zhang Likun. Application of
Evaluating Model of Unascertained Measure in Bid &
Tender of Construction Supervision. Hong Kong, China:
Proceedings of 2004 International Conference on
Construction & Real Estate Management, 2004, 337-340.
[7] Li Wan-qing, Ma Li-hua, Meng Wen-qing. Based on
Unascertained Number Estimating Method of Projects
Duration. Statistic and Decision, 2006, (5):131-133.(in
Chinese)
[8] SHI Huawang. The Risk Early-warning of Hidden Danger
in Coal Mine Based on Rough Set-neural network.
Proceeding of the 2nd International Conference on Risk
Management and Engineering Management. November 4-
6,2008.pp314-317
[9] B. Irie, S. Miyake, Capability of three-layered perceptions,
Proceedings of IEEE International Conference on Neural
Networks, San Diego, USA, J uly 1988, pp. 641648.
[10] Salchenberger, L.M., Cinar,E.M., Lash,N.A. Neural
networks: a new tool for predicting thrift failures. Decision
Sciences, 1992, 23, 899-916.
[11] Bose NK, Liang P. Neural network fundamental with
graphs, algorithms and applications. McGraw-Hill
International Editions; 1992.
[12] MATLAB The Mathworks Inc., version 7.0.1.24704.
TABLE I.
PREDICTIVE RESULT TABLE OF PREDICTIVE SAMPLE
No. 1 2 3 4 5
Unsafe 0.6662 0.8934 0.2795 0.7171 0.7472
Safe 0.3338 0.1066 0.7205 0.2829 0.2528
No. 6 7 8 9 10
Unsafe 0.4332 0.6769 0.7105 0.5683 0.6029
Safe 0.5668 0.3231 0.2895 0.4317 0.3971
No. 11 12 13 14 15
Unsafe 0.0412 0.178 0.9355 0.8348 0.0642
Safe 0.9588 0.822 0.0645 0.1652 0.9358
1638 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
Researches on Grid Security Authentication
Algorithm in Cloud Computing
Keshou Wu *
Department of Computer Science and Technology, Xiamen University of Technology, Xiamen 361005, China
Email: [email protected]
Lizhao Liu
Department of Computer Science and Technology, Xiamen University of Technology, Xiamen 361005, China
Email: [email protected]
Jian Liu
College of Information Sciences and Technology, The Pennsylvania State University, PA, USA
Email: [email protected]
Weifeng Li, Gang Xie, Xiaona Tong and Yun Lin
KOLLZOK Intelligent Technology Co., Ltd, Xiamen, 361024, China
Email: [email protected]
Abstract
1
Focusing on multi-machine distributed
computing security problems in cloud computing, the paper
has proposed a grid distributed parallel authentication
model based on trusted computing, which can realize
simultaneous verification of grid authentication and grid
behavior on upper layer of SSL and TLS protocols.
Adaptive grid authentication method is established applying
adaptive stream cipher framework; an adaptive stream
cipher heuristic code generator and k-means heuristic
behavior trust query function is proposed and acted as
authentication kernel. Through comparison of the test
results of TLS and SSL authentication protocol and the new
grid authentication method, the effectiveness of the new grid
authentication method has been explained.
Index Termsdistributed computing; trusted computing;
cloud computing; grid behavior; grid authentication; TLS;
SSL
I. INTRODUCTION
Transport Layer Security (TLS) and its predecessor,
Secure Sockets Layer (SSL), are cryptographic protocols
that provide communications security over the
Internet[1][2]. TLS and SSL encrypt the segments of
network connections above the Transport Layer, using
symmetric cryptography for privacy and a keyed message
authentication code for message reliability. Several
versions of the protocols are in widespread use in
applications such as web browsing, electronic mail[3][4],
Internet faxing, instant messaging and voice-over-IP
(VoIP).TLS is an IETF standards track protocol, last
updated in RFC 5246 and is based on the earlier SSL
specifications developed by Netscape
*the corresponding author.
The work is supported by: The national natural science Foundation
(60903203)
Corporation[5][6][7]. The TLS protocol allows
client/server applications to communicate across a
network in a way designed to prevent eavesdropping and
tampering.A TLS client and server negotiate a stateful
connection by using a handshaking procedure. During
this handshake, the client and server agree on various
parameters used to establish the connection's
security[8][9][10].
Cloud computing refers to the provision of
computational resources on demand via a computer
network. In the traditional model of computing, both data
and software are fully contained on the user's computer;
in cloud computing, the user's computer may contain
almost no software or data (perhaps a minimal operating
system and web browser only), serving as little more than
a display terminal for processes occurring on a network
of computers far away[11][12]. A common shorthand for
a provider's cloud computing service (or even an
aggregation of all existing cloud services) is "The Cloud".
The most common analogy to explain cloud computing is
that of public utilities such as electricity, gas, and water.
Just as centralized and standardized utilities free
individuals from the vagaries of generating their own
electricity or pumping their own water, cloud computing
frees the user from having to deal with the physical,
hardware aspects of a computer or the more mundane
software maintenance tasks of possessing a physical
computer in their home or office. Instead they use a share
of a vast network of computers, reaping economies of
scale [13][14].
Grid computing is a term referring to the combination
of computer resources from multiple administrative
domains to reach a common goal. The grid can be
thought of as a distributed system with non-interactive
workloads that involve a large number of files. What
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1639
2011 ACADEMY PUBLISHER
doi:10.4304/jnw.6.11.1639-1646
distinguishes grid computing from conventional high
performance computing systems such as cluster
computing is that grids tend to be more loosely coupled,
heterogeneous, and geographically dispersed. Although a
grid can be dedicated to a specialized application, it is
more common that a single grid will be used for a variety
of different purposes. Grids are often constructed with the
aid of general-purpose grid software libraries known as
middle ware[15][16].
Trusted Computing (TC) is a technology developed
and promoted by the Trusted Computing Group.The term
is taken from the field of trusted systems and has a
specialized meaning. With Trusted Computing, the
computer will consistently behave in expected ways, and
those behaviors will be enforced by hardware and
software. In practice, Trusted Computing uses
cryptography to help enforce a selected behavior. The
main functionality of TC is to allow someone else to
verify that only authorized code runs on a system. This
authorization covers initial booting and kernel and may
also cover applications and various scripts. Just by itself
TC does not protect against attacks that exploit security
vulnerabilities introduced by programming bugs[17][18].
Figure 1. Grid distributed parallel authentication model
II. GRID DISTRIBUTED PARALLEL AUTHENTICATION
MODEL
If the grid entity A want to intact with grid entity B in
a cloud ,the grid entity A will first go into grid entities
heuristic trusted query, this process need to calculate
trusted value in grid domain and out of grid domain; at
the other hand it need to compute the grid entity adaptive
authentication. If the verify behavior reach its gate value
the information will be sent to the decision module
besides the information of grid entity adaptive
authentication, then the decision module will give the
comprehensive information of the trusted value of grid B
for A. During the process grid B will interact with grid
entity adaptive authentication module to give sufficient
information or else it will be rejected.
III. ADAPTIVE GRID AUTHENTICATION VERIFY FRAME
Adaptive grid authentication verifies can realize signal
self- detection and self-adjusting. [19] The adaptive
generator initialization within the production of
continuous or intermittent output with automatic
recognition and adjustment function of the generate
signal, through the design of reference models or self-
tuning controller module can be achieved on the output or
received signal real-time adjustment and dynamic match.
Adaptive encryption control principle is as Figure 2.First
initiative the clock module and the clock stimulus module
as a self-reference model, since the self-reference model
will reconstruct when the detective signal received from
the self-detection module does not match, and the re-
constructed reference model is not dependent on external
stimulation, which depends only on the initial
algorithm
Ts
. This means that as long as both encryption
and decryption have the same reference model, after the
same initialization, they can always get synchronous
control signal. For example, the use of the two CMOS
unit can keep output synchronism at
8
1 0 / k S i
.Take the
output signal from the self-reference model as the first
stage parameter of chaos cascade module, the output
signal of the first stage of chaotic module as the input
1640 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
signal of self-tuning module and at the same time, as the
input signal of the second stage
Logistic
generator; the
output signal of the second stage
Logistic
generator as the
input signal of the third stage or input signal of key
generator; the output signal of the third stage and key k
together as the initial key of key generator.The self-
detection module consists of two detectors, which are
responsible for adding state value
1 w
to the plaintext and
testing encrypted state value
1c w
and channel feedback
state value
1c w
. When the encrypted value and the
channel feedback state values are abnormal, feedback
should be done to the reference model, then the self-
reference model will update with a new reference model
according to the current state value of initial algorithm,
thus to update all the output signals of chaotic module,
key generators will also update new key stream without
changing of key k.The receiver uses the decoder to return
1c w
to the sender first to confirm the acceptance of the
cipher, at the same time offers
1c w
evaluation to the
sender for the channel safety index testing thus to
determine whether there is need to resend the cipher or
renew the reference model.
Figure 2. Adaptive grid authentication method
IV. HEURISTIC CODE GENERATOR AND HEURISTIC
BEHAVIOR TRUST QUERY FUNCTION
A. Structure of heuristic code generator
Supposed that the grid entity have some shape volume
and weight[20], the length of grid entity is L, the width W
and the height H, the direction of the layers vary from
each other due to different position of grid entity, and the
x, y, z mentioned above correspondingly represents
different value[21]. If the grid entity is located behind of
grid domain, then x equals to W, y equals to H, and z
equals to L, which means the direction of the layer is
along with the length of entity. If the grid entity is located
in the side of compartment, then x equals to L, y equals to
H, z equals to W, and the direction of the layer is along
with the width of compartment. If there are two entitiess
set respectively behind and side of the grid, then x equals
to W, y equals to L, z equals to H, and the direction of the
layer is along with the height of grid. Call the best layer
loading program and the plane optimal layout program to
solve these and the specific steps are as follows:
(1) enter the grid entity size L, W, H, the gridgo box
size s
1
, s
2
, and s
3
(make s
1
s
2
s
3
). Assign x, y, z with
L, W, H according to the position of the grid entity;
(2) call the best layer loading program to compute a, b,
c;
(3) Calculate the optimal layout of each layer:
If a 0, make
2 m
s s =
3 n
s s = , call the plane
optimal layout program genenator
2 3
( , ) s s ;
If b 0, make
1 m
s s =
3 n
s s = , call the plane
optimal layout program genenator
1 3
( , ) s s ;
If c 0, make
1 m
s s =
2 n
s s = , call the plane
optimal layout program genenator
1 2
( , ) s s ;
(4) Calculate the total number of the loading container
Sum= a*genenator
2 3
( , ) s s + b*genenator
1 3
( , ) s s +
c*genenator
1 2
( , ) s s ;
(5) Output Sum, a, b, c, and the corresponding values
of the parameters of the layout.
Given a grid grid entityset C, C={1,2,,n}, the set
C
0
={0} denotes the grid;
ij
d is the trusted value between
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1641
2011 ACADEMY PUBLISHER
arbitrary two nodes i, j in C and C
0
;
i
q i=1,,n is the
demand of the grid i; w is the maximum of the trusted
capability of, R is the number of the grid entity that needs
to finish the vertity, which is
1
/ ,
n
i
i
R q w
=
=
(1)
is the rounded up function, such
as 6.2 7 =
;
r
ij
x (r=1,, R, i and j=0,, n, and where i
not equals to j) is the decision variables,
r
ij
x =1 if and
only if the r routine pass the arc(i, j), otherwise
r
ij
x =0;
i
r
y ((r=1,, R, i =1,, n ) is the demand of the i grid
which meets by the r routine;
r
S denotes the grid set
served by the r routine,
r
S denotes the number of grid
included in S. There are some assumptions of the model:
(1) the trusted values between two nodes is symmetric,
ij
d =
ji
d ;
(2) the trusted values of the nodes satisfy the triangular
inequality, which is
ik kj ij
d d d + > ;
(3) all the grid entity start from the grid and back to
grid after each delivery;
(4) every grid s needs must be satisfied and can be
done by one or more grid entity.
The objective of this problem is to arrange the routine
to minimize the cost of delivery. The cost is represented
by the total travelling trusted value. As the description
above, the problem can be modeled as:
min
1 0 0
R n n
r
i j i j
r i j
d x
= = =
(2)
0 0
0, , ; 1, ,
n n
r r
ik kj
i j
x x k n r R
= =
= = =
(3)
1 0
1 0, ,
R n
r
ij
r i
x j n
= =
=
(4)
1
1, ,
R
ri i
r
y q i n
=
= =
(5)
{ } 1 1, , ; 0
r
ij
i s j s
x S r R S C
= =
(6)
1
1, ,
n
ri
i
y w r R
=
=
(7)
0
1, , ; 1, ,
n
r
i j i r i
j
x q y r R i n
=
= =
(8)
{0,1} , 1, , 1, ,
r
ij
x i j n r R = = ;
(9)
0 1, , ; 1, ,
i ri
q y i n r R = = (10)
The constraint (2) is to minimize the total travelling
trusted value; constraint (3) means the flow conservation,
that is, the number of grid entity is equal between
entering and exiting of a node; Constraint (4) and (5)
ensure that each node is visited at least one time and the
requirement is satisfied; (6) shows that the edges between
served grid s equals to the number of served grid s minus
1 in each route, (7) shows thetruseted capability of grid;
(8) shows that the grid is served only the grid pass.
Compute 1: input , x y ,
m
s ,
n
s
Compute 2: determine whether the grid A and B can be
trusted in its domain, return 0 if not.
Compute 3: for
(
1 1 1
/ 2 ; / ;
n n
y y s y y s y = + +
)
{ for (
1 1 1
/ 2 ; / ;
m m
x x s x x s x = + +
)
{initialize
5 5
0 x y = = and compute
2 4
, x y
using
1 0 0
R n n
r
i j i j
r i j
d x
= = =
and
0 0
0, , ; 1, ,
n n
r r
ik kj
i j
x x k n r R
= =
= = =
;
If (
1
/ y/
n m m
s y s s
)
{
2
y/
m
y s =
3 3
0 x y = = ; Compute
4
x using
1
1, ,
R
ri i
r
y q i n
=
= =
}
else
{
2 1
/
n m
y s y s =
; Compute
3
y using
{ } 1 1, , ; 0
r
ij
i s j s
x S r R S C
= =
;
If (
2 2
/ / 2
n m n m m
s x s s x s s >
) { compute
3 4
, x x using 0 1, , ; 1, ,
i ri
q y i n r R = = .}
Else { compute
3 4
, x x using
1
1, ,
n
ri
i
y w r R
=
=
and
1
1, ,
R
ri i
r
y q i n
=
= =
;
If(
3 4 m n n
x s x s x s > ){ compute
5 5
, x y using
0
1, , ; 1, ,
n
r
i j i r i
j
x q y r R i n
=
= =
and
{0,1} , 1, , 1, ,
r
ij
x i j n r R = = ;
}
}
}
Compute the total number of
generator=
1 1 2 2 3 3 4 4 5 5
x y x y x y x y x y + + + + .
Compare and record the generator number and the
agreement method.
}
};
Compute 4Output the optimized result.
We can also
have
5 3 4
( ) /
m n n
x x s x s x s =
5 2
( ) /
m m
y y s y s =
.
The objective function is
generator=
1 1 2 2 3 3 4 4 5 5
x y x y x y x y x y + + + + .
When
2 2
/ / 2
n m n m m
s x s s x s s >
,
1642 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
3 2
/ 1
n m
x s x s = +
4 3
( ) /
m n
x x s x s =
,
The objective function is
generator=
1 1 2 2 3 3 4 4
x y x y x y x y + + + .
So in such verify model, the parameters can be solved
as long as x
1
and y
1
are known. The range of
1 1
, x y are
1
0 /
m
x x s
and
1
0 /
n
y y s
, which becomes to
1
/2 /
m m
x s x x s
and
1
/2 /
n n
y s y y s
on
considering the symmetry of verify model 4. The
objective optimization value can be found after the
traversal of all the combinations of
1 1
, x y .
B. k-means heuristic behavior trust query function
In many practical applications, the k-means clustering
algorithm (k-means algorithm) which is based on
partition clustering has been proven to be effective and
generate good results[22][23]. The steps of a general k-
means algorithm are:
Select k couples of initial cluster center;
Assign sample x which need to be classified to some
cluster center one by one according to the minimum
trusted value principle;
Calculate new value of every cluster center. Generally
the new cluster center is the mean vector of the sample
contained in the cluster field. The mean vector of the
sample in k couples of cluster need to be calculated
respectively.
Reclassify the sample and repeat iteration. The
algorithm converges when every cluster center no longer
moves, then calculation finishes.
The principle of k-means algorithm is to find k couples
of partition with a least square error and make the
generated result as compact and separate as possible. The
k-mean algorithm is relatively scalable and efficient
dealing with large data sets and the complexity
is O( ) nkt , in which n means the number of objects, k is
the number of cluster, and t is the number of iterations.
The case mainly discussed in this paper is that the
demand of grid may be greater than the maximum trusted
capacity of grid entity. Hence, it is prior to meet each grid
wholly, and then merge the remaining part to other grid to
meet.
Next the principle discussed is used to cluster the grid
entities and determine the grid s served by the same grid.
However, the SDVRP is a constraint clustering problem,
the calculation may not converge, so the number of
iterations N needs to be set to terminate forcibly and set
the clustering evaluation criteria to select better clustering
results. The clustering evaluation function used in this
paper is:
1
( )
j
R
ij
j i C
Min sumD d
=
=
,
C
j
represents cluster j. The formula above calculates
the sum of trusted value between every grid entity and the
center in the cluster. Select the minimum sum as the best
clustering result. The concrete steps are below:
Step 1: Find the grid entity whose demand is greater
than or equal to the trusted capacity of grid. Split the
demand q
i
to two parts and
s c
i i
q q , and
/
/
s
i i
c
i i i
q w q w
q q w q w
=
=
means to round down, for example 6.6 6 =
.
The demand of
s
i
q is individually met and the remained
demand
c
i
q and the other entity are merged to some
other circuit to meet. Modify the demand of the grid i to
be
c
i
q ;
Step 2: Randomly select R couples of initial cluster
center
1 1
1 , , R from the grid set 1, 2, , C n = , and
mark as set
1 1 1
1 , , P R = . Initialize every cluster set
( 1, , )
i
C i R = = , and set the value of the
maximum number of iterations N;
Step 3: Cluster the grid s. Calculate the trusted value d
ij
between every grid entity and every cluster center, and
find the nearest cluster center of every grid entity. The
nearer the trusted value is, the higher priority the grid
entity has to join the center. If the cluster wanted to join
is full loaded, then choose the second nearest. When there
is still remaining demand in the cluster, and if the adding
of the demand make the total demand of cluster C
j
exceed
W (
j
QC W > ), compute the unmet demand of grid i,
which is denoted by S, and transmit the unmet demand to
other grid of C
j.
The transmission principle is: firstly find
the grid entity (include grid i) whose demand is not less
than S in cluster C
j
, then find the cluster whose residual
demand
1
( )
z z
SuQ W QC S z P j = . Compute
the trusted value between these grid entitys and these
clusters and choose the grid entity with smallest trusted
value to split. Guess the grid entity k and its
corresponding cluster center p, add k to cluster C
p
and the
unmet demand S is met by this route. If the residual
demand of all clusters
z
SuQ S < , then select the cluster
with largest residual demand to join until S is fully met.
Repeat this step until all the grid s demands are met.
Step 4: Calculate the sum of the trusted value between
every clustering grid entity and its cluster center SumD;
Step 5: Use the following way to adjust the cluster
center and get the new
2 2
1 , , R . The coordinate
position of the cluster center
2
( 1, , ) j j R = is
2 2
1 1
, ,
i i i i
i i
j j
x C y C
i i
x x y y
n n
= =
Where n
i
is the number of grid entity in C
i
;
Step 6: Repeat Step 3-5 until reach the maximum
iteration number N. Output the clustering results
corresponding to the minimum value of sumD;
Step 7: Optimize the result of step 6 by simulated
annealing algorithm. The cool way
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1643
2011 ACADEMY PUBLISHER
is ( 1) ( ) T t k T t + = . In the formula k is a positive
constant slightly less than 1.00 and t is the times of
cooling.
In step 1, the situation that the grid demand is greater
than the trusted capacity of grid is considered. In step 2 to
6, cluster the grid s need to, and find the optimal
clustering solution. In step 7, the route optimization is
done for solving TSP problem.
The clustering process is:
(1) Random determine R (obtained from formula (1))
couples of cluster center;
(2) Calculate the trusted value between every grid
entity and every cluster
center ( 1, , ; 1, , )
ij
d i n j R = = . Sort
( 1, , )
ij
d j R = from small to large and find the
smallest trusted value from every grid entity to the cluster
center.
(3) If the smallest trusted value d
kp
is found, then the
corresponding grid k is added to cluster p, and add the
grid corresponding to the second smallest value to the
corresponding cluster, compute the residual demand SuQ
(that is, the capacity of gridriage minus the amount of
grid mounted) of the cluster and turn down. When the
residual demand of cluster is less than the demand the
grid s want to add, the split entities are selected to split in
cluster. The principle of split entity selection will be
discussed later.
(4) When the total demand of the cluster that the grid s
want to join has reached the maximumtruseted capacity
of grid entity, the second nearest cluster will be
considered. Turn down until all grid s are added into a
cluster.
In order to ensure the load factor and the least
requirement of grid entity, the grids need is allowed to
split, so the principle of grid choice splitting should be
considered. If grid i is added into a cluster p which is not
fully loaded, which makes the total demand of the cluster
exceeds the maximum trusted capacity of grid entity, the
demand needs to be split to meet. If the second nearest
cluster center is far away from the grid, the traffic trusted
values increase greatly. The unmet demand will be
allowed to transmit to a entity whose demand is greater
than the unmet demand of grid i in cluster p and which is
relatively close to the other cluster whose residual
demand should be greater than the unmet demand of grid
i, to make the demand of this entity split meet. The
demand of grid i is totally met by cluster p. If the residual
demand of all clusters is lower than the unmet demand of
grid i, then choose the one with the maximum residual
demand to join to avoid being split too many times.
V. COMPARISON OF THE TEST RESULTS OF TLS AND SSL
AUTHENTICATION PROTOCOL AND THE NEW GRID
AUTHENTICATION METHOD
We set the clouds as a pool with hundreds of
computers and there are many grid entities that cannot be
trusted or should be limited for intact, then we set some
entity to send the request to other grids to compute or
calculate some information together, so every grid in the
clouds will go into the TLS\SSL model and our new
model using distributed parallel authentication model
based on trusted computing, then we reminder the
accuracy and lead time of all the model.
From the table1 and table2 we can see that the
accuracy rate of SSL&TLS authentication is lower than
distributed parallel authentication model, the lead time of
SSL&TLS authentication is longer than distributed
parallel authentication model. In table3 we will show the
detail comprehensive improvement for different clouds
and different internet environment.
TABLE I.
ACCURACY RATE OF SSL&TLS AUTHENTICATION AND THE LEAD TIME
Experiment
index
Accuracy
rate(km)
TLS
TIME(ms)
SSL TIME(ms)
1 63.1 163 537
2 69.2 175 805
3 67.9 170 966
4 66.4 166 881
5 65.0 169 946
Aver. 66.32 168.6 827
TABLE II.
THE ACCURACY RATE AND LEAD TIME FOR USING DISTRIBUTED
PARALLEL AUTHENTICATION MODEL
Experiment index Accuracy rate Computation time(s)
1 94.49 5.515
2 92.12 5.170
3 94.57 4.911
4 99.45 5.069
5 94.74 5.010
6 91.12 5.053
7 94.49 5.586
8 92.12 5.174
9 94.57 4.938
10 99.45 5.068
Aver. 92.712 5.1494
From the table3 we can see the comprehensive
evaluation of distributed parallel authentication model is
much better than SSL&TLS that the distributed parallel
authentication model use less computing operation and
computing times but with 30 higher correct accuracy
percent and 35.7 equal total percent.
1644 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
TABLE III.
ACCURACY RATE OF SSL&TLS AUTHENTICATION AND THE LEAD TIME
AUTHENTICATION MODEL AND SSL&TLS
Authentication
Comprehensive evaluation of our algorithm Comprehensive evaluation of SSL&TLS
Improve-ment (%)
Correct times Computation time(s) Correct times Computation time(s)
A01 4675585 4.6 5307907.00 17 31.9
A02 8158990 7.9 8542757.00 64 34.5
A03 8149102 9.5 8413577.00 60 33.1
A04 10696819 15.2 10708613.00 440 30.1
A05 13682582 22.5 13403505.00 1900 32.1
A10 13929231 22.3 13403505 40 33.9
A11 9972833 10.7 10569587.00 86 35.6
B01 4228238 5.6 4629056.00 27 38.7
B02 5966489 9.2 6239394.00 78 34.4
B03 7618932 13.6 7714649.00 122 31.2
B04 9915250 24.3 9471386.00 545 34.7
B05 12895562 36.1 11482700 1224 32.3
B10 12575805 35.8 11482700 516 39.5
B11 12074965 17.9 10552825.00 85 34.4
C01 6894370 7.0 7653121.00 56 3.9
C02 9809075 11.7 11340760.00 71 33.5
C03 14331431 17.6 15151732.00 206 35.4
C04 21142701 31.8 21018042.00 564 30.6
C05 27019924 47.4 25858494.00 3811 34.5
C10 27011949 46.2 25858494 259 34.5
C11 32827882 23.8 30604668 188 37.3
D01 9747314 9.5 10391059.00 34 36.2
D02 15100849 16.6 15566936.00 311 33.0
D03 19555495 25.2 20541296.00 412 34.8
D04 31193493 48.7 29916416.00 1822 34.3
D05 41245476 72.4 36242004 2598 33.8
D10 41002814 70.6 36242004 1037 33.1
D11 50395453 35.7 45026152 523 31.9
EQUAL 9238523 34..9 20958379 581 35.7
VI. CONCLUSION
From the above analysis, take trusted computing as the
basis, in a cloud computing, grid distributed parallel
authentication method which is realized by grid
authentication and grid behavior simultaneous
authentication, established on the upper layer of SSL and
TLS protocols, by adaptive stream cipher heuristic code
generator and heuristic behavior trust query function,
plays well in authentication. However, on the trust issue
of grid behavior, further standardization is needed on
entities quantitative trust level within a domain, while the
core of the heuristic algorithm needs to quantify the grid
entities with the shape, weight, size and other physical
indicators as a physical entity, this quantitative method
still needs to be further improved, so as to promote
adaptive stream cipher authentication framework and
improve the upper trusted computing platform.
REFERENCES
[1] http://tools.ietf.org/html/rfc5246
[2] The SSL Protocol: Version 3.0Netscape's final SSL 3.0
draft (November 18, 1996)
[3] "SSL/TLS in Detail". Microsoft TechNet. Updated July
31, 2003.
[4] Thomas Y. C. Woo, Raghuram Bindignavle, Shaowen Su
and Simon S. Lam, SNP: An interface for secure network
programming Proceedings USENIX Summer Technical
Conference, June 1994
[5] Dierks, T. and E. Rescorla. "The Transport Layer Security
(TLS) Protocol Version 1.1, RFC 4346".
http://tools.ietf.org/html/rfc5246#ref-TLS1.1.
[6] National Institute of Standards and Technology.
"Implementation Guidance for FIPS PUB 140-2 and the
Cryptographic Module Validation Program".
http://csrc.nist.gov/groups/STM/cmvp/documents/fips140
-2/FIPS1402IG.pdf.
[7] Eric Rescorla (2009-11-05). "Understanding the TLS
Renegotiation Attack". Educated Guesswork.
http://www.educatedguesswork.org/2009/11/understandin
g_the_TLS_renegoti.html.
[8] McMillan, Robert (2009-11-20). "Security Pro Says New
SSL Attack Can Hit Many Sites". PC World.
[9] "SSL_CTX_set_options SECURE_RENEGOTIATION".
OpenSSL Docs. 2010-02-25.
[10] Various (2002-08-10). "IE SSL Vulnerability". Educated
Guesswork.
[11] Sean Marstona; Zhi Lia; Subhajyoti Bandyopadhyaya;
Juheng Zhanga; Anand Ghalsasib. "Cloud computing
The business perspective". Decision Support Systems.
[12] M. Armbrust; A. Fox; R. Griffith; A.D. Joseph; R.H.
Katz; A. Konwinski; G. Lee; D.A. Patterson; A. Rabkin;
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1645
2011 ACADEMY PUBLISHER
I. Stoica and M. Zaharia. "Above the Clouds: A Berkeley
View of cloud computing". University of California at
Berkeley. 10 April 2011.
[13] "NIST.gov Computer Security Division Computer
Security Resource Center". Csrc.nist.gov.
[14] "Gartner Says Cloud Computing Will Be As Influential
As E-business". Gartner.com. 2010-08-22.
[15] a b "What is the Grid? A Three entity Checklist".
http://dlib.cs.odu.edu/WhatIsTheGrid.pdf.
[16] Diuf.unifr.ch. "Pervasive and Artificial Intelligence
Group: publications [Pervasive and Artificial Intelligence
Research Group]". May 18, 2009.
[17] Chris Mitchell, Trusted Computing, Institution of
Electrical Engineers, 2005.
[18] Ross Anderson, "Cryptography and Competition Policy -
Issues with Trusted Computing ", in Economics of
Information Security, from series Advances in
Information Security, Vol. 12, April 11, 2006.
[19] Liu Lizhao, A New Adaptive SSC and SSSC Stream
Cipher Model Design and Implementation [J]. Advanced
Materials Research Journal: 2011, 1(143), 298 -303
[20] Yang, H., J. Shi.: A Hybrid CD/VND Algorithm for three-
dimensional bin packing [C]. The 2nd International
Conference on Computer Modeling and Simulation. IEEE
Press, Sanya(2010)
[21] Almeida A. d., Figueiredo M.B.: A particular approach for
the three-dimensional packing problem with additional
constraints [J]. Computers & Operations Research.
37(11), 1968-1976(2010)
[22] C. Archetti, A. Hertz, M.G. Speranza. A Tabu search
algorithm for the split delivery vehicle routing
problem[J].Transportation Science, 40, 64-73(2006)
[23] C. Archetti, M.W.P. Savelsbergh, M.G. Speranza. An
optimization-based heuristic for the split delivery vehicle
routing problem [J]. Transportation Science, 42, 22-
31(2008)
Keshou Wu (1975.3-), Xiamen city, Fujian Province, China,
PhD of Huazhong university of science and technology,
majored in software engineering. Research field: System
Engineering, Information System, Data Mining, GIS.
Lizhao Liu (1983.3-), Xiamen city, Fujian province, China.
PhD candidate of Xiamen university, majored in automation,
system engineering, Information Science and Technology
Department. Research field: chaotic modeling and control of
unmanned airplane vehicle and information system, feature
attraction and detection, scale space and multiscale technology.
He has done the China national 985 engineering process of
unmanned airplane vehicle for the UAV\UAIS chaotic
phenomenon analysis, UAV\UAIS chaotic modeling and
control. He made the paper such as The Chaotic Characters and
New Control Strategy of Unmanned Airplane Information
System 2008 ISCID and Error! Reference source not
found.The Chaotic Disturbance of UAV System's
Communication And Coping Strategy 2008 ICCCAS. He also
has done the work of grid behavior trust model and has the
paper such as The Quantitative Assignment of The Grid
Behavior Trust Model Based on Trusted Computing 2010
Wuhan university journal.Now he is doing the work of scale
space and multiscale technology for the image analysis
especially for the feature describtion definition detection and
matching.
1646 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
Non-line-of-sight Error Mitigation in Wireless
Communication Systems
Chien-Sheng Chen
Tainan University of Technology / Department of Information Management, Tainan, Taiwan
Yi-J en Chiu
Taiwan Shoufu University / Department of Digital Entertainment and Game Design, Tainan, Taiwan
E-Mail: [email protected]
Ho-Nien Shou
Air Force Institute of Technology / Department of Aviation and Communication Electronics, Kaohsiung, Taiwan
Ching-Lung Chi
Shu-Te University / Department of Computer and Communication, Kaohsiung, Taiwan
AbstractThe need for determining the position of a mobile
station (MS) is increasing rapidly in wireless
communications systems. When there is non-line-of-sight
(NLOS) path between the MS and base stations (BSs), it is
possible to integrate many kinds of measurements to
achieve more accurate measurements of the MS location.
This paper proposed hybrid methods that utilize time of
arrival (TOA) at five BSs and angle of arrival (AOA)
information at the serving BS to determine the MS location
in NLOS environments. The methods mitigate the NLOS
effect simply by the weighted sum of the intersections
between five TOA circles and the AOA line without
requiring priori knowledge of NLOS error statistics.
Simulation results show that the proposed methods always
give superior performance than Taylor series algorithm
(TSA) and the hybrid lines of position algorithm (HLOP).
Index TermsTime of arrival (TOA), Angle of arrival
(AOA), Non-line-of-sight (NLOS)
I. INTRODUCTION
The problem of position determination of a mobile
user in a wireless network has been studied extensively
in recent year. It is always desirable to achieve the
highest possible accuracy in location applications.
However, the requirements in different applications may
differ due to various reasons such as the cost and the
technology. There are various techniques for wireless
location, which can be broadly classified into two
categories --handset-based techniques and network-based
techniques. From the technical aspect, the handset-based
techniques are easy to implement and accurate to
determine the mobile station (MS). Global positioning
system (GPS) requires installation of a receiver and
transmitting the received GPS data to the base station
(BS) for further processing and position determination.
The drawbacks of this technique include the high cost for
developing a suitable low-power and economical
integrated technology for use in the handsets. Moreover,
a GPS receiver needs to have at least four satellites
constantly visible. Therefore, the GPS-based solution is a
feasible option for outdoor positioning but not for indoor
positioning within urban environments. The existing
wireless communications infrastructure without
supplementary technology has been utilized in MS
location estimation. One of the goals of the location
solution is to allow carriers to locate current users by
existing network without expensive modifications and be
adaptable to complement satellite handset-based
techniques.
And the primary network-based techniques of wireless
communication systems include signal strength [1], angle
of arrival (AOA) [2], time of arrival (TOA) [3], and time
difference of arrival (TDOA) [4] techniques. Signal
strength is a location method that uses a known
mathematical model describing the relation between the
path loss attenuation and distance. If the angle in which
the signal MS arrives to the BS can be measured, an
AOA line can be drawn. By measuring AOA angles at
least two BSs, the intersection of two lines can be
obtained where the MS would be located. TOA location
scheme measures the propagation time it took for the
signal to travel between the MS and the BS. The TDOA
is to determine the relative position of the MS by
examining the difference in arrival-time measurements at
multiple BSs, rather than absolute arrival time.
The accuracy of mobile location estimation strongly
depends on the propagation conditions of the wireless
channels. The radio signals are usually corrupted by
additive noise, multipath propagation, and
non-line-of-sight (NLOS) propagation in wireless
location system [5]. To enhance the precision of the
location estimation, appropriate steps must be taken to
mitigate these impairments. The additive noise is
relatively easy to control comparing to other wireless
channel impairment. Usually, it is modeled as zero mean
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1647
2011 ACADEMY PUBLISHER
doi:10.4304/jnw.6.11.1647-1653
Gaussian noise with variance determined by the signal to
noise ratio (SNR), measurement resolution, and other
factors. In wireless locating systems, the transmitted
signals are frequently corrupted with multipath
propagations; the line-of-sight (LOS) is blockaged. Many
procedures are necessary to reduce the effects of
impairments. First of all, we define multipath as the
presence of multiple signal paths between the MS and
BS. In general, multipath propagation is already a
problem for most positioning techniques even if the LOS
path is existent. Severe multipath propagation can reduce
the positioning accuracy significantly. The reflections
and diffractions from buildings in urban areas or
mountains in rural terrains can cause significant path
blockages and multipath time dispersions [6]. In the case
of AOA schemes, multipath in the propagation channels
would significantly degrade the performance of the
estimation of direction-of-arrival [7]. In particular, the
multipath propagations may also cause serious problems
in the signal strength measurements [5]. Even when the
LOS propagation exists, the multipath propagations can
induce errors in the timing estimations of the time-based
location systems [5].
The NLOS condition is even more critical, because the
LOS path is blocked additionally. A common
requirement for high location accuracy is the presence of
a LOS path between the MS and each participating BS.
In practice, LOS paths are not always readily available.
The NLOS propagation occurs usually in urban or
suburban areas. Due to the reflection or diffraction of the
signals between the MS and the BSs, NLOS propagation
results in significant errors in the time and angle
measurements. When a priori knowledge of the NLOS
error is available, different NLOS identification and
correction algorithms for determining MS location are
proposed [8]. The standard deviation of the range
measurements for NLOS propagation is much higher
than LOS propagation [9].
To improve the accuracy of MS location, it is
reasonable to combine two or more schemes give
location estimation of the MS. Hybrid techniques can be
used to take the advantage of various positioning
schemes. A hybrid TDOA/AOA algorithm that can offer
more accurate location estimation for wideband CDMA
cellular systems was proposed in [10]. To achieve high
location accuracy, the scheme uses TDOA information
from all BSs and the AOA information at the serving BS
in small error conditions. A hybrid range/range
difference algorithm was used to estimate the MS
location in a GSM system when only two base
transceiver stations (BTSs) are available and the MS is
located at the mass center of the serving cell [11]. The
positioning performance improvement of TDOA
schemes using the AOA measurement from the serving
BS over pure TDOA schemes is evaluated in [12]. We
have proposed hybrid geometrical positioning schemes
to estimate MS location under the condition that the MS
can be heard by only two BSs in [13].
In this paper, we apply the hybrid geometrical
positioning schemes to locate MS when five BSs are
available for location purposes. We present a mobile
positioning system that adopts TOA-aided AOA
information at five BSs to estimate the location of an MS.
By acquiring the intersections of five TOA circles and
AOA line, it is possible to locate the desired MS in
wireless communication systems. The proposed
positioning methods are based on the weighted sum of
the intersections of five TOA circles and the AOA line.
Simulation results show that the proposed methods
always achieve better location accuracy than Taylor
series algorithm (TSA) [14] [15] and the hybrid lines of
position algorithm (HLOP) [16].
The remainder of this paper is organized as follows.
The system model is given in Section II. Section III
presents the commonly used positioning methods TSA
and HLOP. Section IV describes various approaches
using the intersections of the five TOA circles and the
AOA line to estimate the position of MS. Simulation
results are presented in Section V. Conclusion is given in
Section VI.
II. SYSTEM MODEL
TOA measurements from five BSs and the AOA
information at the serving BS can be employed to give a
location estimate of the MS, as shown in Fig. 1 [17]. Let
denote the propagation time from the MS to BS , i and
the coordinates for BS i re given by
(
,
i
t
a
)
i
Y ,
i
X
...5 2, , 1 = i
. The distances between BSi and the MS can
be expressed as
2 2
) ( ) (
i i i i
Y y X x t c r + = =
(1)
where is the MS location and c is the
propagation speed of the signals. We assume that BS1 is
the serving BS, and denote by
) , ( y x
as the angle between
MS and its serving BS.
) ( tan
1
1 1
X x
Y y
(2)
III. TAYLOR SERIES ALGORITHM (TSA) AND HYBRID
LINES OF POSITION ALGORITHM (HLOP)
To determine the MS location, TSA [14] [15] and
HLOP [16] are the most used schemes.
A. Taylor Series Algorithm (TSA)
TOA and AOA measurements are inputs to the Taylor
series position estimator. Let be the true position
and be the initially estimated position. Assume
that
) , ( y x
) , (
v v
y x
v
x x
x
+ =
,
y v
y y + =
. By linearizing the TOA
and AOA equations through the use of a Taylor series
expansion and retaining second-order terms, we have
z A (3)
1648 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
where , , ,
=
12 11
52 51
22 21
12 11
b b
a a
a a
a a
A M M
=
y
x
=
v
v
v
v
r r
r r
r r
z
5 5
2 2
1 1
M
and
,
,
1
v v
y x
i
i
x
r
a
=
,
,
2
v v
y x
i
i
y
r
a
=
2 2
) ( ) (
i v i v vi
Y y X x r + =
, , ...5 2, , 1 = i
,
,
11
v v
y x
x
b
=
v v
y x
y
b
,
12
=
,
) ( tan
1
1 1
X x
Y y
v
v
v
.
Then, the least-square (LS) estimation can be solved by
z A A A
T T 1
) (
=
(4)
The process starts with an initial guess for the MS
location and can achieve high accuracy. This method is
recursive but tends to be computationally intensive. TSA
may suffer from the convergence problem if the initial
guess is not accurate enough [14] [15].
B. Hybrid Lines of Position Algorithm (HLOP)
The method uses linear lines of position (LLOP) to
replace the circular LOP for estimating the MS location.
The detail algorithm of the linear LOP approach can be
acquired by using the TOA measurements as in [18], and
the hybrid inear LOP and AOA measurement (HLOP) in
[16]. The line which passes through the intersections of
the two circular LOPs can be found by squaring and
subtracting the distances obtained by Eq. (1) for 2 , 1 = i
and can be expressed as
). ( ) ( 2 ) ( 2
2
2
2
1
2
2
2
1
2
1
2
2 2 1 2 1
Y Y X X r r y Y Y x X X + + = + (5)
Given the linear LOPs and AOA line, the equations that
describe all the lines can be written in matrix form as
h Gl = (6)
where denotes the MS location,
=
y
x
l
X X
=
1 tan
5 1 5 1
Y Y X X
G
M M
2 1 2 1
Y Y
and
+ + +
+
=
) tan ( 2
) ( ) (
) ( )
2
1
1 1
2
5
2
5
2
1
2
1
2
1
2
5
2
2
2
2
Y X
Y X Y X r r
Y X
h
+ + (
2
1
2
1
2
1
2
2
Y X r r
.
According to the LS, the solution to Eq. (5) is given by
h G G G l
T T 1
) (
=
(7)
VI. PROPOSED HYBRID TOA/AOA GEOMETRICAL
SCHEMES
In the TOA schemes, it is necessary to measure the
propagation time it took for the signal traveling between
the MS and all BSs. This time is multiplied with the
speed of light to calculate the MS-BS distance. The
distance can be used to form a circle and the MS lie on a
circle centered at the BS. A single AOA measurement
constitutes the MS along a line. The equations of the five
TOA circles and the AOA line can be expressed as
Circle 1-5:
( )
2 2 2
) (
i i i
r Y y X x = +
,
...5 2, , 1 = i (8)
Line 1: 0 tan = y x
(9)
Under the assumption of LOS propagation and there
exists no measurement error, the circles intersect one
single common point. However, it is very often that the
LOS does not exist for propagation of signals between an
MS and some fixed BSs. Therefore, the NLOS effect
could cause five circles and a line to intersect at various
points, which will be offset from the true MS location.
With NLOS propagation, the measured TOA values are
always greater than the true TOA values due to the
excess path length. The true MS location should be
inside the region enclosed by the overlap of the five
circles. The intersections that are within this are defined
as feasible intersections. The feasible intersections must
satisfy the following inequalities simultaneously:
( )
2 2 2
) (
i i i
r Y y X x +
, . ...5 2, , 1 = i
(10)
Calculating the feasible intersections of five TOA circles
and the AOA line will give the proximate location of the
MS. In order to enhance the performance of MS location
estimation with less complexity, the hybrid geometrical
positioning methods which we have proposed in [13] are
applied in five BSs. In comparison, for the cases
presented in [13], we used two AOA measurements to
eliminate the least likely intersection. Note that the
region of overlap of five circles is usually smaller than
that of two circles, it is not necessary to eliminate the
least likely intersection.
(1) Averaging Method
The simplest and most direct method of estimating the
MS location is to calculate the average value of these
feasible intersections.
Step 1. Find all the feasible intersections of the five
circles and the line.
Step 2. The MS location
) ,
N N
y x (
is estimated by
averaging these feasible intersections, where
=
=
N
i
i N
x
N
x
1
1
and
=
=
N
i
i N
y
N
y
1
1
.
(11)
(2) Distance-Weighted Method
The weights can be dynamically adjusted with
reference to the distance square between the estimated
MS location and the average MS location. The detailed
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1649
2011 ACADEMY PUBLISHER
(5) Sort-Weighted Method steps are as follows:
Steps 1-2 are the same as those of the averaging method.
Steps 1-4 are the same as those of the sort averaging
method.
Step 3. Calculate the distance between each feasible
intersection and the average location
i
d
) , (
i i
y x
) , (
N N
y x
.
Step 5. The MS location is estimated by a weighted
average of the first M feasible intersections with
weight = .
1 2
) (
i
d
N i y y x x d
N i N i i
+ = 1 , ) ( ) (
2 2
(12)
Step 4. Set the weight for the feasible intersection
to . Then the MS location is
determined by
th i
1 2
) (
i
d ) , (
d d
y x
( )
( )
1
1
2
1
1
2
=
N
i
i
N
i
i i
d
d
x d
x
and
( )
( )
=
N
i
i
N
i
i i
d
d
y d
y
1
1
2
1
1
2
.
(13)
(3) Threshold Method
In this method, the decision of each weight is based on
how close the feasible intersections are. The closer the
feasible intersections, the more weight will be assigned.
The detailed steps are as fellows
Step 1. Find all the feasible intersections of the five
circles and the line.
Step 2. Calculate the distance , ,
between any pair of feasible intersections.
mn
d 1 , m n N
Step 3. Select a threshold value as the average of
all the distances .
thr
D
1
mn
d
Step 4. Set the initial weight , , to be zero
for all feasible intersections.
If , then
I
and
k
I
=
m
N k
1 +
m
I
thr mn
D d 1 + =
n n
I I
for . 1 , m n N
Step 5. The MS location is estimated by
) , (
t t
y x
=
=
=
N
i
i
N
i
i i
t
I
x I
x
1
1
and
=
=
=
N
i
i
i
i i
t
I
y I
y
1
1
N
.
(14)
(4) Sort Averaging Method
Since some of the feasible intersections are too far
away from the averaged MS location, these feasible
intersections may not provide improved MS location
accuracy. Therefore, we proposed sort averaging method
and sort-weighted method, which does not consider the
influence of those far from feasible intersections.
Steps 1-3 are the same as those of the distance-weighted
method.
Step 4. Rank the distances in increasing order and
re-label the feasible intersections in this order.
i
d
Step 5. The MS location ) , (
M M
y x is estimated by the
mean of the first M feasible intersections.
=
=
M
i
i
M x
M
x
1
1
,
=
=
M
i
i M
y
M
y
1
1
( ) N N M = * 75 . 0
(15)
( )
( )
=
M
i
i
M
i
i i
d
x d
x
1
1
2
1
1
2
,
( )
( )
=
M
i
i
M
i
i i
d
y d
y
1
1
2
1
1
2
( N N M = * 75 . 0 )
(16)
V. SIMULATION RESULTS
Computer simulations are performed to show the
proposed methods is appropriate for location estimation.
The distance between these BSs is 3464 m and the
MS locations are uniformly distributed in the center cell,
as shown in Fig. 1. 10,000 independent trials are
performed for each simulation. Three different NLOS
propagation models were used to model the measured
ranges and angle, the circular disk of scatterers model
(CDSM) [19] [20], the biased uniform random model [16]
and the uniformly distributed noise model [19].
= d
Figure 1. Five-cell systemlayout.
The CDSM assumes that there is a disk of scatterers
around the MS and that signals traveling between the MS
and the BSs undergo a single reflection at a scatterer. The
BS1 serving a particular MS is called the serving BS
which can provides more accurate measurements. The
radius of the scatterers for BS1 is 100 m and the other
BSs were taken from 100 m to 500 m. Figure 2 shows
how the average location error is affected by radius of
the CDSM. As the radius of the scatterers increases, the
NLOS error will increase and lead to less accurate MS
location estimation. By comparing the root mean square
(RMS) error of location estimation, the proposed
methods can predict the MS location accurately. When
the NLOS errors increase, both TSA and HLOP provide
relatively poor location estimation. The proposed
methods can give a more accurate MS location and thus
reduce the RMS errors.
1650 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
100 200 300 400 500
0
50
100
150
200
250
300
Radius of scatterers (m)
R
M
S
Averaging
Distance-weighted
Sort-averaging
Sort-weighted
Threshold
TSA
HLOP
Figure 2. Effect of radius of the CDSM on the average error
performance.
The improvement in MS location accuracy using the
proposed method can be obtained in the cumulative
distribution function (CDF) curves, as illustrated in Fig.
3. The radius of the scatterers for BS1 and the other BSs
were taken to be 100m and 300m, respectively. From the
simulation results, it is clear that TSA and HLOP predict
the MS location with poor accuracy and the proposed
methods always achieve the best performance.
0 50 100 150 200 250 300 350 400
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Locationerror (m)
C
D
F
Averaging
Distance-weighted
Sort-averaging
Sort-weighted
Threshold
TSA
HLOP
Figure 3. CDFs of the location error when CDSM is used to model
the NLOS error.
The second NLOS propagation model is based on a
biased uniform random variable [16], in which the
measured error of TOA between the MS and is
assumed to be
i BS
,
i i i i
q u p + =
where and are
constants and is a uniform random variable over [0,
1]. Similarly, the measured error of AOA, is modeled as
i
p
i
q
i
u
1 1 1 1
+ = u f
, where
1
and
1
are constants. The
error variables are chosen as follows: =
1
p
3
50m,
=100 m, 150 m, =300 m,
3 2
p p =
5 . 2
1
=
=
1
q
2
q q =
, and = 2
1
. Figure 4 shows CDFs of the
location error for different algorithms. It can be observed
that the proposed methods can promote the location
precision effectively.
0 50 100 150 200 250 300 350 400
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Locationerror (m)
C
D
F
Averaging
Distance-weighted
Sort-averaging
Sort-weighted
Threshold
TSA
HLOP
Figure 4. Comparison of error CDFs when NLOS errors are modeled
as biased uniformrandomvariables.
The final NLOS propagation model is based on the
uniformly distributed noise model [19], in which the
TOA measurement error is assumed to be uniformly
distributed over , where is the upper bound
and the AOA measurement error is assumed to be
) , 0 (
i
U
i
U
1 1 1
= w f
, where is a uniformly distributed variable
over [-1, 1] [21]. The variables are chosen as follows:
1
w
=
1
U 200 m,
=
i
U
500 m, for , and ...5 3, i 2, = = 5 . 2
1
.
Figure 5 shows CDFs of the average location error of
different algorithms when the range errors were using the
uniformly distributed noise model. It can be seen that the
proposed hybrid TOA/AOA methods provide much
better location estimation as compared with other
existing algorithms.
0 50 100 150 200 250 300 350 400 450 500
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Locationerror (m)
C
D
F
Averaging
Distance-weighted
Sort-averaging
Sort-weighted
Threshold
TSA
HLOP
Figure 5. Comparison of error CDFs when NLOS errors are modeled
as the upper bound.
Figure 6 provides the RMS error as the upper bound
on uniform NLOS error increases. The upper bound for
BS1 is 200 m and the other BSs are taken from 200 m to
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1651
2011 ACADEMY PUBLISHER
700 m. As expected, it is observed that the location error
increases with the upper bound of NLOS. The proposed
methods always give better accuracy than TSA and
HLOP for the error model considered. The performance
degradation of the proposed methods is not pronounced
under harsher NLOS error conditions.
200 300 400 500 600 700
0
50
100
150
200
250
300
Upper bound onNLOS range error (m)
R
M
S
Averaging
Distance-weighted
Sort-averaging
Sort-weighted
Threshold
TSA
HLOP
Figure 6. Performance comparison of the location estimation methods
when the upper bound is used to model the NLOS.
VI. CONCLUSIONS
Based on the NLOS situation and the knowledge of
NLOS error statistics is not obtained, we proposed the
hybrid methods that utilize all the possible intersections
of five TOA circles and the AOA line to provide the
improved MS location estimation. The proposed methods
mitigate the NLOS errors by the weighted sum of the
feasible intersections of five circles and a line.
Simulation results demonstrate that the proposed
methods with different chosen weights generate more
accurate MS location estimates than the conventional
TSA and HLOP.
REFERENCES
[1] W. G. Figel, N. H. Shepherd, and W. F. Trammell,
Vehicle location by a signal attenuation method, IEEE
Trans. Veh. Technol., pp. 105109, 1969.
[2] K. J . Krizman, T. E. Biedka, and T. S. Rappaport,
Wireless position location: fundamentals, implementation
strategies, and sources of error, in Proc. IEEE Vehicular
Technology Conf., vol. 2, pp. 919923, May 1997.
[3] S. Al-J azzar, J . Caffery, and H.-R. You, A scattering
model based approach to NLOS mitigation in TOA
location systems, in Proc. IEEE Vehicular Technology
Conf., vol. 2, pp. 861865, 2002.
[4] B. T. Fang, Simple solution for hyperbolic and related
position fixes, IEEE Trans. Aerosp. Electron. Syst., vol.
26, no. 5, pp. 748753, Sep. 1990.
[5] J . J .Caffery, and G. L. Stuber, Overview of radiolocation
in CDMA cellular systems, IEEE Commun. Mag., vol. 36,
no. 4, pp. 3845, Apr.1998.
[6] T. S. Rappaport, J . H. Reed, and B. D. Woerner, Position
location using wireless communications on highways of
the future, IEEE Commun. Mag., vol. 34, no. 10, pp.
3342, Oct. 1996.
[7] R. Muhamed, and T. S. Rappaport, Comparison of
conventional subspace-based DOA estimation algorithms
with those employing property-restoral techniques:
simulation and measurements, in Proc. IEEE Int.
Universal Personal Communications Conf., vol. 2, pp.
10041008, Oct. 1996.
[8] L. Cong and W. Zhuang, Nonline-of-sight error
mitigation in mobile location, IEEE Trans. Wireless
Commun., vol. 4, no. 2, pp. 560573, Mar. 2005.
[9] M. Silventoinen and T. Rantalainen, Mobile station
emergency locating in GSM, in IEEE Int. Conf. on
Personal Wireless Communications, pp. 232238, Feb.
1996.
[10] L. Cong, and W. Zhuang, Hybrid TDOA/AOA mobile
user location for wideband CDMA cellular systems,
IEEE Trans. Wireless Commun., vol. 1, no. 3, pp. 439447,
J ul. 2002.
[11] M. A. Spirito, Mobile station location with
heterogeneous data, in IEEE Vehicular Technology Conf.,
vol. 4, pp. 15831589, Sep. 2000.
[12] N. J . Thomas, D. G. M. Cruickshank, and D. I. Laurenson,
Performance of a TDOA-AOA hybrid mobile location
system, in Int. Conf. on 3G Mobile Communication
Technologies, pp. 216220, March 2001.
[13] C.-S. Chen, S.-L. Su, and Y.-F. Huang, Hybrid
TOA/AOA geometrical positioning schemes for mobile
location, IEICE Trans. Commun., vol. E92-B, no. 2, pp.
396402, Feb. 2009.
[14] W. H. Foy, Position-location solutions by Taylor series
estimation, IEEE Trans. Aerosp. Electron. Syst., vol.
AES-12, no. 2, pp. 187193, Mar 1976.
[15] D. J . Torrieri, Statistical theory of passive location
systems, IEEE Trans. Aerosp. Electron. Syst., vol.
AES-20, vol. 2, pp. 183-197, Mar. 1984.
[16] S. Venkatraman, and J . Caffery, Hybrid TOA/AOA
techniques for mobile location in non-line-of-sight
environments, in Proc. IEEE Wireless Communications
and Networking Conf., vol. 1, pp. 274278, Mar. 2004.
[17] M. McGuire, K. N. Plataniotis, and A. N.
Venetsanopoulos, Location of mobile terminals using
time measurements and survey points, IEEE Trans. Veh.
Technol., vol. 52, no. 4, pp. 9991011, J ul. 2003.
[18] J . J . Caffery, A new approach to the geometry of TOA
location, in Proc. IEEE Vehicular Technology Conf., vol.
4, pp. 19431949, 2000.
[19] S. Venkatraman, J . Caffery, J ., and H.-R. You, A novel
TOA location algorithm using LOS range estimation for
NLOS environments, IEEE Trans. Veh. Technol., vol. 53,
no. 5, pp. 15151524, Sep. 2004.
[20] P. van Rooyen, M. Lotter, and D. van Wyk, Space-time
processing for CDMA mobile communications. New York:
Kluwer, 2000.
[21] C.-L. Chen, and K.-T. Feng, An efficient
geometry-constrained location estimation algorithm for
NLOS environments, in Proc. Int. Conf. on Wireless
Networks, Communications and Mobile Computing, vol. 1,
pp. 244249, J un. 2005.
1652 JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011
2011 ACADEMY PUBLISHER
Chien-Sheng Chen received the BS
degree in Electrical Engineering from
Feng Chia University, in 1994 and
received the MS degree in Aeronautics
and Astronautics from Chung Hau
University, in 1996. He received the
Ph.D. degree in the institute of
computer and communication
engineering, National Cheng Kung
University in 2010. He is currently
with the Tainan University of Technology. His current research
interests include mobile communication networks and wireless
location systems.
Yi-Jen Chiu received his BSc and MSc
degrees in Electronic Engineering from
Feng Chia University, Taiwan, in 1992
and from Chung Yuan Christian
University, Taiwan, in 1998,
respectively. Since 2009, he has been
pursuing the PhD degree in Institute of
Computer and Communication
Engineering, Department of Electrical
Engineering, National Cheng Kung
University and has been doing research on ultra-wideband
radio technologies, wireless communication. He is working in
the Department of Digital Entertainment and Game Design,
Taiwan Shoufu University, Tainan, Taiwan.
Ho-Nien Shou received the B.S.
degree in electrical engineering from
National Taiwan Institute of
Technology, Taipai, Taiwan, R.O.C., in
1986 and the M.S. degree from the
Department of Aeronautics and
Astronautics and the Ph.D. degree in
electrical engineering, both from
National Cheng Kung University,
Tainan, Taiwan, R.O.C., in 1990 and
2002, respectively.
From 1990 to 1991, he was with the Institute of Science and
Technology (CSIST) as an assistant researcher, working with
flight digital control system. From 1999 to 2001, he was with
the National Space Organization (NSPO) as an assistant
researcher, working with satellite attitude control system. From
2001, he was with the Department of Aviation &
Communication Electronics Air Force Institute of Technology
Assistant Professor. His main research interests include
nonlinear system control, satellite attitude control system.
Ching-Lung Chi was born in Chayi,
Taiwan, R.O.C., in 1965. He received
the B.S. and M.S. degrees from
Chung-Cheng Institute of Technology,
Taiwan, in 1988 and 1996 and the
Ph.D. degree from National Cheng
Kung University, Tainan, Taiwan, in
2006, all in electrical engineering.
Since 2008, he has been with She-Te
University, Kaohsiung, Taiwan, where he is currently an
associate Professor of computer and communication. His
research interests are in the areas of wireless communications,
and channel coding techniques.
JOURNAL OF NETWORKS, VOL. 6, NO. 11, NOVEMBER 2011 1653
2011 ACADEMY PUBLISHER
Call for Papers and Special Issues
Aims and Scope.
Journal of Networks (JNW, ISSN 1796-2056) is a scholarly peer-reviewed international scientific journal published monthly, focusing on theories,
methods, and applications in networks. It provide a high profile, leading edge forum for academic researchers, industrial professionals, engineers,
consultants, managers, educators and policy makers working in the field to contribute and disseminate innovative new work on networks.
The Journal of Networks reflects the multidisciplinary nature of communications networks. It is committed to the timely publication of high-
quality papers that advance the state-of-the-art and practical applications of communication networks. Both theoretical research contributions
(presenting new techniques, concepts, or analyses) and applied contributions (reporting on experiences and experiments with actual systems) and
tutorial expositions of permanent reference value are published. The topics covered by this journal include, but not limited to, the following topics:
Network Technologies, Services and Applications, Network Operations and Management, Network Architecture and Design
Next Generation Networks, Next Generation Mobile Networks
Communication Protocols and Theory, Signal Processing for Communications, Formal Methods in Communication Protocols
Multimedia Communications, Communications QoS
Information, Communications and Network Security, Reliability and Performance Modeling
Network Access, Error Recovery, Routing, Congestion, and Flow Control
BAN, PAN, LAN, MAN, WAN, Internet, Network Interconnections, Broadband and Very High Rate Networks,
Wireless Communications & Networking, Bluetooth, IrDA, RFID, WLAN, WMAX, 3G, Wireless Ad Hoc and Sensor Networks
Data Networks and Telephone Networks, Optical Systems and Networks, Satellite and Space Communications
Special Issue Guidelines
Special issues feature specifically aimed and targeted topics of interest contributed by authors responding to a particular Call for Papers or by
invitation, edited by guest editor(s). We encourage you to submit proposals for creating special issues in areas that are of interest to the Journal.
Preference will be given to proposals that cover some unique aspect of the technology and ones that include subjects that are timely and useful to the
readers of the Journal. A Special Issue is typically made of 10 to 15 papers, with each paper 8 to 12 pages of length.
The following information should be included as part of the proposal:
Proposed title for the Special Issue
Description of the topic area to be focused upon and justification
Review process for the selection and rejection of papers.
Name, contact, position, affiliation, and biography of the Guest Editor(s)
List of potential reviewers
Potential authors to the issue
Tentative time-table for the call for papers and reviews
If a proposal is accepted, the guest editor will be responsible for:
Preparing the Call for Papers to be included on the Journals Web site.
Distribution of the Call for Papers broadly to various mailing lists and sites.
Getting submissions, arranging review process, making decisions, and carrying out all correspondence with the authors. Authors should be
informed the Instructions for Authors.
Providing us the completed and approved final versions of the papers formatted in the Journals style, together with all authors contact
information.
Writing a one- or two-page introductory editorial to be published in the Special Issue.
Special Issue for a Conference/Workshop
A special issue for a Conference/Workshop is usually released in association with the committee members of the Conference/Workshop like
general chairs and/or program chairs who are appointed as the Guest Editors of the Special Issue. Special Issue for a Conference/Workshop is
typically made of 10 to 15 papers, with each paper 8 to 12 pages of length.
Guest Editors are involved in the following steps in guest-editing a Special Issue based on a Conference/Workshop:
Selecting a Title for the Special Issue, e.g. Special Issue: Selected Best Papers of XYZ Conference.
Sending us a formal Letter of Intent for the Special Issue.
Creating a Call for Papers for the Special Issue, posting it on the conference web site, and publicizing it to the conference attendees.
Information about the Journal and Academy Publisher can be included in the Call for Papers.
Establishing criteria for paper selection/rejections. The papers can be nominated based on multiple criteria, e.g. rank in review process plus
the evaluation from the Session Chairs and the feedback from the Conference attendees.
Selecting and inviting submissions, arranging review process, making decisions, and carrying out all correspondence with the authors.
Authors should be informed the Author Instructions. Usually, the Proceedings manuscripts should be expanded and enhanced.
Providing us the completed and approved final versions of the papers formatted in the Journals style, together with all authors contact
information.
Writing a one- or two-page introductory editorial to be published in the Special Issue.
More information is available on the web site at http://www.academypublisher.com/jnw/.
(Contents Continued from Back Cover)
Expectation Value Calculation of Grid QoS Parameters Based on Algorithm Prim
Kaijian Liang, Linfeng Bai, and Xilong Qu
Web Page Classification using an Ensemble of Support Vector Machine Classifiers
Shaobo Zhong and Dongsheng Zou
Integration of Unascertained Method with Neural Networks and Its Application
Huawang Shi
Researches on Grid Security Authentication Algorithm in Cloud Computing
Keshou Wu, Lizhao Liu, Jian Liu, Weifeng Li, Gang Xie, Xiaona Tong, and Yun Lin
Non-line-of-sight Error Mitigation in Wireless Communication Systems
Chien-Sheng Chen, Yi-Jen Chiu, Ho-Nien Shou, and Ching-Lung Chi
1618
1625
1631
1639
1647