Académique Documents
Professionnel Documents
Culture Documents
Computer Networks
journal homepage: www.elsevier.com/locate/comnet
a r t i c l e
i n f o
Article history:
Received 8 March 2015
Revised 15 July 2015
Accepted 27 August 2015
Available online 4 September 2015
Keywords:
Cognitive radio
Clustering
Routing
Reinforcement learning
Cluster merging
Cluster splitting
a b s t r a c t
Cognitive radio (CR) is the next-generation wireless communication system that allows unlicensed users (or secondary users, SUs) to exploit the underutilized spectrum (or white spaces)
in licensed spectrum while minimizing interference to licensed users (or primary users, PUs).
This article proposes a SpectruM-Aware clusteR-based rouTing (SMART) scheme that enables
SUs to form clusters in a cognitive radio network (CRN) and enables each SU source node to
search for a route to its destination node on the clustered network. An intrinsic characteristic of CRNs is the dynamicity of operating environment in which network conditions (i.e., PUs
activities) change as time goes by. Based on the network conditions, SMART enables SUs to adjust the number of common channels in a cluster through cluster merging and splitting, and
searches for a route on the clustered network using an articial intelligence approach called
reinforcement learning. Simulation results show that SMART selects stable routes and significantly reduces interference to PUs, as well as routing overhead in terms of route discovery
frequency, without signicant degradation of throughput and end-to-end delay.
2015 Elsevier B.V. All rights reserved.
1. Introduction
Cognitive radio (CR) [1] is the next-generation wireless
communication system that enables unlicensed or secondary
users (SUs) to explore and use underutilized licensed spectrum (or white spaces) owned by licensed or primary users
(PUs) in order to improve the overall spectrum utilization
without causing unacceptable interference to PUs. During the
SUs communication, if a PU re-appears on a SUs channel, the
SU must vacate the channel and switch to another available
http://dx.doi.org/10.1016/j.comnet.2015.08.019
1389-1286/ 2015 Elsevier B.V. All rights reserved.
197
198
199
Fig. 2. A cluster structure after cluster C1 in Fig. 1 undergoes cluster splitting. Cluster C1 and the newly-formed cluster C4 either use existing route or search for
new routes to destination SU BS.
ratio (SINR) and expected transmission time (ETT)). This approach has been shown to achieve higher packet delivery
ratio and throughput, as well as lower delay and routing
overhead.
SMART formulates and solves the cluster-based routing scheme using RL, and it provides further enhancements
to [6,7] in two main aspects, particularly cluster size adjustment and cluster maintenance. Cluster size adjustment
changes the cluster size based on the requirements on the
number of common channels in a cluster; while cluster
maintenance is comprised of cluster merging and splitting.
3.2. Reinforcement learning in CRNs
Reinforcement learning has been widely used in CRNs for
different applications, such as the detection of spectral resources [24], solving the problems of interference to PUs [25],
auction of dynamic spectrum access [26], power control [27],
routing [16,17], etc.
Galindo-Serrano and Giupponi [25] solve the problem of
interference to PUs by SUs using a multiagent RL approach,
known as decentralized Q-learning. An IEEE 802.22 standardbased CRN is considered by representing SU base stations as
multiple agents. A scenario with partial information about
the operating environment is considered in which the proposed system learns near-optimal policy which has lower
convergence rate; while providing the implementation benets of deployment and feasibility. RL has been shown to improve the outage probability.
Teng et al. [26] apply RL in an auction-based dynamic
spectrum access scheme in a CRN with one PU and multiple
SUs. SUs use an RL approach called Q-learning to determine
their respective bidding prices intelligently and dynamically,
and compete for spectrum access. The bidding price for different channels are different based on the current and future
expected packet transmissions to secure the spectrum opportunities. The PU releases the unused channels to SUs with the
maximal bids. RL has been shown to improve bidding eciency and reduced packet loss.
Zhou et al. [27] design a distributed power control algorithm for CRNs using RL with low implementation complexity. The proposed algorithm does not require power and
interference information of SUs. SUs control their transmission power by observing and learning the interference level
based on the transmission rates and feedback signals of PUs,
200
4. System model
We consider a distributed ad hoc CRN consisting of static
SUs and PUs. A SU can be any device equipped with CR functionality so that it can switch to any of the available channels. Each SU must minimize interference to PUs, and so
it performs CR functions, including channel sensing, channel selection, channel sharing and channel hand-off in a
distributed manner. The CRN is comprised of SU n N =
{1, 2, 3, . . . , |N|}, PU j J = {1, 2, 3, . . . , |J|}, and channel
k K = {1, 2, 3, . . . , |K |} where |N|, |J| and |K| represent the
number of SUs, PUs and channels, respectively. Jk J represents a set of PUs j Jk that use channel k.
The rest of this section presents our CRN architecture (see
Fig. 3) comprised of internal and external environments. We
incorporate CR functions into QualNet which is lack of CR environment. The CRN architecture allows this research to focus on cluster-based routing, rather than the underlying CR
functions, such as channel sensing and channel sharing.
4.1. Internal environment
The components in the internal environment can be categorized into three main layers (i.e., network, data link and
physical layers) and a cross-layer repository. The network
layer includes channel decision and routing protocol. The
data link layer includes channel sensing and channel sharing.
The physical layer includes channel hand-off.
Network layer. The network layer contains a cluster-based
routing protocol called SMART, which is a joint channel decision and route selection protocol (see Section 6). Channel decision module receives information about white spaces from
201
Table 1
Exponential distribution parameter values for |K | = 10 licensed channels.
Rate
k
ON, j
k
OFF, j
Channel
1
10
1.25
0.67
0.4
2
1
1
0.4
0.33
0.5
1
2
0.29
1
0.25
0.18
2
0.5
1.33
0.67
0.5
Table 2
Types of PUs activities.
PU activity
ON Period
OFF Period
kON, j
kON, j
Channels in Table 1
Long-term
High
Low
Intermittent
Long
Long
Short
Short
Long
Short
Long
Short
1
1
>1
>1
1
>1
1
>1
3, 4, 5, 7, 10
2, 8, 9
1, 6
ponentially distributed with rates kON, j and kOFF, j , respectively [28,29]. We assume that all PUs activities in a channel
k and T k , and each PU j uses a single
k are represented by TON
OFF
channel k only [3032].
In [32], the values of rates kON, j and kOFF, j for |K | = 10 licensed channels have been measured by collecting samples
of channel state transitions, and they are shown in Table 1.
There are four kinds of PUs activities with different rate
values kON, j and kOFF, j [33], namely, long-term, high, low
and intermittent PUs activities. These PUs activities are presented in Table 2, as well as investigated in Section 8.3.1.
Channel quality. Channels in CRNs are heterogeneous
in nature depending on the amount of white spaces
kt =
kON, j
kOFF, j
k
k
t
+
e(ON, j +OFF, j )
kON, j + kOFF, j kON, j + kOFF, j
(1)
202
Description
CHinfo
CHinfo.CH
CHinfo.merge
CHinfo.split
CH
nodeID
nodeState
clusterID
listChannels
listNodes
commonChannels
clusterSize
masterChannel
backupChannel
Table 4
Notations of messages used in clustering mechanisms.
Category
Cluster joining
Cluster merging
Notation
Description
JREQ i, j
JACCi, j
JDECi, j
JRECi, j, k
GREQ i, j, k
GACCi, j, k
GDECi, j, k
MREQ i, j, k
MACCi, j, k
MDECi, j, k
MCANi, j, k
RELj, k
Timers
LREQ i, j, k
LACCi, j, k
LINFi, j, k
Tw,scan
Tw,res
Tw,CHE
and split (split = 1 indicates cluster splitting is initiated) in clustering message CHinfo, other clustering information is included so that the clusterhead can calculate clustering metric and make decision on cluster merging, cluster
splitting, the selection of a new clusterhead, and the relinquishment of an existing clusterhead. Note that, only clusterheads can set values for CH, merge and split, although
all nodes (including the clusterheads) include the rest of the
information in CHinfo. As an example, an existing clusterhead indicates the node ID of a new clusterhead (or the new
cluster ID) using CHinfo.CH. As another example, the clustering information CHinfo.merge = 1 indicates that the
clusterhead initiates cluster merging. The packet structure of
CHinfo is presented in Fig. 4.
Notations used in clustering with their descriptions are
summarized in Table 3. A summary of messages, including
notations and descriptions, used to coordinate clusters are
presented in Table 4. Table 5 describes the notations used in
clustering algorithms.
203
Table 5
Notations used in clustering algorithms.
Notation
Description
NNi
CHj
MNi, j
GNi, j, k
RNi, j
NCHinfo.nodeState=CH
MCCHi
NBi
NNlNBi
NNNlNB
i
NNCi ,k
Non-clustered node i
Clusterhead j
Member node i of clusterhead j
Gateway node i of clusterhead j that provides inter-cluster connectivity between its own clusterhead and neighboring clusterhead k
Relay node i that provides intra-cluster connectivity between member node j and its own clusterhead
Number of clustering message CHinfo received from clusterheads
Master channel of clusterhead i
SU neighbor nodes of SU i
Non-clustered SU neighbor nodes of SU i
Number of non-clustered SU neighbor nodes of SU i
Number of nodes that belongs to cluster Ci , having channel k in their list of available channels
Rank of clusterhead j
Rank of channel k
Number of common channels in a cluster Cj
Number of common channels among clusters Ci and Cj
Threshold for the minimum number of common channels in a cluster
Cluster merging threshold for the minimum number of common channels required for cluster merging
Channel capacity for channel k
Number of available channels of SU i
Number of hops between SU i and neighboring cluster l
t
CH,
j
t
chan,k
nc,C j
nc,Ci, j
HC,min
HC,merge
kt
NlistChannelsi
Hi,Cl
Fig. 4.
204
/* Part I: Scan each available channel in order to receive clustering message CHinfo */
while listChannelsi do
Scan each available channel k for Tw,scan duration;
if receive CHinfo then
Store CHinfo;
end if
end while
/* Part II: Process CHinfo received from clusterhead(s) */
if NCHInfo.nodestate=CH = 1 then
Send JREQi, j to CH j ;
else if NCHInfo.nodestate=CH > 1 then
for k in CHinfo.nodeID.masterChannel do
Calculate kt using Eq.(1)
end for
t
t
t
t
Update CH,
such that CH,
> CH,lNB
if MC
>
jNB
j
i
t ;
MC
16:
do
17:
18:
19:
20:
21:
22:
23:
24:
25:
t
t
Send JREQi, j to CH j |CH,
> CH,lNB
;
j
i
Wait Tw,res ;
if receive JACCi, j from CH j then
nodeStatei MNi, j ;
break;
end if
end while
/* Part III: Process CHinfo received from non-clustered
node(s) */
else if NNNlNB = 0 or NlistChannelsi > NlistChannels jNN
i
26:
27:
28:
29:
30:
31:
32:
33:
34:
35:
36:
37:
38:
39:
40:
41:
lNBi
then
nodeStatei CHi ;
for k in listChannelsi do
Calculate kt using Eq.(1)
end for
t
t
t
Update chan,k
such that chan,k
> chan,m
if kt >
t |k listChannels and m listChannels ;
m
i
i
t
masterChannel = k|chan,k
= 1;
t
backupChannel = k|chan,k
= 2;
Broadcast CHinfo;
else
Wait Tw,CHE ;
if receive CHinfo from CH j then
Send JREQi, j to CH j ;
else
Run Algorithm 1(a);
end if
end if
non-clustered state. It starts to form a cluster with nonclustered SU neighbor nodes NNlNBi .
There are two circumstances. First, there is lack of nonclustered SU neighbor nodes of SU i (i.e., NNNlNB = 0), and
i
t
channel with the second highest rank chan,k
= 2; and subsequently broadcasts this information using clustering message
CHinfo. However, if a SU i does not have the highest clustering metric among its non-clustered SU neighbor nodes, it sets
a timer Tw,CHE to allow non-clustered SU neighbor nodes with
the highest clustering metric among the respective neighborhood to become clusterhead and joins the cluster with the
highest rank. Note that, if a SU does not receive any clustering message CHinfo from any clusterhead upon the expiration of the timer Tw,CHE , it starts another round of process for
non-clustered node.
The packet exchange for node joining and clusterhead
election in cluster formation are presented in Fig. 6.
2:
3:
4:
5:
6:
7:
8:
9:
while listChannelsMNi, j do
Scan each channel k for Tw,scan duration;
if receive CHinfo from Cl then
Send GREQi, j,l to clusterhead CH j ;
end if
end while
if receive GACCi, j,l from clusterhead CH j then
nodeStateMNi, j GNi, j,l ;
end if
205
Fig. 6. Packet exchange for (a) node joining and (b) clusterhead election.
11:
12:
13:
14:
15:
16:
m, j,l
then
206
4: end if
5: /* Part II: Gateway node performs or cancels cluster merging */
6: if receive MACCi, j,l and MACCi,l, j from CH j and CHl , respec-
the neighboring cluster Cl as compared to an existing gateway node GNm, j, l (i.e., HMNi, j ,Cl < HGNm, j,l ,Cl ). If there are potential and existing gateway nodes with similar number of
hops leading to the clusterhead of the neighboring cluster,
the member node MNi, j with the highest number of available
channels NlistChannelsMN is selected. Otherwise, the clusteri, j
head CHj declines the request by sending gateway role decline message (GDECi, j, l ) to its member node MNi, j .
Next, we present an example of the rst case in which
adjacent clusterheads communicate with each other in two
hops using a single gateway node. Fig. 7 shows that a clusterhead CH1 in cluster C1 sends a data packet to a neighboring cluster C2 operating on a different master channel.
First, clusterhead CH1 forwards data packets to its gateway
node GN1,1, 2 using its master channel. Second, gateway node
GN1,1, 2 switches to the master channel of neighboring cluster C2 and forwards data packets to clusterhead CH2 . Third,
upon completion of data packets transmissions, gateway
node GN1,1, 2 switches back to the master channel of its own
cluster C1 . Suppose, clusterhead CH2 wants to send data packets to clusterhead CH1 . It cannot forward data packets to gateway node GN1,1, 2 unless it rst switches to the master channel of cluster C1 . However, clusterhead CH2 cannot switch
from the master channel of its own cluster. Hence, clusterhead CH2 selects gateway node GN1,2,1 , which can switch its
operating channel to the master channel of cluster C1 , in order to forward data packets to clusterhead CH1 .
Next, we present an example of the second case in which
adjacent clusterheads communicate with each other in more
than two hops using more than a single gateway node. A
set of gateway nodes connecting two clusterheads is called
joint gateway nodes [35]. Fig. 7 illustrates gateway nodes (i.e.,
GN1,1, 2 and GN1,2,1 ) in the rst case and a set of joint gateway
nodes (i.e., GN2,2,3 and GN1,3,2 ) in the second case. Using joint
gateway nodes, a two-way inter-cluster communication uses
a similar set of gateway nodes.
5.5. Cluster merging
Cluster merging is the process of combining two clusters
into one. In SMART, cluster merging is only possible when the
number of common channels nc,Ci, j between clusters i and j
satises a threshold for cluster merging HC,merge , specically
the minimum number of common channels in a merged cluster, for cluster stability (see C.2 in Section 1).
7:
8:
9:
10:
11:
12:
13:
14:
15:
16:
17:
18:
19:
20:
21:
tively then
Create Cm ;
clusterIDm i;
CHm GNi, j,l ;
nodeStatei CHi ;
for k in listChannelsi do
Calculate kt using Eq.(1)
end for
t
t
t
t
Update chan,k
such that chan,k
> chan,m
if kt > m
| k
listChannelsi , m listChannelsi ;
t
masterChannel = k | chan,k
= 1;
t
backupChannel = k | chan,k
= 2;
Broadcast CHinfo;
Send REL j,m and RELl,m to CH j and CHl , respectively;
else if receive MDECi, j,l from CH j or MDECi,l, j from CHl then
Send MCANi, j,l to CH j and CHl ;
end if
5.5.1. Procedure
Algorithms 3(a), 3(b) and 3(c) present the cluster merging
procedure for cluster Cj at gateway node GNi, j, l , clusterhead
CHj and member node MNi, j , respectively.
In part I of Algorithm 3(a), a gateway node GNi, j, l is aware
of a set of common channels nc,C j,l between its cluster Cj
and neighboring cluster Cl to which it is connected to. Thus,
whenever a gateway node GNi, j, l discovers a potential cluster merging activity (i.e., nc,C j,l HC,merge ), it informs both
clusterheads CHj and CHl by sending cluster merging request
13:
14:
15:
16:
17:
/* Part I: Member node receives new cluster joining recommendation from existing clusterhead*/
if receive JRECi, j,m from CH j then
Send JREQi,m to CHm ;
if not receive JACCi,m within Tw,res from CHm then
Send LREQi, j,m to CH j ;
else
nodeStatei MNi,m ;
ClusterIDi m;
CHi CHm ;
end if
end if
/* Part II: Relinquished clusterhead (i.e., currently member node MN j,m ) receives relay request from member node
MNi,m */
if receive LREQi, j,m from MNi,m then
nodeState j RN j,m ;
Send LINFi, j,m to CHm for MNi,m ;
Send LACCi, j,m to MNi,m ;
end if
message (MREQ i, j, l ). In part I of Algorithm 3(b), a clusterhead CHj may accept the cluster merging request by sending
cluster merging acceptance message (MACCi, j, l ) to GNi, j, l to
merge clusters. This occurs when CHj is not undergoing cluster merging with another cluster (i.e., MREQ, j, = ). Subsequently, in part II of Algorithm 3(a), a gateway node GNi, j, l
receives MACCi, j, l and MACCi, l, j from both clusterheads CHj
and CHl respectively, and becomes the clusterhead CHm of
the newly-formed cluster Cm . After becoming a clusterhead,
it selects master and backup channels, broadcasts CHinfo
and sends clusterhead role relinquishment messages RELj, m
and RELl, m to CHj and CHl respectively, so that they relinquish their clusterhead role and become its member nodes.
Note that, one of the clusterheads, say clusterhead CHj , may
not agree to merge and send cluster merging decline message (MDECi, j, l ) to GNi, j, l . In this case, as shown in part II
of Algorithm 3(b), the gateway node GNi, j, l informs both
clusterheads about the cancellation of the cluster merging
procedure by sending cluster merging cancellation message
(MCANi, j, l ). This allows both clusterheads to accept cluster
merging requests from other gateway nodes by ignoring the
current cluster merging procedure (i.e., MREQi, j,l ).
In part III of Algorithm 3(b), upon receiving clusterhead
relinquishment message RELj, m by existing clusterhead CHj
from new clusterhead CHm , the clusterhead CHj informs its
member nodes MN, j to join the new clusterhead CHm on the
new master channel by sending cluster joining recommendation message JRECi, j, m i MN, j , marks itself as a member node of the new clusterhead CHm (i.e., nodeStatej
MNj, m ), and sets its clusterhead being the new one (i.e., CHj
CHm ).
In part I of Algorithm 3(c), there are two circumstances
when a member node receives cluster joining recommendation message. First, a member node is located in the transmission range of the new clusterhead CHm , and so after receiving cluster joining recommendation message JRECi, j, m
207
t
t
node degree (i.e., chan,k
> chan,l
if NNC j ,k > NNC j ,l | k
listChannelsC j , l listChannelsC j k). Then, it selects a certain number of most favorable channels as common channels
of the new cluster Ck . The number of selected channels must
satisfy the threshold for the minimum number of common
208
t
t
Rank channels chan,k
> chan,l
if NNC j ,k > NNC j ,l | k
4:
listChannelsC j , l listChannelsC j k;
5:
/* Create a new cluster Ck */
6:
commonChannelsCk N t
= 1, 2, . . . , HC,min ;
chan,k
Identify
listNodesCk |commonChannelsCk
listChannelsMNi, j , i listNodesC j ;
8:
Create cluster Ck listNodesCk ;
9:
/* Part II: Create second cluster Cl */
10:
listNodesCl listNodesC j listNodesCk ;
7:
11:
clusterSizeC
commonChannelsCl i=1
| NcommonChannelsC = HC,min ;
listChannelsMNi,l
12:
| j listNodesCk MNi,k ;
i,k
15:
i,l
16:
17:
18:
19:
20:
21:
22:
23:
24:
> NlistChannelsMN
j listNodesCl MNi,k ;
Broadcast clustering message CHinfo;
if not (CH j = CHk and CH j = CHl ) then
if CH j nodeListCk then
nodeStateCH j MNi,k ;
else if CH j nodeListCl then
nodeStateCH j MNi,l ;
end if
end if
end if
j,k
j,l
l
listChannelsMNi,l ). The number of common chani=1
nels must equal to the threshold for the minimum number of
common channels in a cluster (i.e., NcommonChannelsC = HC,min ).
l
209
the bottleneck channel of a route. To the best of our knowledge, the application of RL to cluster-based routing in CRNs
is novel in its approach.
In SMART, the Q-routing model is embedded in each SU
because it stands an equal opportunity to serve as a clusterhead. The routing decision (i.e., SU next-hop neighbor node
selection) is made by the clusterhead of a SU source node and
the intermediate clusterheads based on channel selection
performed by clustering. The important representations in
the RL model for an agent are state, action and reward. State st
represents the decision-making factors, which affect the reward (or network performance), observed by an agent from
the operating environment. Action ati represents an agent is
action, which may change or affect the state (or operating
environment) and reward (or network performance), and so
the agent learns to take optimal actions at most of the times.
Reward rit+1 (st+1 ) represents the positive or negative effects
received at time t + 1 of an agents action on its operating
environment taken at time t. At decision epoch t, an agent i
observes its operating environment to determine its current
state st . Based on the state st , the agent chooses an action ati .
Next, at decision epoch t + 1, the state st changes to st+1 as a
consequence of the action ati , and the agent i receives reward
rit+1 (st+1 ).
In SMART, the state st represents the SU destination node
and the action ati represents SU is next-hop neighbor node
that relays packets toward SU destination node st . At time t,
a clusterhead i estimates the Q-value Qit (st , ati ) for each SU
neighbor node ati , which indicates the OFF-state probability of the bottleneck channel along a route and updates its
routing table of Q-values as shown in Table 6. The bottleneck
channel is the channel of a link having the least OFF-state
probability for the next time slot along a route. Using the
OFF-state probability of channels helps to reduce the effects
of the dynamicity of channel availability in CRNs by selecting
stable routes that have channels with higher OFF-state probability in order to maximize the utilization of white spaces
(see C.1 in Section 1). Each column in Table 6 shows a SU
neighbor node of the SU source node i, while each row shows
a destination node st . Each cell represents the Q-value of a
next-hop neighbor node ati selected by SU source node i to
reach the SU destination node st . A clusterhead calculates
Q-value for each SU neighbor node, while SU intermediate node calculates the OFF-state probability of the bottleneck channel from itself toward the destination node st and
210
SU destination node st
1
2
3
n
Qit (2, 1)
Qit (3, 1)
Qit (3, 1)
forwards this probability to an upstream node in RREP message. A clusterhead i makes route selection by selecting a
SU next-hop neighbor node ati with the maximum Q-value
Qit (st , ati ) from the routing table of Q-values for destination
node st . Upon sending RREQ to SU neighbor node ati , a SU i
receives the OFF-state probability of the bottleneck channel
for the route from SU neighbor node ati to destination node
st through RREP. A SU i updates its Q-value Qit (st , ati ) at time
t + 1 for destination node st via next-hop ati = j as follows:
min rit+1 ( j), maxt Q tj (st , k)
ka j
(2)
2
Qit (1, 2)
Qit (3, 2)
Qit (n, 2)
3
Qit (1, 3)
Qit (2, 3)
Qit (n, 3)
m
Qit (1, m)
Qit (2, m)
Qit (3, m)
Qit (n, m)
211
tion node BS, which is represented by . Subsequently, it embeds this Q-value in the RREP message and forwards it to
the upstream clusterhead CH2 via its gateway node GN1,4,2 .
When clusterhead CH2 receives the RREP message from clusterhead CH4 through its gateway node GN2,2,4 , it compares
the OFF-state probability provided by clusterhead CH4 with
OFF-state probability of the operating channel between the
pair of gateway nodes GN2,2,4 and GN1,4,2 for reaching clusterhead CH4 , and nds that this communication channel has
lower OFF-state probability (i.e., 4t = 0.3 < 5t = 0.4), theret
(BS,CH4 ) = 0.3), embeds
fore it updates the Q-value (i.e., QCH
2
it in the RREP message and forwards it to its upstream node
(i.e., SU source node CH1 ) via gateway node GN1,2,1 .
When SU source node CH1 receives the RREP message from clusterhead CH2 , it updates its routing table of
Q-values. Since this is the rst attempt of route discovt1
(BS,CH2 ) = 0.
ery, the Q-value has been initialized QCH
1
The OFF-state probability of the operating channel between SU source node CH1 and clusterhead CH2 is 2t =
t
(CH2 ) = 0.5. Hence, using Eq. (2) with
0.5, therefore rCH
1
212
213
Table 7
Summary of SMART overhead.
Overhead type
Message overhead
Time complexity
DNrCHinfo
[1, 2dch + 1]
2
2m + 11
6
nH + HG + I + nI
D
[Tscan + 1, Tscan + dch + 1]
2
7
3
nH + HG + I + nI
node, it generates RREP message and forwards it on the reverse route that the RREQ message has traversed. We denote
the number of intermediate clusters from a SU source node to
a SU destination node involved in a RREQ as I. So, the whole
process of RREP propagation takes M = I + nI messages and
T = I + nI time steps. Here, I represents the number of RREP
messages forwarded by gateway nodes to their neighboring clusters and nI represents the number of RREP messages
sent from clusterheads to their gateway nodes which are n
hops away. To sum up, the message overhead of routing is
M = nH + HG + I + nI messages and the time complexity is
T = nH + HG + I + nI time steps.
7.7. Total overhead
We summarize our analysis in Table 7. The total SMART
overhead is based on the six procedures (see Sections 7.1
7.6). The lower bound of the total SMART overhead for message overhead is M = DNrCHinfo + 2m + nH + HG + I + nI +
20 messages and time complexity is T = Tscan + D + nH +
HG + I + nI + 13 time steps. The upper bound of the total
SMART overhead for message overhead is M = DNrCHinfo +
2dch + 2m + nH + HG + I + nI + 20 messages and time complexity is T = Tscan + dch + D + nH + HG + I + nI + 13 time
steps.
214
Parameter
Value
SU
Number of SUs
Transmission range
15
250 m
PU
Number of PUs
Transmission range
10
250 m
Channels
Network
Area
Placement of nodes
Active connections
Trac data type
Packet size
Total number of packets
Simulation time for each run
Number of simulation runs
Threshold for the minimum number of common channels HC,min
Threshold for the cluster merging HC,merge
1000 m 1000 m
Random
1
UDP-CBR
512 bytes
500
550 s
100
2
4
to prevent network instability, specically the size of a cluster ip-ops causing high occurrence of cluster merging and
splitting. Whenever a master channel is re-occupied by PUs
activities, all member nodes and the clusterhead in a cluster switch to a backup channel. The SU learning rate is set
to 0.5 in order to achieve a balance between recent and estimated values. The simulation parameters are presented in
Table 8.
Each simulation run has a xed SU source and SU destination nodes. Since the focus of our work is on the network
layer, we assume a perfect channel sensing [40], scheduling
and synchronization capabilities among SUs, as well as noiseless channels. For control information exchange, we assume
there is an out-of-band common control channel which is
free from PUs activities at all times for this purpose. PUs
activities module (see Fig. 3) generates exponentially distributed ON and OFF periods for each channel [32,34].
neighboring clusters. In this manner, SMART solves the problem associated with broadcasting using a single transceiver
in CRNs (see C.3 in Section 1). Second, in SA-AODV, route decision is made by the destination node, while in SMART, route
decision (i.e., next-hop clusterhead selection) is made by the
clusterhead of a SU source node based on Q-values. Third, SAAODV is a single-path routing protocol while SMART is a multipath routing protocol. Fourth, SA-AODV is a non-clusterbased routing protocol; while SMART is a cluster-based routing protocol.
The performance metrics for SMART are as follows:
SUPU interference ratio is the ratio of the total number
of SUs packets interfered with PUs activities to the total
number of packets sent by a SU source node. Whenever
a SUs packet interferes with a PUs packet, both PUs and
SUs packets are lost, and so lower SUPU interference ratio is favorable.
Route discovery frequency is the number of route discovery
(or RREQ) messages generated by a SU source node, and
so lower route discovery frequency is favorable because
it indicates greater stability of routes and lower routing
overhead.
Throughput is dened as the total number of bits received
per second by a SU destination node. Higher throughput
is favorable.
End-to-end delay is calculated by dividing the total delay
of all packets received at a SU destination node by the total number of packets sent from a SU source node. Lower
end-to-end delay is favorable.
8.3. Performance evaluation
This section compares the performance of SMART with
SMART-NO-MNT and SA-AODV under varying effects of PUs
activities, number of channels and number of PUs.
8.3.1. Effects of PUs activities
SUs exploit white spaces of licensed channels which may
have different types of PUs activities. This section investigates the effects of different types of PUs activities (see
215
216
frequency. SA-AODV achieves similar route discovery frequency as SMART-NO-MNT except at intermittent PU
activity in which route discovery frequency by SA-AODV
is signicantly higher. There are two main reasons. First,
SA-AODV selects channels randomly, therefore, the selected
routes may not have channels with higher OFF-state probability, and so there is a higher chance that routes get affected
more frequently by PUs activities. Second, SA-AODV maintains only one route toward SU destination node. Since PUs
constantly appear and disappear on very short durations in
intermittent PU activity, the established routes in SA-AODV
get affected more frequently by PUs activities.
Fig. 13 shows throughput under different types of PUs
activities for the three schemes. When PU activity level is
zero (or no PUs activities), SMART and SA-AODV achieve
similar throughput. When PU activity level is low, SMART
achieves higher throughput as compared to SA-AODV. This
is because the effect of low PU activity level is not signi-
217
Table 9
Threshold values for varying number of
channels.
Threshold
HC,min
HC,merge
Number of channels
2
10
1
2
2
3
4
5
6
7
8
9
effects of varying number of channels on network performance of SMART, SMART-NO-MNT and SA-AODV. For a varying number of channels, each channel contains an equal
number of PUs, and there are two thresholds, namely the
minimum number of common channels in a cluster (HC,min ),
which is the threshold for cluster formation and splitting,
and the minimum number of common channels in a cluster
required to initiate cluster merging (HC,merge ). These thresholds values are presented in Table 9. Note that, for a certain number of channels, the value HC,min is one less than
HC,merge to avoid frequent cluster merging and splitting because if HC,min = HC,merge , cluster merging and splitting may
occur even when a PU re-appears in a single common channel in a cluster. Furthermore, the values of threshold HC,min
and HC,merge increase with the increasing number of channels in a network so that SUs can exploit the availability of
multiple channels in the network. This enhances cluster stability by maintaining higher number of common channels in
a cluster (see C.2 in Section 1).
Fig. 15 shows SUPU interference ratio by varying the
number of channels in the network for the three schemes.
When there are two channels in the network, SMART
achieves lower SUPU interference ratio as compared to
other number of channels because of lower number of transmission, specically, there is a higher chance that a SU does
not nd an available channel for transmission. When there
are 4, 6, 8 and 10 channels in the network, a SU has sucient number of channels for transmissions, and so SMART
causes slightly higher SUPU interference as compared to
the case of two channels in the network due to the higher
number of transmissions. However, SMART achieves lower
218
219
220
signicantly lower end-to-end delay for the increasing number of channels as compared to SMART and SMART-NO-MNT.
This is because SA-AODV is a non-clustered scheme; therefore, it does not incur delay caused by clustering mechanisms. SMART causes higher end-to-end delay as compared
to SMART-NO-MNT. This is because SMART-NO-MNT does
not perform cluster maintenance, therefore, it drops higher
number of packets due to the lack of either intra-cluster connectivity or a route toward SU destination node. On the other
hand, SMART performs cluster maintenance to reduce the
dynamic effects of network, therefore, it is able to nd a route
toward SU destination node most of the times and transmit
higher number of packets. Therefore, higher number of transmissions and cluster maintenance cause higher end-to-end
delay.
In summary, this section investigates the performance
of SMART for varying number of channels in the network.
SMART outperforms SMART-NO-MNT and SA-AODV for vary-
ing number of channels and achieves lower SUPU interference ratio of upto 81% as compared to SA-AODV and lower
route discovery frequency of upto 21% and 48% as compared
to SMART-NO-MNT and SA-AODV, respectively, without signicant degradation of throughput and end-to-end delay.
However, when there are two channels in the network, the
performance enhancement brought about by SMART is not
substantial due to lower number of channels and frequent
cluster maintenance. Therefore, cluster maintenance is not
encouraged for lower number of channels.
8.3.3. Effects of number of PUs
SUs explore and exploit the channels of PUs; and a number of PUs may operate on similar channels at different geographic locations. This section investigates the effects of
the number of PUs on the network performance of SMART,
SMART-NO-MNT and SA-AODV. We consider there are a total of ve channels in the network, and so PUs are equally
221
number of RREQs with the increasing number of PUs. SAAODV causes signicantly higher route discovery frequency
with the increasing number of PUs. There are two main reasons. First, SA-AODV does not select stable routes that have
channels with higher OFF-state probability, and so there is a
higher chance that routes get affected more frequently due
to PUs activities. Second, it maintains a single route toward a
SU destination node, and so it initiates RREQ message every
time a route gets affected due to PUs activities.
Fig. 21 shows throughput by varying the number of PUs
in the network for the three schemes. SMART and SA-AODV
achieve lower throughput with the increasing number of
PUs. This is because, with the increasing number of PUs, there
is a higher chance that SMART cannot nd a route due to
higher level of PUs activities, and so higher number of packets are dropped. The reason of lower throughput achieved
by SA-AODV with the increasing number of PUs is that, SAAODV causes higher SUPU interference with the increasing
number of PUs in the network, therefore, packet loss in SAAODV is caused by interference with PUs. SMART-NO-MNT
achieves nearly equal throughput for varying number of PUs.
This is because SMART-NO-MNT does not perform cluster
maintenance and a cluster is comprised of physically close
SUs which may observe similar PUs behavior. Therefore, the
increasing number of PUs does not have signicant effect on
intra-cluster connectivity due to the availability of backup
and multiple common channels in a cluster.
Fig. 22 shows end-to-end delay by varying the number of PUs in the network for the three schemes. SMART
achieves higher end-to-end delay as compared to SMARTNO-MNT and SA-AODV. This is because SMART is a clustered scheme, therefore it causes higher end-to-end delay
due to cluster maintenance as the need of cluster maintenance increases with higher number of PUs. SMART-NOMNT achieves slightly lower end-to-end delay as compared
to SMART. This is because SMART-NO-MNT does not perform cluster maintenance, and so it does not incur such delay. SA-AODV achieves lower end-to-end delay. However, the
effect of increasing number of PUs on end-to-end delay is
222
signicantly higher as compared to SMART and SMART-NOMNT, and so SA-AODV causes similar end-to-end delay as
SMART when the number of PUs is equal to 50. This is
because SA-AODV is a non-clustered scheme; therefore it
achieves lower end-to-end delay when there are lower number of PUs in the network. However, when there are higher
number of PUs in the network, there is a higher chance of
route breakage, and so SA-AODV initiates higher number of
RREQs which causes signicantly higher end-to-end delay.
In summary, this section investigates the performance of
SMART for varying number of PUs in the network in comparison with SMART-NO-MNT and SA-AODV. SMART achieves
signicantly lower SUPU interference ratio of upto 83% as
compared to SA-AODV and lower route discovery frequency
of upto 49% and 57% as compared to SMART-NO-MNT and SAAODV, respectively. However, SMART causes slightly lower
throughput and higher end-to-end delay. SMART (with cluster maintenance) is preferred when the number of PUs is
low; while SMART-NO-MNT (without cluster maintenance)
is preferred when the number of PUs is high, which is a scenario not preferable for CR operation.
9. Conclusion and future work
This paper proposes a SpectruM-Aware clusteR-based
rouTing scheme called SMART for cognitive radio networks
(CRNs). SMART enables secondary users (SUs) to form clusters in a CRN and enables each SU source node to search
for a route to its destination node in a clustered network.
SMART applies an articial intelligence approach called reinforcement learning (RL) to help SUs to make right decisions on routing, particularly the selection of next-hop node
based on the OFF-state probability of channel availability
along a route by observing and learning the operating environment. SMART adapts cluster size by performing cluster maintenance, comprised of cluster merging and cluster
splitting, to reduce the dynamic effects of the network (i.e.,
PUs activities). The performance of SMART is compared with
clustered scheme without cluster maintenance (SMART-NOMNT) and non-clustered scheme (spectrum-aware AODV
or SA-AODV). Simulation results show that SMART selects
routes with higher stability, as well as reduces SUs interference to primary users (PUs) and routing overhead in terms of
route discovery frequency. SMART is able to achieve upto 76%
lower SUPU interference ratio and 49% lower route discover
frequency. These are achieved due to the selection of stable routes with channels having higher OFF-state probability, as well as cluster maintenance mechanisms which overcome the dynamic effects caused by PUs activities. These are
achieved without signicant degradation of throughput and
end-to-end delay performances.
In future, we aim to investigate important areas of SMART.
First, cluster merging and cluster splitting can be made
trac-aware. Hence, cluster merging occurs if a cluster has
enough capacity to handle tracs from its own and neighboring clusters, and cluster splitting occurs if a cluster does
not have enough capacity to handle tracs from neighboring clusters. Since cluster merging and splitting are only performed when required, it is expected to improve the energy
eciency of SUs. Second, best-t channel selection (BFC)
scheme [29] can be incorporated in order to select operating channel which is expected to remain idle for the duration of a trac, rather than the longest possible duration. Third, channel capacity metric can be improved by
considering exponential weighted moving average (EWMA)
[41], which considers the most recent values in its estimation. Fourth, SMART is designed for single-radio interface, so it can be extended to multi-radio interfaces. Fifth,
SMART can be extended to fulll requirement on the minimum number of nodes in a cluster in order to enhance
network scalability. Sixth, mobility imposes another challenge of dynamicity which affects routing when any node involved in a route moves outside the transmission range of its
previous- or next-hop node causing a link to break. Therefore, it is important to investigate the effects of node mobility. Finally, stay-and-wait policy for channel selection can be
223
[23] A. Bourdena, C.X. Mavromoustakis, G. Kormentzas, E. Pallis, G. Mastorakis, M.B. Yassein, A resource intensive trac-aware scheme using
energy-aware routing in cognitive radio networks, Future Gener. Comput. Syst. 39 (2014) 1628.
[24] U. Berthold, F. Fu, M. Van der Schaar, F.K. Jondral, Detection of spectral
resources in cognitive radios using reinforcement learning, in: Proceedings of IEEE DySPAN, 2008, pp. 15.
[25] A. Galindo-Serrano, L. Giupponi, Distributed q-learning for aggregated
interference control in cognitive radio networks, IEEE Trans. Vehicular
Technol. 59 (4) (2010) 18231834.
[26] Y. Teng, Y. Zhang, F. Niu, C. Dai, M. Song, Reinforcement learning based
auction algorithm for dynamic spectrum access in cognitive radio networks, in: Proceedings of IEEE VTC, 2010, pp. 15.
[27] P. Zhou, Y. Chang, J. Copeland, et al., Reinforcement learning for repeated power control game in cognitive radio networks, IEEE J. Selected
Areas Commun. 30 (1) (2012) 5469.
[28] A.S. Cacciapuoti, M. Cale, L. Paura, Reactive routing for mobile cognitive radio ad hoc networks, Ad Hoc Netw. 10 (5) (2012) 803
815.
[29] S. Bayhan, F. Alagz, A Markovian approach for best-t channel selection in cognitive radio networks, Ad Hoc Netw. 12 (2014) 165177.
[30] W. Kim, M. Gerla, S.Y. Oh, K. Lee, A. Kassler, CoRoute: a new cognitive
anypath vehicular routing protocol, Wireless Commun. Mob. Comput.
11 (12) (2011) 15881602.
[31] A.W. Min, K.G. Shin, Exploiting multi-channel diversity in spectrumagile networks, in: Proceedings of IEEE INFOCOM, 2008.
[32] M.H. Rehmani, A.C. Viana, H. Khalife, S. Fdida, Surf: a distributed channel selection strategy for data dissemination in multi-hop cognitive radio networks, Comput. Commun. 36 (10) (2013) 11721185.
[33] S. Bayhan, F. Alagz, Distributed channel selection in CRAHNs: a nonselsh scheme for mitigating spectrum fragmentation, Ad Hoc Netw.
10 (5) (2012) 774788.
[34] H. Kim, K.G. Shin, Ecient discovery of spectrum opportunities with
MAC-layer sensing in cognitive radio networks, IEEE Trans. Mob. Comput. 7 (5) (2008) 533545.
[35] T. Chen, H. Zhang, G.M. Maggio, I. Chlamtac, CogMesh: a clusterbased cognitive radio network, in: Proceedings of IEEE DySPAN, 2007,
pp. 168178.
[36] J.A. Boyan, M.L. Littman, Packet routing in dynamically changing networks: a reinforcement learning approach, in: Advances in Neural Information Processing Systems, 1994, p. 671.
[37] C. Bettstetter, S. Konig, On the message and time complexity of a
distributed mobility-adaptive clustering algorithm in wireless ad hoc
networks, in: Proceedings of European Wireless Conference, 2002,
pp. 128134.
[38] I.I. Er, W.K. Seah, Clustering overhead and convergence time analysis of
the mobility-based multi-hop clustering algorithm for mobile ad hoc
networks, J. Comput. Syst. Sci. 72 (7) (2006) 11441155.
[39] Qualnet Communications Simulation Platform. URL http://web.
scalable-networks.com/content/qualnet, 2015 (accessed 07.15).
[40] K.R. Chowdhury, I.F. Akyildiz, CRP: a routing protocol for cognitive radio
ad hoc networks, IEEE J. Selected Areas Commun. 29 (4) (2011) 794
804.
[41] M. Hoyhtya, S. Pollin, A. Mammela, Classication-based predictive
channel selection for cognitive radios, in: Proceedings of IEEE ICC, 2010,
pp. 16.
Yasir Saleem received the B.S. degree in Information Technology from National University of
Sciences and Technology (NUST), Islamabad, Pakistan in 2012. Currently, he is an M.Sc. candidate
in Computer Science by research at Sunway University, Malaysia under the joint programme of
Sunway University, Malaysia and Lancaster University, UK. He is also a reviewer of Pervasive and
Mobile Computing, Articial Intelligence Review,
International Journal of Communication Systems,
Computers & Electrical Engineering, IEEE Globecom 2014, IEEE ICC 2013. His research interests
lie in the domain of wireless networking, specifically, cognitive radio networks, wireless sensor networks and unmanned
aerial vehicles.
224
Nordin Ramli received the B.Eng. degree in Electrical Engineering from Keio University, Japan in
1999. He received the M.Eng. and Ph.D. degrees,
both in Electronic Engineering from the University of Electro-Communications, Japan in 2005
and 2008, respectively. Currently, he is a staff researcher at Wireless Network & Protocol Research
(WNPR), MIMOS Berhad, Malaysia. His current research and development (R&D) interests are in
the area of cognitive radio, TV white space, spacetime processing, equalization, adaptive array system and wireless mesh networking. Based on his
work, he has published over 60 journal and conference papers, and led more than 20 patents.