1. Introduction
The rapid growth and intensive development in the areas of wireless communication and computation science, including wireless sensor networks (WSNs) and other related technologies, is increasingly being used to satisfy evolving user requirements [
1,
2,
3]. WSNs have increased flexibility in terms of maintenance and deployment when compared to conventional sensor networks. Due to the high demand and efficient scalability of WSNs, it has invaded numerous sectors. It has a prominent place in every corner of society, particularly in applications such as smart cities, industry 4.0, precise agriculture, and farming management [
4,
5,
6]. WSNs have the attributes of significance and superiority and have been implemented in several domains due to increased flexibility and low cost. WSNs also play a pivotal role in environmental monitoring by gathering critical environmental parameters such as temperature, noise, fire detection, pollution, among many others. [
7,
8,
9]. WSNs have seen substantial advancement in recent decades, particularly concerning data processing, communication quality improvements, energy saving, and data storage capacities. It has prompted the development in advanced technology domains of Cloud Computing, Big Data, and the Internet of Things.
In the standard architecture of WSN, its physical arrangement involves a large number of sensor nodes (SNs), each having a radio frequency (RF) transceiver system, intelligent microprocessor, storage, and battery. However, many challenges encountered by WSNs have been investigated and well researched, such as the limited storage capacity, energy constraints, and extensive deployment range required [
10,
11]. Besides the challenges mentioned above, particularly for applications such as monitoring and data gathering, two additional aspects need to be explored. First, a WSN comprises of static (fixed) placement of SNs. Although this static topology brings advantages of energy and cost efficiency, the overall system still lacks agility and mobility.
Furthermore, the static deployment of SNs restricts scalability and applicability. For environmental surveillance in large regions, there are inconsistencies between the increasing range of surveillance areas and the limitation of the surveillance scope of traditional WSNs. In a WSN monitoring environment, different obstacles may hinder the path of the wireless signals, such as huge tall buildings, walls, trees, human presence, and machines. However, these obstacles may significantly influence the quality of communication and wireless signal strength during signal propagation and cause deep fading of wireless signals, attenuation, and strong reflections from the objects.
Secondly, the clustering of WSNs is a critical aspect in many applications. Efficient clustering mechanisms can help achieve a longer life with energy conservation. In the clustering of nodes, the selection of cluster heads (CHs) and optimization of cluster structure are vital factors to be considered. The K-means clustering algorithm is widely used for cluster formation in different applications, including WSNs. However, K-means algorithms have certain drawbacks; such as the initial centroids are chosen randomly, leading to local optima, as seen in
Figure 1. The figure shows there are four clusters in this simulation setup. Different colors represent the different clusters and their respective cluster heads. The cluster heads are denoted by a square mark, and separate shapes in each cluster represent the sensor nodes. There can be conditions where empty clusters or clusters having relatively low sensors are included. In cluster 4, there are only three sensor nodes. K-means algorithm does not guarantee its convergence into the best results. Even the optimal cluster’s density also cannot be decided and is given as an input by the user.
To solve these two important research problems, researchers have proposed several solutions and methodologies. Many robots or mobile land vehicles described in literature reports are used in WSNs where the mobile robot/vehicle can act as a sink node, a relay node, and a base station. Unmanned air vehicles (UAVs) are the best among all mobile platforms and robots [
12,
13], widely employed in applications such as aerial photography, agriculture, and environmental monitoring. Safety, ease of operation, adaptability, and a broad monitoring range are UAVs’ significant characteristics. In the literature, it can be seen that UAVs are used to improve the quality of service (QoS) as well as expanding the overall monitoring area, including the collection of data from SNs and transmitting it forward to the base station. However, data collection, unbalanced cluster formations, and the flight path to visit each cluster inside the WSN still need investigation and performance consideration. Therefore, UAV-based WSNs need critical and efficient solutions. The clustering problem also needs an efficient algorithm that produces balanced clusters compared to K-means and includes an optimization technique for CH selection, keeping residual energy in consideration along with Euclidean distance. Therefore, this article provides an efficient and effective architectural layout of WSN incorporating intelligent UAV-based surveillance systems. In a UAV-based data collection system, we use UAV to help form balanced WSN clusters. This cluster formation helps in the energy conservation of WSNs, leading to a longer lifetime of nodes. UAV helps in the data exchange from the WSNs to the BS, reducing the communication range.
The main contributions can be summarized as follows:
- ⮚
An energy efficient clustering protocol is presented to solve the issue of unbalanced cluster structure and optimizes the CHs selection process. A uniformly distributed cluster is obtained with almost equal number of SNs; the initial CHs are not chosen randomly in this algorithm; rather, midpoint strategy is used to address this problem. This technique also considers its communication with the UAV while selecting the CHs.
- ⮚
Considering the land WSN network, a UAV flight path is determined, which can collect data from every cluster of WSN optimally. The cluster head is placed at the center of the cluster and collects data, which are then passed to the UAV.
- ⮚
Our extensive simulations validate our proposed algorithm’s performance and show the performance in terms of lifetime, cluster design, and energy consumption.
The remainder of the paper is organized as follows.
Section 2 covers the related work from the literature;
Section 3 explains the system model;
Section 4 describes the proposed methodology; and, finally, the simulation results are presented in
Section 5.
2. Related Work
Researchers have explored the area of WSN for many decades under various limitations and constraints. Based on different stages and applications, the optimization schemes and utilized objectives were also different. The models for WSN are hierarchical and flat topology [
14]. In the early phase of WSN practice, the majority of applications used the flat model. In this model, all network nodes share the same status, hardware specifications, and functions. Numerous algorithms and communication protocols have already been proposed for this topology. Furthermore, this model has adverse effects on the network management system. Moreover, SNs close to the base station (BS) might demand more energy to communicate with other network nodes via multihop, causing early discharge of the battery, leading to a dead node. As a result, the whole system network lifespan is reduced. Contrary to the flat model, hierarchical network design is based on a group of nodes functioning as sink nodes, group leader, and other ordinary nodes. Every node performs its duty, such as data collection and data transmission [
15]. The authors in [
16,
17,
18] evaluate the computation energy efficiency maximization schemes for the enhancement of WSNs.
As the WSNs research area was explored extensively, the heterogeneous sensor network evolved from the hierarchical topology of WSNs. In [
19], the authors proposed a clustering scheme to optimize the heterogeneous network using a genetic algorithm. Several recent approaches were evaluated, with their outcomes indicating that this method outperforms and extends the network’s lifetime. The authors in [
20] presented the idea to reduce communication overhead by using the energy-aware clustering hierarchy protocol. For effective data collection and routing in WSNs, a multilevel hierarchical architecture was adopted. The proposed scheme simulation outcomes showed that it consumes the least amount of energy.
The authors in [
21] introduced a new methodology of reclustering that improves overall system efficiency by appropriate task management of SNs. In another work, [
22] proposed the constrained coverage (CC) technique, which considered K-neighbors for each cluster by using two virtual forces, but this method may cause the decrease of SN lifetime and low coverage area of the network. Furthermore, researchers in [
23] developed virtual force-based clustering, but this technique may cause an unstable lifetime of WSN.
Low-energy adaptive clustering hierarchy (LEACH) is a primary classical protocol, giving the idea of clustering in a WSN and introduces hierarchical transmitting of data [
24]. The clustering technique transforms the WSN into groups or a hierarchy of clusters that gather the data from their surroundings and send it to its respective cluster head (CH). The optimal selection of CHs in a WSN cluster can maximize the communication range and prolong the network’s lifetime. In every round, the method randomly chooses CHs stochastically. Then, the nominated CH communicates with every non-CH node in the cluster to collect the sensed data. Election of the best CH is a critical task as variety of conditions are required to be fulfilled for selecting the optimal node in the whole cluster [
25]. These conditions include factors such as residual energy, range, throughput, and mobility of each SN.
The LEACH algorithm extends the network lifespan compared to multihop and direct transmission but still has many drawbacks and limitations. The CH is selected on a random basis, which does not ensure an optimal solution and leads to improper distribution of SNs in each cluster, making it unbalanced. The nodes having lower residual energy levels are assigned the same priority as those with higher residual energy levels for CH nomination. Thus, when an SN of lower energy is nominated for CH responsibilities, its energy level will drop out in a shorter period, reducing network lifetime [
26]. In [
27], enhanced research work was introduced that exploits the LEACH algorithm to increase the energy efficiency of WSN. The authors in [
28] proposed an optimized zone-based energy efficient protocol (OZEEP) for optimum CHs selection and improved the clustering by incorporating genetic fuzzy systems (GFS). One of the critical issues in clustering is optimizing the CHs selection and improving the cluster structure. The K-means method is highly effective in producing clusters for a myriad of IoT-based WSN applications. Various K-means-based techniques are discussed for efficient clustering [
29,
30,
31,
32,
33,
34].
However, this past research focuses only on WSNs, excluding the performance and applications of UAVs for data collection and surveillance purposes. Furthermore, these methods do not evaluate the topography and quality of wireless transmission during the design and position for surveillance systems. However, all these key factors must be taken into account for UAV-based WSNs. The authors in [
35] proposed distributed and centralized K-means clustering technique. Although it is a good scheme for WSN clustering, the researchers only considered the distance parameter in its evaluation. With the growing development of UAV involvement in WSN, numerous literature reports exist for UAV-based WSNs. The studies can be further classified into optimal algorithms and applications. The first phase of UAV integration with WSN has been evoked in many domains, such as healthcare observations [
36], monitoring of animals [
37], data collection for greenhouse gases [
38], and agriculture units [
39]. To develop high-quality systems, the authors in [
40,
41] introduced new architecture of UAV-based WSN and evaluated certain applications. However, they mainly consider specific WSN types without focusing on UAVs and ground network systems. During the second phase, work done by the authors in [
42,
43,
44] still face challenges in WSN overall energy conservation, adopting several techniques to optimize the problems related to routing, transporting protocols, and MAC in UAV-based WSNs. From the perspective of UAV, several studies identify flying control, path planning, and many other issues [
45,
46,
47]. These studies exposed a new direction of research into UAV-based WSN and its applications. In addition, the approaches that we have stated focus solely on the challenges and conditions from a single perspective, i.e., either UAV or WSN, and do not examine the aerial mobile robots and ground network systems as a unified system. This substantially restricts their applicability and integration for many remote-based large-scale surveillance systems. In short, substantial use of these techniques cannot accomplish all the structural layout objectives of UAV-aided WSN for environmental monitoring. Moreover, in [
48], the particle swarm optimization (PSO) approach was adopted to reduce UAV travel time, energy consumption, and bit error rate (BER). The ground WSN must be recurrent to choose the optimal CH during a single time slot or over multiple time slots. Furthermore, due to change in network topology resulting from the change of the CHs, UAV involvement helps to recalculate the flight path. This approach depends on ideal assumptions, which cannot be considered realistic scenarios, and further effort and work are required before data can be gathered in advance.
3. System Model
In this section, the system model and preliminary concepts of our work are discussed. A scenario is considered where several nodes are deployed in a random manner to collect the environmental parameters such as temperature, humidity, etc. The overall architecture of the monitoring system includes a UAV sink node, sensor nodes, cluster heads, and a remote base station. Each cluster has a cluster head, which receives the data from the sensors and then transfers it to the UAV, and acts as a sink node. The UAV further transmits these data to the remote base station. The land system computes the UAV’s flying trajectory once the geographical positions of CHs are obtained. The computation of UAV’s flight path parameters such as distance and time are considered.
In the proposed system, the UAV is also utilized for the performance enhancement of the WSN, by making it more energy efficient in data collection and monitoring. In the proposed technique, optimized K-means clustering protocol is used to improve the cluster structure, CHs selection, and low-energy consumption for data communication.
Figure 2 expresses the stepwise working of the proposed scheme, and the topology of the network considered.
Table 1 gives the details of the symbols and notations.
In the scenario, a square range area with dimensions
is assumed, with N randomly deployed SNs in the sensing area. Both the SNs and the BS are static in nature, and there is only one UAV capable of flying over the sensing region. All the SNs are having same amount of initial energy and to be homogeneous in nature. The BS knows the geographical information of all deployed SNs. The proposed strategy starts with calculating the optimum cluster density in the area of interest, depending upon the sensing range and the total number of SNs. Mathematically, the number of optimum clusters can be calculated as follows [
48]:
where
is the distance between CH to BS, and
and
are parametric values for the free space and multipath model, respectively. The data are initially transferred to the BS, which shares this information with the UAV, to follow the CHs during its flight.
The next stage is to identify the cluster heads. Rather than picking the CHs by random means our proposed strategy uses midpoint technique. This methodology resolves the unbalanced cluster structure and uniformly deploys CHs to ensure that every cluster has almost equal SNs. This leads to an equal and balanced communication load on the CHs, which eventually expands the network’s life. This technique is explained in the next section.
Our proposed technique considers residual energy of SNs along with the Euclidean distance for selection of CHs. The Euclidean distance is employed with the K-means basic approach; the nominated CHs transfer data to the UAV successfully. The K-means method is an iterative method that attempts to divide the dataset into K non-overlapping subgroups (clusters), where each element belongs to only one group. In the proposed scheme, K-means clustering categorizes the SNs into predefined C number of disjoint clusters. Algorithm 1 gives the idea of optimized K-means method.
Algorithm 1: Optimized K-means clustering method |
Input: |
X = consists of a total n number of data items. |
C = required clusters |
Output: |
A complete set of C clusters |
Steps: |
1: Choose C data items as initial centroids from X randomly. |
2: Repeat |
3: Associate each data item to the closest available centroid |
4: Mean value calculation for every cluster |
5: Continue until it meets the convergence criteria. |
Another feature of our proposed clustering algorithm is that a node may not be nominated as CH if its remaining energy is less than a defined threshold. In our scenario, the estimate of the residual energy threshold is shown by the total energy needed for the aggregation, receiving and transmitting it to the average number of SNs in the cluster. Data aggregation happens in each of the selected CH and ultimately transferred to the UAV.
This technique reduces the energy consumed by the CHs for data transfer. The communication range between CHs and UAV is kept small for data transfer. If the distance between CH and UAV is more than the threshold, the UAV will choose a different node, as CH is based on the Euclidean distance. Those nodes having good residual energy and better channel conditions will be considered in this process to improve the lifetime of the WSN. This mechanism can be called UAV-assisted re-election of CH, where the UAV performs the process under the TDMA scheme.
Once the clusters are formed and CHs are finalized, after determining CHs positions and geographical coordinates, the ground-based monitoring system computes the flight path for the UAV using an intelligent algorithm. UAV flies over CHs to function as sink node. It gathers all of the data and sends it to the base station for processing. The proposed flight trajectory for UAVs will visit each cluster for data collection from CHs in a shorter period and shorter path with the aim of low battery usage. Moreover, other aspects must be seen as the distance between the clusters, flight duration, and speed.
A radio energy dissipation model is used for performance evaluation of the proposed model. To transfer the k-bits message to a distance l, the radio utilized as follows:
where
is transmit power by the electronic circuit to send 1-bit of data;
and
are the coefficients of free space and multipath models. In the free space model, their energy dissipations are proportional to
for and in the case of multipath model proportional to
. However, the threshold
is calculated as follows:
The energy required to receive the k-bits message at the receiving end is calculated as: