1. Introduction
With economic globalization, international trade has become an important part of the world economy, and maritime transport bears more than 80% of global trade freight [
1]. Security and transparency are vital to maintaining this mode of transport, but they are threatened by illegal activities at sea such as illegal unregulated and unreported (IUU) fishing, human trafficking, smuggling, and piracy plunder [
2]. A common feature of these crimes is that goods, supplies, or personnel are transferred between vessels during the voyage, which is broadly defined as transshipment behavior [
3], also known as ship-to-ship transfer.
In the case of IUU fishing, fishing vessels remain at sea for long periods, fishing and waiting for transshipment vessels to land caught seafood. This leads to overfishing and allows illegally caught fish mixed with legal products together to enter the market, making it difficult to estimate the true number of fish caught in the region [
4]. The Food and Agriculture Organization (FAO) of the United Nations has identified transshipment as a key indicator of international deterrents to IUU fishing [
5]. For human trafficking, by receiving supplies and fuel from other ships, crew members can be kept at sea indefinitely, raising issues such as slavery and bonded labor [
6]. Concerning smuggling activities, illegal goods are transshipped at sea by organized criminal networks to falsify the origin of goods and transport them unregulated across the world’s oceans. To monitor and combat these maritime illegal activities, it is crucial to identify suspected illegal transshipment behavior at sea [
7].
These infringements are often committed in areas where supervision is weak, such as the high seas, and the sheer size of the world’s oceans presents a major obstacle to the implementation of maritime governance [
8]. However, the trajectory of the vessels involved in transshipment has distinct characteristics that differ from most of the other normal ship movements, namely rendezvous, as shown by the ships being close to each other at low speed for a period far from strictly regulated sea areas. In contrast, other ships normally navigate along the main traffic corridors, sailing directly to their destinations and maintaining a safe distance from other vessels. To monitor the trajectory of vessels in vast ocean space, technology such as drones, imaging satellites, combined satellite-terrestrial AIS (automatic identification system) [
9], and other remote sensors can be used. By comparision to traditional maritime monitoring through manned ships, these methods are cheaper and more accessible for states [
10,
11,
12].
Even though illegal transshipment behavior has some common characteristics, it remains difficult to develop an ideal model that can automatically and accurately identify them for the following reasons. Firstly, it is difficult to determine the extent of behavior characteristics, such as speed and proximity of the vessels. Different locations, environmental conditions, and regulatory constitutions influence the characteristics of the behavior in different contexts [
13] and if the ships are considered as mass points when calculating the distance between them, the actual distance will differ from the calculated one, particularly when the vessels are large. Secondly, the availability of real-world examples for the study is limited. Even in regions that have adopted transshipment regulations [
14,
15,
16], data on transshipments is closely guarded and often considered sensitive business information. While in other regions, transshipments activites are without any independent observations, transshipped catch verification, or suspected transnational criminal activities monitoring [
3]. Thirdly, the basic features of illegal transshipment are similar to certain normal navigation behaviors. In the following scenario, when a ship is traveling in a channel crowded with other ships, it chooses to travel at a slower speed for navigation safety and maintain this state while in the channel. According to its basic characteristics, the ship is likely to be misidentified as engaged in illegal transshipment.
Considering the common feature of illegal transport and meeting actual regulatory requirements for interpretability, much of the current literature uses rule-based methods, by determining whether the characteristics of two ships’ trajectories meet the predetermined patterns [
3,
17,
18,
19,
20,
21]. Additionally, a relatively small body of literature uses data-based methods such as machine learning techniques which learn the behavior pattern from labeled samples [
22,
23]. The main advantage of rule-based methods is their interpretability, which is essential for regulators to make decisions regarding suspicious or dangerous vessel activities. However, such patterns do not learn from data, and can barely identify ship behavior that fits with some priori definitions that are highly influenced by human subjectivity [
24]. Although the data-based methods can straightly learn the pattern from real-world examples, they highly depend on the quality of datasets, abandon the use of expert experience, and the identification results are non-interpretable.
Defining the optimal threshold is a challenge that the rule-based approach faces. For transshipment behavior, it includes setting properly minimum encounter duration, maximum encounter speed, and maximum distance between ships. For this problem, the literature [
17,
21] directly uses fixed thresholds determined by experts. In [
18], an interface is provided for the expert to adjust the threshold based on the recognition performance in the application. The sensitivity of each threshold is analyzed in [
3], and a comparison is made between the percent change in total duration for different threshold values, and the optimal threshold is determined by finding the more tolerant boundary in the interval that triggers the biggest variation; some use the fuzzy set theory to blur pre-determined threshold boundaries [
20]. While these methods can select relatively reasonable thresholds, the values of each threshold are chosen independently rather than in combination and are not fully determined by the behavior characteristics of the sea area.
Inference results being either true or false is another shortcoming of most identification methods. In this way, even a minor perturbation could result in a completely different outcome. Markov Logic Networks are used in the literature [
19,
20] to introduce probabilistic uncertainty into the rule base by learning weights to each formula. Thus, it does not exclude events that do not meet all the conditions in the rule, but using this probability directly as a measure of suspicion is less interpretable. In the literature [
21], fuzzy logic systems have been used to first identify the transit behavior using fewer rules and then to classify the behavior suspiciously based on other features, in order to reduce missed identifications while making the level of suspicion more interpretable. However, the classification of the suspicion level only takes into account the movement features of the ship and not considering the situation of the surrounding sea.
Another problem is that illegal transshipment behavior exhibit similar characteristics to vessels that navigate in densely trafficked areas of the sea, where maritime surveillance is relatively strict. To solve this misidentification problem, various studies have attempted to allocate special areas [
3,
13,
17], such as areas near ports or coastlines, anchorages, and major traffic corridors, where rendezvous behaviors are not considered illegal transshipment. In this way, misidentified events can be effectively removed in designated areas. However, if these areas are not accurate and comprehensive enough to include all traffic main routes, the capacity of identification ability will be greatly reduced, and the ideal shape is hard to slice out in the real world. When a large number of misidentification results are generated, genuine transshipment events will be swamped, making it difficult to apply to actual maritime surveillance.
To solve these problems, we proposed a method to identify suspected illegal transshipment by incorporating knowledge-based and data-driven methods, allowing the model to learn suitable threshold combinations of predefined rules by unsupervised learning from the vessel’s dynamic data in a specific sea area. Relatively loose rules are used for preliminary screening for suspected illegal transshipment, and the suspicion level is obtained based on clustering results of vessel motion features and a traffic density feature, which was introduced to reduce suspicion of misidentification in dense traffic areas without precisely delineating special sea areas.
The main contribution of this study are summarized as follows:
An illegal transshipment identification scheme that can benefit from the interpretability advantages of the rule-based system while fully utilizing ship navigation data.
A methodology for unsupervised learning appropriate threshold distributions of transshipment behavior in the monitored sea area.
A traffic density feature was introduced to reduce the suspicion level of misidentification in dense traffic areas.
A classification of suspicion is designed based on the actual regulatory needs and is interpretable.
The proposed method is evaluated on two navigation datasets within different sea areas: a public dataset around Brest ports in France and a dataset containing some areas with high traffic volume around the Rizhao port in China. The experiment results show our methods can achieve more accurate and comprehensive identification compares to the rule-based methods proposed in the literature [
17], especially in dense traffic areas. A similar method is also used in [
3] to estimate the number of transshipment events involved in IUU fishing globally.
The rest of the paper is organized as follows.
Section 2 introduces the methods in this paper. The details of the experiments and results are presented in
Section 3, and the results are discussed in
Section 4.
Section 5 concludes the paper and gives future work recommendations.
4. Discussion
The identification results in two different sea areas as shown in
Table 6 and
Table 11 demonstrate that our methods can detect more suspicious events than fixed threshold results, and according to
Figure 8, these events with high and medium suspicion satisfy behavior features of illegal transshipment. There are several possible explanations for this result. Firstly, if the ships involved in the case continue to move during the process, the event that strictly meets the specified threshold may be divided, resulting in a short duration of a single event and causing the identification to be missed, as shown in
Figure 7. Secondly, since the position broadcast by the vessel comes from the position where the sensor is located, the size of the ship will make the actual distance larger than the calculated one, so a small distance threshold will tend to filter out events related to large vessels. Thirdly, some transshipment behavior exhibits motion characteristics that exceed the fixed threshold, some real-world examples can be found in
Section 3.3.3.
By analyzing the result of fixed threshold methods exhibited in
Table 7 and
Table 12 in the two datasets, we found about 61.38% and 80.13% of the identifications are generated in areas with greater traffic density, respectively. From the geographical distribution of identification in
Figure 6,
Figure 10 and
Figure 13, the highly and moderately suspicious events identified in this paper turned out to be more distant from strictly regulated areas such as coasts and ports. This indicates that fixed threshold approaches without taking into account the traffic density characteristics are likely to lead to a large number of false positives in dense marine areas without predefined geometry. This was further verified in
Section 3.3.4, based on the sea area with obvious traffic flow, most of the events occurring within the traffic flow area are classified as low suspicion by our methods, and the number of high and medium-suspicious events within 10 days remains within the supervisory range, which meets the practical application requirements, and the method based on fixed thresholds generates a large number of suspected events in traffic dense area.
By comparing the clustering models of the two sea areas as in
Table 4 and
Table 9, as the sea area changes, the feature distribution of the clustering model also changes, still separating events that match the characteristics of illegal transshipment with stable proportions (
Table 5 and
Table 10), while the method with fixed thresholds produces a large number of false identification events in the densely trafficked Chinese coastal region, and with tests on the legal transshipment events, the clustering model can divide events as expected in
Table 1.
5. Conclusions
This paper proposes a method to identify suspected illegal transshipment. First of all, this method combines a rule-based method with an unsupervised learning method, which can automatically determine the threshold distribution combination suitable for regulatory sea areas. It solves the problem of missing identification of fixed threshold method based on expert experience. Secondly, the method introduces traffic density features in the sea area, which can effectively reduce the false identification rate of suspected illegal transshipment. This method is validated on two sea areas. The results of French Breast port dataset show that this method can fully cover illegal transshipment events identified by the fixed threshold method.The results can be divided according to the suspicious degree and interpretable. Compared with the fixed threshold method, this method identifies more illegal transshipment events, which are highly suspicious, and gives warning much earlier. The proposed method can even filter out misidentification events from compared methods’ results, which account for more than half of the total number. The results of Chinese Rizhao port dataset show that this method can cover all illegal transshipment events identified by the fixed threshold method, which is consistent with the results of French breast port dataset. This method can effectively solve the problem of misidentification in dense traffic areas and improve the recognition accuracy. What’s more, it can identify all legal transshipment events. The proposed method is an active exploration in the application of the combination of rule-based method and machine learning method, which is significant for the practical application of identifying suspected illegal transshipment behavior.
The following three aspects shall be improved in the future. Firstly, different event features, such as the closest distance between vessels’ trajectories shall be considered to better distinguish illegal transshipment events. Next, limiting factors, such as AIS closures, can be taking into account to improve practical use. Last but not least, background information can be investigated to help further judge illegal transshipment from suspected illegal transshipment.