1. Introduction
An oceanic eddy is a circular current of sea water [
1,
2,
3]. The largest scale oceanic eddies are caused by the instability of horizontally sheared motion typically seen in boundary current, while relatively smaller (on a horizontal scale of tens of kilometers) eddies such as baroclinic eddies are generated by the sudden severe change of sea water’s transverse density gradient associated with baroclinic instability [
4,
5]. According to the rotation directions, oceanic eddies can be divided into cyclonic eddies and anticyclonic eddies. They can also be divided into different categories according to their sizes. In oceanography, mesoscale eddies normally refer to eddies with a horizontal scale of tens to hundreds of kilometers and they can survive for several days to 1 year [
6,
7,
8]. Submesoscale eddies are those with a smaller horizontal scale on the order of 1 km and they only survive for a period of a few hours to several days [
9,
10,
11]. Although mesoscale eddies have a larger diameter and a lifetime of days and weeks, there are also smaller short-living eddies (e.g., in frontal filaments, behind the coastal and topographic features). In the ocean, mesoscale and submesoscale eddies transport particles, debris, and organisms and transfer heat, mass, and oxygen in both horizontal and vertical directions, which play a critical role in the marine ecosystem and environment [
12,
13,
14,
15]. Besides, eddies may appear on shipping routes and in offshore regions and affect human marine activity. Therefore, the detection and tracking of mesoscale and submesoscale eddies are significant.
The primary objective of eddy detection is to determine the eddy center, radius, and boundary [
16]. Three typical equipments used to collect eddy data are in-situ sensor [
17], satellite [
18] and high-frequency radar (HFR) [
16,
19]. An in-situ sensor is good for eddies in a small-scale region, but it cannot provide complete eddy information when an eddy randomly occurs in a large-scale region. In other words, only when the eddy location can be predicted in advance, the in-situ sensor is effective. Thus, an in-situ sensor is seldom utilized in a region with unpredictable eddies. Satellite sensor is useful for large-area, long-period, and multi-type eddy observation since it has a large field of view. However, sea surface height (SSH) and sea level anomaly (SLA) results from satellite data are characterized by a low spatial resolution (larger than
) and a long revisit period (far more than 1 h). HFR can accurately measure offshore eddy parameters due to its relatively higher spatial resolution (
) and near real-time observation capability (every 20 min). So far, HFR is the most efficient equipment for submesoscale eddy monitoring.
The detection of oceanic eddy depends on the geometry characteristic of the eddy. The spiral structure of an eddy in the three-dimensional (3D) space can be well studied by a projection onto the two-dimensional (2D) plane, where the sea surface vector currents show an obvious rotating structure. For the continuity of the fluid, there should be a vertical current component to compensate for the non-zero 2D divergence, so a more detailed eddy structure can be retrieved by deploying more sensors in the 3D space. Existing eddy detection methods fall in four categories: expert-based visual method [
20], physics-based statistics method [
21], geometry-based statistics method [
22], and deep learning-based method [
23]. The expert-based visual method depends on the expert’s experience and visual judgment to distinguish eddies. It is the most reasonable method but requires significant labor effort. Such detection results are usually used as ground truth. The physics-based statistic method detects eddies according to whether the values of certain physical quantities are higher than a threshold, such as pressure, the current’s speed, vorticity, and helicity. The physics-based method searches for the outermost closed contour of the streamlines as the eddy boundary and the zero-speed position within the eddy boundary as the eddy center. The classical physics-based method is the Okubo-Weiss method [
24], which employs three direct indices, including shearing deformation rate, straining deformation rate, and vorticity, for determining the existence of an eddy. The geometry-based statistic method detects eddies by checking whether the eddy’s pattern satisfies geometric criteria according to the eddy’s shallow features extracted. The representative geometry-based methods include vector-geometry (VG) [
25] and winding-angle (WA) [
26] methods. The VG method utilizes four different constraint conditions to locate the eddy center, and then it treats the center’s outermost closed streamline contour as the eddy boundary. The WA method firstly implements clustering operation to determine the streamline centers and then regards the streamline centers of the same cluster as the eddy center, finally, it fits streamlines of the same cluster as the eddy boundary. Deep learning-based methods include object detection-like pipelines and pixel segmentation-like pipelines. Deep eddy detection means using deep learning-based methods to implement the eddy detection task. In the first pipeline, eddy detection is regarded as an object detection task. The object detection network firstly identifies the candidate region possibly with the eddy, then reduces the area of the candidate region, and then generates a bounding box without an overlapped area, finally, it uses the VG method for localizing the eddy center and computing the eddy boundary in the bounding box. The performance of the first pipeline depends significantly on the VG’s detection capacity. Deep learning is only applied for the feature extraction and bounding box. Therefore, the researchers attempt to develop an end-to-end network to make full use of deep learning for eddy detection. In the second pipeline, eddy detection is regarded as a pixel segmentation task. The pixel segmentation network takes the original current field as input and directly outputs the identified pixels associated with cyclonic eddies, anticyclonic eddies, and background. The second pipeline is convenient for the oceanologist since they need little deep learning knowledge but can rapidly identify the eddies. Meanwhile, the black-box-like structure of the second pipeline provides the users with little flexibility to alter the module, which may lead to a performance lower than the first pipeline. In this work, we incorporate the transfer learning technology into the end-to-end pix-el segmentation pipeline for improving its performance in eddy detection.
However, two challenges remain in actual eddy detection using HFR data. The first is that effective labeled data are always insufficient and much less than collected data since visually labeling all the data is impossible and labeling takes much labor and time. Furthermore, labeling the collected data only by an expert’s visual judgment is quite inefficient. Fortunately, although the amount of labeled eddy data is small in each experiment, the number of HFR’s observation experiments is increasing with the spreading of HFR. The labeled data from other experiments can be shared for public access to augment the training datasets. In this study, we attempt to use instance-based domain adaption to increase training samples using previous similar observation data.
The second challenge is how to inherit experience from previous detection [
27]. As for classical eddy detection methods, past detection results including fuzzy perception, threshold, and parameters from a previous analysis may give limited assistance to the construction of a detection model in the present application. When the observation area is the same, the model trained previously may be adopted. When the area is different, the direct application of previous models may cause problems. Because the network trained for one region (e.g., Mediterranean) may recognize some exclusive characteristics of the eddy in that area but if they do not exist in the other region (e.g., Malacca Strait), redesigning or retraining of the network is necessary. Moreover, previous deep models are mainly developed for the regions with enough data (e.g., Mediterranean and Australian’s Coral Sea regions), while the regions with insufficient data (e.g., Cape of Good Hope and Malacca Strait) attract only a little attention. The transfer learning technology can be used to identify and discard specific regional characteristics and retain common eddy characteristics, then the data imbalance problem can be solved even if the data are insufficient. Here, we attempt to use parameter-based transfer learning by sharing a general feature layer for eddy detection in multiple regions.
In this paper, a cross-domain eddy detection neural network, CDEDNet, is proposed for HFR applications. Here, cross-domain means analyzing the eddy in one region using the network trained with the eddy data collected from another region. Firstly, a fully convolutional network skeleton-based eddy detection network CDEDNet-0, which is developed using the HFR data from the South China Sea is presented. Secondly, for dealing with the problem of insufficient labeled data, we present an instance-based domain adaption method for increasing training samples in CDEDNet-1. Thirdly, for dealing with the problem of inheriting previous detection experience, CDEDNet-2 which incorporates parameter-based transfer learning, is designed for multi-scene eddy detection. The remaining content of the paper is organized as follows.
Section 2 introduces the related work of deep eddy detection and transfer learning. In
Section 3, the HFR systems and data are illustrated.
Section 4 describes the structure and loss metric of three CDEDNets.
Section 5 demonstrates the performance evaluation and observation analysis.
Section 6 contains the conclusion and future plan.
2. Related Work
With the rapid development of deep learning, deep learning-based eddy detection attracts significant interest. In [
28], the Ocean Eddy Identification Neural Network (OEDNet) was presented for locating mesoscale eddies automatically. OEDNet is constructed on the skeleton of RetinaNet. It searches for mesoscale eddies with a small number of samples and multiple sea level anomaly (SLA) data. In [
29], an eddy identification and tracking framework was proposed based on SLA data from Australia. The framework combines a convolutional neural network for feature learning and the VG method for identifying the eddy’s location and shape. In [
30], an artificial intelligence idea was adopted for eddy detection using PSPNet and the VG method. However, its performance is almost the same as the VG method. Because object detection-like pipeline depends on the VG algorithm to determine the eddy’s center and boundary, its performance is limited by the VG algorithm to a large extent.
Some researchers attempt to use an end-to-end neural network to transform the current field to the final eddy detection. In [
31], a neural network-based ocean eddy current pixel classification framework (EddyNet) was presented for detecting eddies. EddyNet consists of a U-shape neural network (U-Net) structure with a pixel-wise classification layer and uses Copernicus Marine Environment Monitoring Service (CMEMS) SSH maps for exploring eddies. Its classification results are worse than that of statistic-based methods due to limited training data and the closed-contour identification method. In [
32], a CNN-based classifier for detecting Mediterranean Sea eddy using SST images was designed. It uses two labeled datasets for model training. The results demonstrate that the model based on a manually annotated dataset performs better than using an automatically annotated dataset. In [
33], a dubbed DeepEddy which involves two principal component analysis (PCA) convolution layers for exploiting eddy features and a non-linear transformation for transforming features was developed. DeepEddy utilizes multi-scale features fusion technology in synthetic aperture radar (SAR) images. Its performance is comparable with the statistic-based method, but its computation load is higher. These end-to-end neural networks are based on pixel segmentation neural networks. Although they perform well in predicting the eddy’s position, their performance significantly relies on the quality of labeled data and the choice of neural network. Therefore, our work follows the end-to-end roadmap to construct the fundamental CDEDNet and improves the model’s performance through instance-based domain adaption and parameter-based transfer learning.
3. Data Description
Our experiment of eddy detection is based on the sea surface current data collected by the Ocean State Monitoring and Analyzing Radar, model S (OSMAR-S), which is a compact HFR system designed by Wuhan University. At 13 MHz, one radar provides a radial current map up to 100 km offshore with a range resolution of 2.5 km every 6.5 min, and two radars at different locations provide a vector current map on a longitude/latitude grid with a uniform spacing of 0.05 degrees in both dimensions every 20 min. OSMAR-S uses monopole as the transmitting antenna and monopole/cross-loop antenna to receive echoes. The sweep bandwidth is 60 kHz and the average transmit power is 100 W. For more details about OSMAR-S, the reader can refer to [
34,
35].
Some control-quality assurance (QC-QA) measures have been implemented in the current mapping software as well as the hardware of OSMAR-S. An automatic frequency selection module based on noise spectrum monitoring [
36] provides the optimum work frequency, i.e., with the lowest noise floor, to ensure the best signal quality for each processing cycle. Before surface current estimation, radio frequency interference (RFI) and instantaneous interference are detected and removed if they are detected [
37]. In the radial current map generation, a quality factor is calculated for each radial current based on its corresponding signal-to-noise ratio (SNR) and spatial spectrum, and it is used to decide whether this radial result is valid and its weight in subsequent processing of merging and interpolation (if this radial passes the test). The quality factors for radials are also used for vector current calculation. Median filters in both temporal and space domains are used to remove the weird values in both radials and vectors. Here, the QC test generally follows [
38]. With such QC-QA measures, high-quality sea current data can be obtained although a small number of outliers still pass the test especially those on the boundary or baseline. Typical root-mean-square errors (RMSEs) in OSMAR-S are 10–15 cm/s for radials, and 10–16 cm/s in magnitude, and 10–20 degrees in direction for vector current.
This study involves two sets of OSMAR-S data at different times and locations. In the first dataset, two OSMAR-S radars were deployed at Shanliao and Xian in the Fujian province of China. The observation region covers the southwest of the Taiwan Strait and the experiment period lasts for 80 days, from 11 January 2013 to 31 March 2013. The sample period of OSMAR-S is 20 min, and the total number of generated current fields is 5760. In the second dataset, the same OSMAR-S radars were deployed at Nansan and Xuwen in the Guangdong province of China. The observation region covers the overlapped area between the north of Qiongzhou straits and east of the Leizhou Peninsula in the South China Sea. The 88-day observation period is from 25 July 2017 to 21 October 2017. With the same sample rate, a total number of 5904 current field maps were generated. In both radar current datasets, the effective data rate at each grid point was greater than 0.85. In practical observations, data were missing or of very low quality (e.g., significantly contaminated by noise) for some periods. The effective data rate was calculated as the ratio of the period with useful data over the whole observation period. The high data rate of radar measurement ensured adequate samples for further eddy detection algorithms.
Figure 1 depicts the two observation regions.
Figure 2 shows the geometric dilution of precision (GDOP) for the current maps in
Figure 1. As can be seen, the GDOP had large values near the baseline and greater values near the boundaries and thus, the current measurements in these regions were less reliable than those in the central region. However, the effect of corresponding outliers on our eddy detection results was negligible since eddies were not in these questionable regions in this study (as can be seen in the following figures showing eddies).
For cross-domain eddy detection, one geographical dataset was selected as source data and another as target data. Considering that deep eddy detection had been implemented for the Taiwan Strait dataset [
39] and the small number of eddy observations associated with the South China Sea, the Taiwan Strait dataset was chosen as the source data, and the South China Sea dataset was regarded as the target data. The source data consisted of 600 current field images (300 cyclonic eddies and 300 anticyclonic eddies) and their corresponding labeled images. The 600 images were carefully sorted according to common obvious eddy results identified by DEDNet’s and the python-eddy-tracker software (PET-14). After that, the expert adjusted the 600 images’ results by visual correcting and transformed them to the source data. The target data consisted of 5904 current field images. Wherein, 2000 of them were annotated by PET-14 (PET-14 is an auxiliary eddy detection software and cannot represent ground-truth results). After being annotated by PET-14, each labeled current field image consisted of three categories of pixels: (1) the white pixel represented the background or no eddy data; (2) the red pixel represented cyclonic eddy, and; (3) the green pixel represented anticyclonic eddy.
Figure 3 depicts an example of the eddy detection training couple. The left panel of
Figure 3 is the original current field map, and the right panel is the segmentation result obtained by PET-14.
Next, CDEDNet was used in the target data to determine the eddy position and determine the eddy type in the remaining 3904 images.
5. Test Results and Discussion
All the three architectures introduced earlier were implemented on the Tensorflow backend of the Pytorch framework. The hardware platform included an Intel I5-6600K CPU at 3.5GHz and an NVidia GTX 1060 GPU card with a 6 GB memory. The training dataset of CDEDNet-0 only contained 2000 automatically annotated current field maps. The training dataset of CDEDNet-1 consisted of 2000 automatically annotated current fields and 600 transferred manually annotated current maps. The CDEDNet-2 only used 2000 automatically annotated current maps and the pretrained FCN-based eddy detection neural network based on the XASL dataset. The ground truth results were obtained based on the results from PET-14, the VG and WA algorithms, as well as expert visual observation. This method produced the most accurate ground truth results with the least labor cost.
5.1. Fundamental Architecture
The three architectures were sequentially implemented for eddy detection from the NSXW dataset.
Figure 7 depicts the eddy detection results of three architectures.
Figure 7a is the original current field, which mainly shows the current, coastline, and HFR site.
Figure 7b shows the ground-truth result obtained using a combinational method based on PET-14, the VG algorithm, the WA algorithm, and expert visual observation.
Figure 7c–e represents the detection results obtained by CDEDNet-0, CDEDNet-1, and CDEDNet-2, respectively. Predicted masks were obtained as the processing output in
Figure 7c–e, which represent the eddy positions in the original current fields.
The ground-truth results clearly depicted a distinct eddy boundary and the wrapper gradation of the streamline. The eddy boundaries detected by CDEDNets were more or less influenced by the surrounding streamline features. Wherein, the eddy boundary obtained from CDEDNet-0 was difficult to be recognized, e.g., the result from the third row of (c) shows obvious error. It could be concluded that CDEDNet-1 and CDEDNet-2 performed better than CDEDNet-0, but it was difficult to distinguish the difference between the performances of CDEDNet-1 and CDEDNet-2.
CDEDNet-1 and CDEDNet-2 had similar performance in identifying the boundary. The eddy in the image of the first row showed relatively complex streamlines. It was not a perfect elliptic eddy but had both sparse and dense streamlines. However, CDEDNet-1 and CDEDNet-2 performed well for this current field. The boundaries identified by them were approximately the outermost closed curve. As for eddy location, all three CDEDNets obtained results that agreed well with the ground-truth data.
A quantitative comparison of the three CDEDNets was also conducted.
Table 1 lists the accuracy comparison from 30 experiments. In
Table 1, the accuracy for each eddy type is defined as the ratio of correctly detected positive samples over all true positive samples of that type. Additionally, IoU means Intersection over Union. CDEDNet-1, CDEDNet-2, and CDEDNet-0 ranked, respectively, first, second and third in terms of accuracy. Wherein, the results of CDEDNet-1 and CDEDNet-2 were close to each other, with a difference of only 0.009. The detection accuracies of cyclonic eddies and no eddies were higher than that of anticyclonic eddies. This was partially due to the imbalance of samples in the categories. The number of anticyclonic eddies was less than that of cyclonic eddies and there were no eddies in the training data.
In conclusion, CDEDNet-1 and CDEDNet-2 were better than CDEDNet-0. CDEDNet-1 benefited from the additional transferred similar data for training. Enlarged training data size improved the model performance. With more and more HFRs being installed, CDEDNet-1 has the potential for wide application. CDEDNet-2 benefited from the frozen general eddy feature layers trained from other detection experiments. The special NSXW’s eddy feature could be directly identified from the 2000 training images. As a result, CDEDNet-2 improved the accuracy by 0.062 compared to CDECNet-0.
5.2. Detection Results
The objective of using CDEDNet and HFRs data was to provide a summary of the eddy situation analysis and valuable suggestions to the local fisheries and seafaring in the Nanshan and Xuwen regions. After a comprehensive analysis of the detection results, the findings were summarized in the following three aspects.
The first was an overview of the eddy situation. During the 88 days, HFRs generateded 5904 current maps, wherein 2123 current fields contained eddies. The current field maps with eddies were divided into three types: 1960 maps with a single-eddy, 147 with two eddies, and 16 with three eddies. The total number of eddies was 2301.
Figure 8 depicts the current field with two or three eddies. After the eddy positions were determined from the masks, the streamlines in each mask region were calculated to indicate the internal eddy structure.
The second aspect was the relationship between the eddy radius and eddy number.
Figure 9 illustrates the number of eddies with different radius. It could be concluded that the number of eddies decreased with the increment of radius. The radius of most eddies ranged from 4 km to 16 km and a small number of eddies had a radius larger than 30 km. As we know, smaller eddies appear more frequently since they can be easily generated by local disturbances. Thus, if a radar with a better sampling resolution is available, more eddies with a smaller radius may be observed. Because the observed region was in the off-shore area, two currents with opposite directions encountered in the area could intrude and wrap around each other, and gradually generate a small-scale eddy. With more opposite-direction currents joining, the scale of the eddy could gradually enlarge. However, considering that the topography of the sea-floor was not complicated and the off-shore area had a relatively stable environment, the number of current pairs with opposite directions was limited and only enough for generating eddies with a radius less than 32 km. Meanwhile, the number of cyclonic eddies was about 60 more than that of anticyclonic eddies on each level of radius. However, there was no obvious relationship between the eddy type and eddy radius.
The third aspect was the occurring period of the eddy. It was found that 2301 eddies were evenly distributed at each period of 24 hours. Wherein, the two periods of 21:00 to 3:00 and 10:00 to 14:00 had a relatively higher possibility for the occurrence of smaller eddies. Considering that the eddy frequently occured due to the fast change in temperature during these two periods, it suggested that the local marine activities should be reduced due to a possibly dangerous eddy.
5.3. Eddy Phenomenon
During the 88-days observation, the merging phenomenon of two eddies was also captured by the OSMAR-S radar. Although the mergers were extracted from different periods, the different stages of the entire eddy merging process were recorded.
Figure 10 shows four different merging stages of two eddies rotating in the same direction: encounter, contact, partial merger, and complete merger. In the encounter stage, the two eddies attracted each other and the distance between their centers decreased with time. In the contact stage, the outermost currents between two eddies interacted with each other, and then gradually merge into one current. In the partial merger stage, while the centers of two eddies approached each other, more and more inner currents of the eddy start to flow together into one layer. In the complete merger stage, the strong eddy wrapped around the weak one till the two eddies merged into one eddy. It is worth noting that two eddies rotating in opposite directions hardly merge (see
Figure 11).
The radar also successfully observed the dynamic trajectory of the eddy with an automatic eddy tracking algorithm (when counting the number of eddies, the eddies belonging to the same trajectory were counted only once).
Figure 12 depicts the eddy dynamic trajectory from 11:20 to 14:20 on 30 August 2017. The eddy was originated from the southeast part of the observed region. Then it gradually moved to near the center, which took about 1 h 20 min. After arriving near the central region, the eddy moved west toward the shore. Finally, it stayed at a distance from the shore and gradually became weaker with time.
6. Conclusions
In this paper, in order to explore the eddy situation in Nansan and Xuwen, a cross-domain eddy detection neural network is constructed based on HFR field data. Three architectures are presented for handling different challenges in deep eddy detection. The first architecture, CDEDNet-0 uses an FCN-based network to extract deep-level eddy features and determine the eddy’s location. The second architecture CDEDNet-1 incorporates an instance-based domain adaption method to enlarge the training dataset to solve the problem of insufficient effective labeled data. The third architecture, CDEDNet-2 adopts parameter-based transfer learning for multi-scene eddy detection, which enables inheriting previous detection experience. The experiment demonstrates that CDEDNet-1 and CDEDNet-2 perform better than CDEDNet-0 in terms of accuracy. CDEDNet-1 and CDEDNet-2 have similar results in the segmentation index. Meanwhile, the analysis illustrates that an increased eddy radius leads to a decreased eddy number, and the eddy frequently occurs during two periods, 21:00 to 3:00 and 10:00 to 14:00, in the Nansan and Xuwen regions. The phenomena of eddy merger and eddy dynamic trajectory are also recorded and analyzed. The multi-eddy phenomenon is also recorded and discussed. The results of the eddy phenomenon investigation are significant for providing reasonable suggestions to local fishery and seafaring. Moreover, these results are expected to provide more observation data for physical oceanography researches on submesoscale eddies.
It should be pointed out that certain QC/QA measures have been taken while generating the datasets used in this study, but some outliers are still observed near the boundary and baseline of the radar sites. Although these outliers showed little impact on the eddy detection results in this study, more effort should be expended to develop more efficient QC/QA measures to enhance the reliability and accuracy of current measurement and eddy detection in the area near the boundary and baseline of the radar sites.
Our future work will consider but is not limited to the following directions. The first is to further test the effectiveness of the proposed method using more HFRs data collected from different regions and periods although the results based on the limited data used in this study show it performs well. The second is to investigate the transfer of multi-source heterogeneous models. The heterogeneous model can support different structures and parameters for network merging and improve model diversity. The third is to fuse data from other sensors (e.g., synthetic aperture radar) for eddies detection.