1. Introduction
Classical seismic acquisition networks rely on cable-based systems. With the increase in the surveying area, cable-based systems are not practical owing to the weight and cost of the cables. Therefore, there is a need for a robust wireless seismic network that can transmit hundreds of recordings from geophones to the data center. This poses two major challenges: the data from a geophone must reach the data center in a timely manner while retaining the quality. The timing issue has been addressed in the previous studies [
1,
2]. However, the wireless transmission also requires mitigating the channel effects, like interference, multi-path fading, and noise. In traditional wireless systems, periodic training signals are transmitted in order to estimate and ultimately remove the channel effects. The inclusion of additional training signals on top of a giant amount of seismic data makes the situation even worse, i.e., more bandwidth and time are required for transmission [
3]. None of the previous studies (see in [
1,
2], and references therein) focuses on this important issue. Most of the work done is related to the design of the wireless seismic system while completely ignoring the wireless link (channel) impairments.
In this work, efficient mitigation of the channel effects without using additional training signals is proposed. The proposed method relies on the blind system identification and deep neural networks approach. Blind system identification is a mature field [
4] that estimates the channel without using any training signals. However, channel estimation using this method highly depends on the additive noise. Therefore, deep neural networks are used to alleviate this vulnerability.
Deep neural networks require data for training in a supervised learning setup [
5,
6]. In the proposed setting, synthetic data (generated using a realistic model) are used for this purpose. Therefore, once the network is trained offline, then it is suitable for real-time implementation without the need of field (real) data for training. The method consists of two deep neural networks: one is used for signal-to-noise ratio (SNR) enhancement and the other for classification. The blind system identification methods are effective in low-noise environments without the need for SNR improvement. As the SNR is not known at the data center, the classifier network decides whether to pass geophone data through the SNR enhancement network or not.
Furthermore, the geophone setup environment is stationary, i.e., geophones or the data center are fixed at the locations for several shots. This added advantage of a stationary environment is used together with blind system identification for improving the estimation of the channel impulse response.
The main contributions of this work are summarized as follows:
Estimating the channel impulse response using blind identification method. The estimation is further improved by taking the stationary environment into account.
Enhancing the SNR using deep convolutional neural networks by taking into account the features of seismic data in the frequency domain.
Classification of geophone data using a deep fully connected neural network in order to decide about the need for SNR enhancement. This point addresses the practical implementation aspect of the proposed method.
The rest of the paper is organized as follows. The blind system identification is covered in
Section 2.
Section 3 discusses SNR enhancement, while
Section 4 presents simulation results. Finally,
Section 5 draws the conclusion.
2. Blind Channel Identification
Blind channel identification methods rely on a multichannel framework that is obtained either by using an array of antennas at the receiver side or oversampling the received signal. Both scenarios are best suited for the situation at hand. The reason being that the geophone already has to transmit a large amount of data under tight timing constraints [
2] so oversampling at geophone is not feasible. Furthermore, oversampling or multiple antennas at a geophone also increases the processing load on battery-driven wireless geophone. Therefore, for multichannel blind system identification, the load (oversampling and multiple antennas) is shifted to the data center where power and processing requirements are relaxed.
Assuming that a single geophone data passes through
m independent channels before reaching the data center (this is achieved by oversampling or array of antennas at the receiver). The discrete channel model for the window of
M received samples is obtained by stacking the data into a vector/matrix representation and it is given as follows:
where the received data is
and the transmitted digital modulated data is
. The additive random noise
is stacked in a similar way to
, and
is an
block Toeplitz matrix given as
where
,
, and
,
. Let’s
be the desired vector containing all the channel’s taps then the objective is to estimate these channels’ impulse responses, i.e.,
using the observation data in (
1). There are many subspace-based methods for blind system identification (see in [
4] and references therein). Here, the method proposed in [
4], namely, structured-based subspace method (SSS), is used. This method claims to be efficient in ill-conditioned channel matrices. In this approach, one searches for the system/channel matrix
in the form
such that the orthogonality criterion is set to be equal to zero, i.e.,
. Furthermore,
is chosen such that the resulting matrix is close to the desired block Toepliz structure. The columns of matrices
and
spans the signal and noise subspace, respectively.
Note that the geophones and the data center locations are fixed for several shots, therefore the channel impulse response is not expected to change. This additional advantage is used to further enhance the channel estimation
. As the seismic shooting process is repeated over and over again, the channels’ impulse response for the
shot is updated according to the following recursion
where
is the estimated channels’ impulse response using SSS method and
is the updated estimation at the
shot. For seismic signal recovery,
shots are used to get the final channel response
and then optimum equalization method, i.e., maximum likelihood sequence estimation (MLSE) [
7], is used to obtain a robust estimation of the transmitted signal
. The MLSE is a computationally expensive technique; however, the equalization is done offline at the data center. Therefore, it is worth using the computationally intensive approach to achieve the best reconstruction of the seismic signal.
4. Results and Discussion
To train the deep neural networks, synthetic data are generated using the Marmousi Model [
15]. This model is often used in exploration seismology as a standard case study. The receivers are placed at a distance of 50 m in the horizontal axis and the shots are sequentially generated at the same location as every other receiver. The shot records are generated using the various seismic signatures, i.e., Ormsby wavelet with frequencies
,
,
,
,
,
,
, and
Hz. The use of different frequencies for the Ormsby wavelet is to ensure well-trained deep neural networks that can be used for a variety of data sets. The frequency spectrum of the Ormsby wavelet is of trapezoidal shape which gives more flexible control on frequency domain than Ricker wavelet. This helps reconstruct the seismograms similar to the real seismic data. For the generation of synthetic data, the Matlab package [
16] is used. The sampling frequency is set to 4 kHz. Furthermore, the training data are randomized before being given to the neural networks and then shuffled after every epoch. The input (predictors
P) and output (targets
T) training data are z-normalized for both neural networks as follows:
where
(
) and
(
) are the mean and variance of
P(
T), respectively. Furthermore, 10% of the data are used for validation. The neural networks are first trained on the synthetic data obtained using the Marmousi model. To obtain noisy synthetic data, the raw data are converted from analog to digital and then randomly flipped
of the bits corresponding to a trace. The noisy data are mixed with the original raw data for training the classifier neural network.
In order to verify the performance, the classifier neural network is tested on publicly available seismic data from Utah Tomography & Modeling/Mitigation Consortium (UTAM) [
17]. In this dataset, each trace has 4000 samples with the sampling frequency of
kHz. Before the preprocessing stage, the data are cleaned by removing bad traces due to geophones not operating properly. The channel length is set to
and Rayleigh fading is assumed. Furthermore,
,
, and Binary Phase Shift Keying (BPSK) modulation is used in the simulations.
Figure 5 shows a comparison of the state-of-the-art SSS method with the proposed one. The estimation of the channel impulse response is enhanced tremendously. As the estimation is leveling off around 50 shots, the final estimated channel impulse response is
. Moreover, a large variation in estimation is observed for various traces in the case of the SSS method due to the random noise. Therefore, in some traces recovery might be worst and in turn, further seismic data processing steps like stacking will give very bad overall results (due to error accumulation).
The SSS method, its proposed improvement, and SNR enhancement with or without the classifier network are compared and reported in
Figure 6. SNR is obtained by averaging the respective values over all traces. It can be seen in
Figure 6 that significant performance improvement is achieved through the use of pre-trained deep neural networks, in particular at high noise levels. For instance, at the SNR of
dB of the received data, a nearly 10 dB increase in SNR is achieved while at the SNR of 0 dB of the received data, about 120 dB increase in SNR is observed. It is noteworthy to argue that with only the SNR enhancement network the performance deteriorates at low noisy environments due to the robust estimation of the channel impulse response. The SNR of received data, in practical scenarios, is not known. Therefore, it is worth using a pre-trained classifier network without transmitting any training signal to the data center for estimating SNR. Furthermore, to the best of the authors’ knowledge, the proposed method is developed for the first time considering the wireless seismic system and, therefore, comparison with the other methods is not possible.
In order to show the SNR improvement single trace, the proposed method is simulated for the SNR of received data of
dB.
Figure 7 compares the reconstructed trace with the original trace.
Figure 7a–c depicts the original trace, reconstructed trace after MLSE without denoising, and reconstructed trace after denoising, respectively. The improvement can be seen before and after using pre-trained neural networks. The spikes in the received data are pretty much removed by the denoising network which is clear from the zoomed version of
Figure 7a–c. Some examples from the experimental results using real/field data with different levels of noise are shown in
Appendix A.
The window size
M is also crucial and the blind channel estimation performance depends on it. The blind algorithm highly is dependent on the structure of the channel matrix
. Therefore, it is shown in
Figure 8 that when the window size
M has a small value, the structure of the channel matrix is poor and the performance suffers. However, increasing the value of
M beyond 15 (large channel matrix) causes the minimization error to increase, resulting in a decrease in performance.