Frequency-domain equalization for OFDMA-based multiuser MIMO systems with improper modulation schemes
EURASIP Journal on Advances in Signal Processing volume 2011, Article number: 73 (2011)
Abstract
In this paper, we propose a novel transceiver structure for orthogonal frequency division multiple access-based uplink multiuser multiple-input multiple-output systems. The numerical results show that the proposed frequency-domain equalization schemes significantly outperform conventional linear minimum mean square error-based equalizers in terms of bit error rate performance with moderate increase in computational complexity.
1 Introduction
Multiple-input multiple-output (MIMO) techniques in combination with orthogonal frequency division multiple access (OFDMA) have been adopted by most 4G air interfaces, e.g., WiMAX, long-term evolution (LTE), IEEE 802.20, and Wireless Broadband. In the IEEE 802.16e mobile WiMAX standard, OFDMA has been adopted for both downlink and uplink transmission [1, 2]. In 3GPP LTE, single-carrier (SC) frequency division multiple access (FDMA) is used for uplink transmission, whereas the OFDMA signaling format is used for downlink transmission [3]. There are also proposals to use OFDMA for uplink transmission in the LTE-Advanced (LTE-A) standard, in which both SC-FDMA and OFDMA can be considered for uplink transmission.
This paper investigates receiver algorithms for the uplink of OFDMA-based multi-user MIMO systems. Frequency-domain equalization (FDE) is commonly used for OFDMA. This includes frequency-domain linear equalization (FD-LE) [4], decision feedback equalization (DFE) [5, 6], and the more recent turbo equalization (TE) [7, 8]. FD-LE is analogous to time-domain LE. A zero-forcing (ZF) LE [9] eliminates intersymbol interference (ISI) completely but degrades the system performance due to noise enhancement. Superior performance can be achieved by using the minimum mean square error (MMSE) criterion [9], which accounts for additive noise in addition to ISI. In OFDMA, a DFE results in better performance than an LE due to its ability to remove past echo ISI. However, a DFE is prone to error propagation when incorrect decisions are fed back. Consequently, it suffers from a performance loss for long error bursts. TE improves performance by adding complexity at the receiver through an iterative process, in which feedback information obtained from the decoder is incorporated into the equalizer at the next iteration. The iterative processing allows for reduction of ISI, multistream interference, and noise by exchanging extrinsic information between the equalizer and the decoder [7, 8].
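As a minimal sketch of the noise-enhancement argument above, the following compares one-tap ZF and MMSE frequency-domain coefficients for a single-antenna link. The two-tap channel, the noise variance `n0`, and all function names are illustrative assumptions and are not taken from this paper.

```python
import numpy as np

def zf_taps(h_freq):
    # Zero forcing inverts the channel on every subcarrier.
    return 1.0 / h_freq

def mmse_taps(h_freq, n0, es=1.0):
    # MMSE regularizes the inversion by the noise-to-signal ratio n0 / es.
    return np.conj(h_freq) / (np.abs(h_freq) ** 2 + n0 / es)

def output_noise_power(taps, n0):
    # Average noise power after equalization, for white input noise of variance n0.
    return n0 * np.mean(np.abs(taps) ** 2)

h_time = np.array([1.0, 0.9])          # hypothetical channel with a deep spectral notch
h_freq = np.fft.fft(h_time, 16)        # frequency response on 16 subcarriers
n0 = 0.1
zf_noise = output_noise_power(zf_taps(h_freq), n0)
mmse_noise = output_noise_power(mmse_taps(h_freq, n0), n0)
```

ZF removes the channel distortion exactly but amplifies noise near the notch, while the MMSE taps keep the output noise lower at the price of some residual ISI.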
The second-order properties of a complex random process are completely characterized by its autocorrelation function together with its pseudo-autocorrelation function [10]. Most existing studies on receiver algorithms only exploit the information contained in the autocorrelation function of the observed signal. The pseudo-autocorrelation function is usually not considered and is implicitly assumed to be zero. This is the optimal strategy when dealing with proper complex random processes [11]. It is sub-optimal, however, when the transmitted signals and/or interference are improper complex random processes, for which the pseudo-autocorrelation function is non-vanishing. In that case, the performance of a linear receiver can be improved by the use of widely linear processing (WLP) [12]. Such a scenario arises when transmitting symbols with improper modulation formats (e.g., ASK and OQPSK) over complex channels. It was shown in Schreier et al. [10] that the performance gain of WLP compared to conventional processing in terms of mean square error can be as large as a factor of 2. MIMO transceiver design was considered in Mattera et al. [13] and Sterle [14], where it was shown that, when channel information is available at both the transmitter and the receiver and real-valued symbols are transmitted over complex channels, joint design of the precoder and decoder using WLP yields considerable performance gains over the conventional linear transceiver at the expense of a limited increase in computational complexity. By using the same principle, a real-valued MMSE (RV-MMSE) beamformer was developed in Chen et al. [15] for a binary phase shift keying (BPSK)-modulated system and was shown to offer significant enhancements over the standard complex-valued MMSE (CV-MMSE) design in terms of bit error rate performance and the number of supported users.
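The distinction between proper and improper constellations can be checked numerically. The sketch below computes the variance E[|s|^2] and the pseudo-variance E[s^2] of equiprobable, unit-energy QPSK and 4ASK symbols. The constellation scalings and the helper name `second_order_stats` are assumptions made for illustration.

```python
import numpy as np

def second_order_stats(constellation):
    # Variance E[|s|^2] and pseudo-variance E[s^2] for equiprobable symbols.
    s = np.asarray(constellation, dtype=complex)
    return np.mean(np.abs(s) ** 2), np.mean(s ** 2)

qpsk = np.array([1 + 1j, 1 - 1j, -1 + 1j, -1 - 1j]) / np.sqrt(2)
ask4 = np.array([-3, -1, 1, 3]) / np.sqrt(5)

var_qpsk, pvar_qpsk = second_order_stats(qpsk)   # proper: pseudo-variance vanishes
var_ask4, pvar_ask4 = second_order_stats(ask4)   # improper: pseudo-variance equals variance
```

For the real-valued 4ASK alphabet the pseudo-variance is as large as the variance itself, which is the extra second-order information a widely linear receiver can use.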
In this paper, we show that the conventional frequency-domain linear equalizer is suboptimal for improper signals and that performance can be greatly improved by applying widely linear processing and utilizing complete second-order statistics of improper signals.
Notations: we use upper bold-face letters to represent matrices and vectors. The (n, k)th element of a matrix A is represented by [A]_{n,k}, the nth element of a vector b is denoted by [b]_n, and the nth column of a matrix A is represented by (A)_n. Superscripts (·)^H, (·)^T, and (·)* denote the Hermitian transpose, transpose, and conjugate, respectively. E[·] denotes expectation (statistical averaging).
2 System model
The cellular multiple access system under study has n_R receive antennas at the BS and a single transmit antenna at the ith user terminal, i = 1, 2, ..., K_T, where K_T is the total number of users in the system. We consider the multi-user MIMO case with K (K ≤ K_T) users being served at each time slot and K = n_R. The system model for an OFDMA-based MIMO transmitter and receiver is shown in Figures 1 and 2, respectively. On the transmitter side, the user data block containing N symbols first goes through a subcarrier mapping block. These symbols are then mapped to M (M > N) orthogonal subcarriers, followed by an M-point inverse fast Fourier transform (IFFT) to convert them to a time-domain complex signal sequence.
There are two approaches to mapping subcarriers among mobile stations (MSs) [3]: localized mapping and distributed mapping. The former is usually referred to as localized FDMA transmission, while the latter is usually called distributed FDMA transmission scheme. With the localized FDMA transmission scheme, each user's data are transmitted by consecutive subcarriers, whereas with the distributed FDMA transmission scheme, the user's data are placed in subcarriers that are distributed across the OFDM symbol [3]. Because of the spreading of the information symbols across the entire signal band, the distributed FDMA scheme is more robust against frequency-selective fading and can thus achieve better frequency diversity gain. For localized FDMA transmission, in the presence of a frequency-selective fading channel, multiuser diversity and frequency diversity can also be achieved if each user is assigned to subcarriers with favorable transmission characteristics when the channel is known at the transmitter.
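The two mapping approaches can be illustrated with selection matrices. The sketch below builds an M × N mapping matrix for a localized and for a distributed allocation. The function name, the starting index, and the even spacing rule are hypothetical choices, not the allocation rules of any particular standard.

```python
import numpy as np

def mapping_matrix(M, N, start=0, spacing=1):
    # M x N selection matrix: column n has a single 1 at subcarrier start + n * spacing.
    idx = start + spacing * np.arange(N)
    if idx[-1] >= M:
        raise ValueError("allocation exceeds the number of subcarriers")
    F = np.zeros((M, N))
    F[idx, np.arange(N)] = 1.0
    return F

M, N = 256, 12
F_localized = mapping_matrix(M, N, start=24)                    # consecutive subcarriers
F_distributed = mapping_matrix(M, N, start=0, spacing=M // N)   # evenly spread subcarriers
```

Because every column selects a distinct subcarrier, the transpose of either matrix undoes the mapping, i.e., it acts as the N × M demapping matrix.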
In this work, we only consider localized FDMA transmission. A cyclic prefix (CP) is inserted into the signal sequence before it is passed to the radio frequency (RF) module. On the receiver side, the opposite operating procedures are performed after the noisy signals are received by the receive antennas. A MIMO frequency-domain equalizer (FDE) is applied to the frequency-domain signals after subcarrier demapping as shown in Figure 2. For simplicity, we employ a linear MMSE receiver, which provides a good tradeoff between the noise enhancement and the multiple stream interference mitigation [16].
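The transmit and receive chain described above can be sketched for a single user and a single antenna. The block size, FFT size, and CP length follow the values used in Section 5. The subcarrier allocation, the three-tap channel, and the function names are illustrative assumptions, and a noiseless one-tap equalizer stands in for the MIMO FDE.

```python
import numpy as np

M, N, CP = 256, 12, 8                     # FFT size, block size, CP length
subcarriers = np.arange(40, 40 + N)       # hypothetical localized allocation

def transmit(symbols):
    grid = np.zeros(M, dtype=complex)
    grid[subcarriers] = symbols           # subcarrier mapping
    x = np.fft.ifft(grid)                 # M-point IFFT
    return np.concatenate([x[-CP:], x])   # prepend the cyclic prefix

def receive(samples):
    x = samples[CP:CP + M]                # CP removal
    grid = np.fft.fft(x)                  # M-point FFT
    return grid[subcarriers]              # subcarrier demapping

rng = np.random.default_rng(0)
s = rng.choice(np.array([-3, -1, 1, 3]) / np.sqrt(5), size=N)   # one 4ASK block
h = np.array([0.8, 0.5 + 0.3j, 0.2j])    # hypothetical channel, shorter than the CP
rx = np.convolve(transmit(s), h)          # noiseless linear convolution with the channel
H = np.fft.fft(h, M)[subcarriers]         # per-subcarrier channel gain
s_hat = receive(rx) / H                   # one-tap frequency-domain equalization
```

Since the CP is longer than the channel memory, the linear convolution looks circular after CP removal, so each subcarrier sees a single complex gain.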
In the following, we let and denote by F M the M × M Fourier matrix with the element where k, m ∈ {1, ..., M} are the sample number and the subcarrier number, respectively. Here, ⊗ is the Kronecker product, and I K is the K × K identity matrix. We denote by the KM × KM matrix where is the M × M inverse Fourier matrix with element . Furthermore, we let F n represent the subcarrier mapping matrix of size M × N. Then, is the subcarrier demapping matrix of size N × M.
The received signal after the RF module and CP removal becomes , where is the data sequence of all K users, and x i ∈ ℂN ×1, i ∈ {1, ..., K}, is the transmitted user data block for the i th user; is a circularly symmetric complex Gaussian noise vector with zero mean and covariance matrix , i.e., ; is the n R M × KM channel matrix.
The signal after performing the FFT operation, subcarrier demapping, and employing a MIMO FDE is given by
where
is the channel matrix in the frequency domain and r = HPs + w; G is the KN × KN equalization matrix; is a circularly symmetric complex Gaussian noise vector with zero mean and covariance matrix , i.e., . The vector x can be expressed as x = Ps, where and s_i ∈ ℂ^{N×1}, i ∈ {1, 2, ..., K}, is the user data block for the ith user, and . The power loading matrix P ∈ ℝ^{KN×KN} is a block diagonal matrix with its ith sub-matrix expressed as and p_{i,n} (i ∈ {1, 2, ..., K}) is the transmitted power for the ith user at the nth subcarrier; s ∈ ℂ^{KN×1} represents the transmitted data symbol vector from different users with
When proper modulation schemes are employed, the conventional equalizer G can be derived from the cost function . Minimizing this cost function leads to the optimal solution
where is the autocorrelation matrix of the observation vector r; is the cross-correlation matrix between the observation vector r and the symbol vector s.
Note that the aforementioned FDE is a joint equalization algorithm, i.e., the transmitted symbols from different users are jointly equalized. To achieve spatial multiplexing gain, symbols from different users are assigned to the same subcarriers in the studied OFDMA-based multiuser MIMO system. Due to co-channel interference (causing the channel matrix H to be non-diagonal), we need to perform joint equalization for the transmitted symbols from different users.
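A minimal sketch of the joint linear MMSE equalizer follows. It assumes unit-energy uncorrelated symbols, white noise of variance `n0`, and the convention that the equalizer output is z = G^H r with C_rs = E[r s^H]. The random dense matrix is only a stand-in for the frequency-domain channel H, and the function name is hypothetical.

```python
import numpy as np

def lmmse_equalizer(H, P, n0):
    # Assumes E[s s^H] = I and E[w w^H] = n0 * I.
    A = H @ P                                           # effective channel HP
    C_rr = A @ A.conj().T + n0 * np.eye(A.shape[0])     # autocorrelation of r
    C_rs = A                                            # cross-correlation E[r s^H]
    return np.linalg.solve(C_rr, C_rs)                  # G = C_rr^{-1} C_rs, used as z = G^H r

rng = np.random.default_rng(1)
K, N = 2, 4
H = (rng.standard_normal((K * N, K * N)) + 1j * rng.standard_normal((K * N, K * N))) / np.sqrt(2)
P = np.eye(K * N)                                       # equal power loading
n0 = 0.1
G = lmmse_equalizer(H, P, n0)
mse = np.real(np.trace(np.eye(K * N) - (H @ P).conj().T @ G))   # total MSE over all symbols
```

The solution satisfies the orthogonality condition G^H C_rr = C_rs^H, so the estimation error is uncorrelated with the observation.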
3 The proposed frequency-domain receiver algorithm
In the previous section, we presented the conventional linear MMSE solution for the uplink of OFDMA-based multiuser MIMO systems. It is designed based on the autocorrelation matrix C_rr and the cross-correlation matrix C_rs. It is only optimal for systems with proper modulation, such as M-QAM and M-PSK, for which the pseudo-autocorrelation and the pseudo-cross-correlation are zero when M > 2. However, for improper modulation schemes, such as M-ary ASK and OQPSK (for which both the pseudo-autocorrelation and the pseudo-cross-correlation are non-zero), the conventional solution becomes suboptimal because the pseudo-autocorrelation and the pseudo-cross-correlation are not taken into consideration in the receiver design. In order to utilize them, we need to apply widely linear processing [10, 12], the principle of which is to process not only r but also its conjugated version r* in order to derive the filter output, i.e.,
where and . It is worth noticing that the conventional linear MMSE receiver is a special case of the one expressed by (3), when and G 1 = 0.
To derive the improved FDE, we re-define the detection error as . According to the orthogonality principle [17], the mean-square value of the estimation error ε is minimum if and only if it is orthogonal to the observation vector y, i.e.,
leading to the solution , where
and
Based on the above derivations, we can form the optimal solution for Ψ as
For the proposed FDE, the augmented autocorrelation matrix C yy and cross-correlation matrix C ys expressed in (5), which give a complete second-order description of the received signal, are used to derive the filter coefficient matrix Ψ. On the other hand, for the conventional linear MMSE algorithm, the coefficient matrix G is calculated using only the autocorrelation of the observation C rr and the cross-correlation C rs . The pseudo-autocorrelation and pseudo-cross-correlation are implicitly assumed to be zero, leading to sub-optimal solutions.
For proper signals like QAM and PSK, the improved FDE converges to the conventional FDE since , leading to and . Therefore, and in Eq. (5). The optimal solution of Ψ can be simplified to
which is exactly the same as Eq. (2) for the conventional FDE.
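The relationship between the two equalizers can be checked numerically. The sketch below forms the augmented observation y = [r; r*] and solves for the widely linear MMSE filter under the assumed convention z = Ψ^H y, with unit-energy symbols whose pseudo-covariance is `pseudo` times the identity (1 for real-valued symbols such as ASK, 0 for proper symbols). The function name, the random effective channel `A` standing in for HP, and the stacking order are illustrative assumptions.

```python
import numpy as np

def wl_mmse(A, n0, pseudo=1.0):
    # A = HP; symbols satisfy E[s s^H] = I and E[s s^T] = pseudo * I.
    n = A.shape[0]
    C_rr = A @ A.conj().T + n0 * np.eye(n)        # E[r r^H]
    Ct_rr = pseudo * (A @ A.T)                    # E[r r^T]; circular noise adds nothing
    C_yy = np.block([[C_rr, Ct_rr], [Ct_rr.conj(), C_rr.conj()]])
    C_ys = np.vstack([A, pseudo * A.conj()])      # E[y s^H] with y = [r; r*]
    Psi = np.linalg.solve(C_yy, C_ys)             # widely linear filter, z = Psi^H y
    mse = np.real(np.trace(np.eye(A.shape[1]) - C_ys.conj().T @ Psi))
    return Psi, mse

rng = np.random.default_rng(3)
A = (rng.standard_normal((4, 4)) + 1j * rng.standard_normal((4, 4))) / np.sqrt(2)
n0 = 0.1
Psi_proper, mse_proper = wl_mmse(A, n0, pseudo=0.0)       # reduces to the linear MMSE filter
Psi_improper, mse_improper = wl_mmse(A, n0, pseudo=1.0)   # exploits the pseudo-correlation
```

With `pseudo=0.0` the part of the filter acting on r* is zero and the remaining part equals the conventional solution, whereas with `pseudo=1.0` the total MSE is lower for the same channel and noise level.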
The improved FDE has higher computational complexity than the conventional FDE. The difference in complexity lies in the computation of the matrix G for the conventional equalizer and the computation of Ψ for the improved equalizer, as indicated in Table 1, where we show the number of complex multiplication (×), division (÷), addition (+), and subtraction (-) operations needed to calculate G and Ψ, respectively. In the complexity calculation, we use the fact that the inversion of an L × L matrix involves 2L^2 divisions, 2L^3 multiplications, and 2L^3 subtractions. It should also be noted that the complexity increase of the improved scheme is compensated for by the significant performance improvement. Furthermore, this issue becomes less critical in slow-fading channels, for which the equalizer matrices do not need to be updated frequently.
In Figure 3, we show the number of flops required to compute the matrix G (for the conventional FDE) and the matrix Ψ (for the improved FDE) as a function of the data block size N for a 2-user case. One flop is counted as one real operation, which can be an addition, subtraction, multiplication, or division [18]. A complex division requires 6 real multiplications, 3 real additions/subtractions, and 2 real divisions. A complex multiplication requires 4 real multiplications and 2 real additions. It is evident from Figure 3 that the additional operations required by the improved FDE are moderate when the block size is small, e.g., N < 10, and increase significantly as the block size increases. For example, the number of flops required by the improved FDE is 4.5 times that required by the conventional FDE when N = 12. Therefore, for efficient implementation, it is necessary to break the received data into blocks of moderate size before the equalization is applied.
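The flop-counting rules quoted above can be turned into a small calculation. The sketch below covers only the matrix inversion term, assuming a direct inversion of the KN × KN autocorrelation matrix for the conventional FDE and of the 2KN × 2KN augmented matrix for the improved FDE. The remaining products listed in Table 1 are not included, so the resulting ratio is not expected to match Figure 3. The function name is hypothetical.

```python
def inversion_flops(L):
    # Real flops per complex operation, following the counting rules in the text.
    cdiv = 6 + 3 + 2      # 6 real multiplications, 3 real additions/subtractions, 2 real divisions
    cmul = 4 + 2          # 4 real multiplications, 2 real additions
    csub = 2              # one complex subtraction is two real subtractions
    # L x L inversion: 2L^2 divisions, 2L^3 multiplications, 2L^3 subtractions.
    return 2 * L**2 * cdiv + 2 * L**3 * cmul + 2 * L**3 * csub

K, N = 2, 12
conventional = inversion_flops(K * N)       # inversion of a KN x KN matrix
improved = inversion_flops(2 * K * N)       # inversion of a 2KN x 2KN augmented matrix
ratio = improved / conventional
```

The cubic term dominates, so doubling the matrix dimension costs close to a factor of eight for the inversion alone, which is why moderate block sizes are attractive.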
4 The proposed iterative receiver algorithm
In this section, we derive an iterative FDE algorithm by applying WLP and exploiting the complete second-order statistics of the improper signals. Recall that the received signal after CP removal, FFT and subcarrier demapping can be expressed as
where the symbol vector . Let us assume that symbol s n is to be decoded. By using the iterative interference cancelation technique [8, 19, 20], the received vector can be expressed as
where r n is the interference canceled version of r, and
which contains the soft estimate of the interfering symbols from the previous iteration. Note that (8) represents a decision-directed iterative scheme, where the detection procedure at the p th iteration uses the symbol estimates from the (p - 1)th iteration. The performance is improved in an iterative manner due to the fact that the symbols are more accurately estimated (leading to better interference cancelation) as the iterative procedure goes on. For simplicity, the iteration index is omitted, whenever no ambiguity arises.
In order to further suppress the residual interference in r n , an instantaneous linear filter is applied to r n , to obtain , where the filter coefficient vector g n ∈ ℂN K× 1is chosen by minimizing , under the MMSE criterion. It can be derived as
where (HP) n is the n th column of the matrix HP. The matrix V n ∈ ℝ N K× 1is formed as
where , and . Refer to Wautelet et al. [19], Wang and Li [20], and Tuchler et al. [8] for a detailed description of this conventional iterative algorithm.
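One detection step of a conventional MMSE soft interference cancelation receiver can be sketched as follows. The code assumes unit-energy symbols, white noise of variance `n0`, the filter convention z_n = g_n^H r_n, and that the desired symbol keeps unit variance while the interferers keep their residual variances. The function name and the random matrix `A` standing in for HP are hypothetical.

```python
import numpy as np

def sic_mmse_output(r, A, s_bar, var, n, n0):
    # A = HP; s_bar and var hold soft means and residual variances from the previous iteration.
    s_int = s_bar.copy()
    s_int[n] = 0.0                                 # do not cancel the symbol of interest
    r_n = r - A @ s_int                            # interference-canceled observation
    v = var.astype(float)                          # astype returns a copy
    v[n] = 1.0                                     # unit energy for the desired symbol
    C = (A * v) @ A.conj().T + n0 * np.eye(A.shape[0])   # A diag(v) A^H + n0 I
    g_n = np.linalg.solve(C, A[:, n])              # MMSE filter for symbol n
    mu_n = np.real(g_n.conj() @ A[:, n])           # gain seen by the desired symbol
    return g_n.conj() @ r_n, mu_n

rng = np.random.default_rng(2)
A = (rng.standard_normal((4, 4)) + 1j * rng.standard_normal((4, 4))) / np.sqrt(2)
s = rng.choice(np.array([-3.0, -1.0, 1.0, 3.0]) / np.sqrt(5), size=4)
# Noiseless observation with perfect soft estimates: interference is removed completely.
z, mu = sic_mmse_output(A @ s, A, s_bar=s, var=np.zeros(4), n=1, n0=0.1)
```

At the first iteration, with `s_bar` set to zero and `var` set to one, no interference is canceled and the filter coincides with the corresponding column of the linear MMSE equalizer.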
The conventional scheme suffers from the problem of error propagation caused by incorrect decisions. As will become evident in Section 5, the error propagation effect can be reduced and the system performance can be improved if we not only process r n but also its conjugated version in order to derive the filter output, i.e., , where and . The filter Ψ n can be derived by minimizing the MSE E{| e n |2}, where . According to the orthogonality principle,
leading to the solution
where
In what follows, we demonstrate how the vector in (9) and the matrix V n in (11) can be derived in order to carry out the iterative process. The filter output can be expressed as
where the combined noise and residual interference ν n are approximated as a Gaussian random variable [21], i.e., . The parameters μ n , N ν can be determined as [22]
After computing the values of μ n and N ν , the conditional probability density function (PDF) of the filter output can be obtained as
For M-ary PSK, QAM, ASK systems, each symbol s n corresponds to log2 M bits, denoted as , i = 1, ..., log2 M. The log-likelihood ratio (LLR) for the i th information bit can be computed as
where is the set of symbols {x m } whose i th bit takes the value of 1 (0); s+ denotes the symbol corresponding to , and s- denotes the symbol corresponding to .
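The bit LLR computation can be sketched under the Gaussian model z = μ s + ν for the filter output. The Gray labeling of 4ASK, the sign convention LLR = ln P(b = 1 | z) / P(b = 0 | z), the complex Gaussian form of the likelihood, and the function name are assumptions made for illustration. The `max_log` option keeps only the most likely symbol in each bit subset.

```python
import numpy as np

# Hypothetical Gray-labeled, unit-energy 4ASK constellation: row m of BITS labels SYMBOLS[m].
SYMBOLS = np.array([-3, -1, 1, 3]) / np.sqrt(5)
BITS = np.array([[0, 0], [0, 1], [1, 1], [1, 0]])

def bit_llrs(z, mu, n_nu, max_log=False):
    # Gaussian model z = mu * s + nu with residual variance n_nu.
    metric = -np.abs(z - mu * SYMBOLS) ** 2 / n_nu     # log-likelihood up to a constant
    llr = np.empty(BITS.shape[1])
    for i in range(BITS.shape[1]):
        plus = metric[BITS[:, i] == 1]                 # symbols whose ith bit is 1
        minus = metric[BITS[:, i] == 0]                # symbols whose ith bit is 0
        if max_log:
            llr[i] = plus.max() - minus.max()          # nearest symbol in each subset
        else:
            llr[i] = np.logaddexp.reduce(plus) - np.logaddexp.reduce(minus)
    return llr

llr_clean = bit_llrs(0.8 * SYMBOLS[3], mu=0.8, n_nu=0.1)   # output centered on the symbol labeled 10
llr_zero = bit_llrs(0.0, mu=0.8, n_nu=0.1)                 # output midway between the inner symbols
```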
The soft estimate in (9) and the variance var (s i ) in (11), respectively, can be calculated as [22]
where . The a priori probability of each symbol P r (s i ) can be calculated as , where
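The soft symbol estimate and its variance can be obtained from bit LLRs as sketched below. The code assumes independent bits, the convention LLR = ln P(b = 1) / P(b = 0), and a hypothetical Gray-labeled 4ASK alphabet. The function and array names are illustrative.

```python
import numpy as np

# Hypothetical Gray-labeled, unit-energy 4ASK constellation: row m of BITS labels SYMBOLS[m].
SYMBOLS = np.array([-3, -1, 1, 3]) / np.sqrt(5)
BITS = np.array([[0, 0], [0, 1], [1, 1], [1, 0]])

def soft_symbol(llr):
    # llr[i] = ln P(b_i = 1) / P(b_i = 0), assumed independent across bits.
    p1 = 1.0 / (1.0 + np.exp(-np.asarray(llr, dtype=float)))   # P(b_i = 1)
    bit_prob = np.where(BITS == 1, p1, 1.0 - p1)               # probability of each label bit
    p_sym = bit_prob.prod(axis=1)                              # a priori symbol probabilities
    mean = np.sum(p_sym * SYMBOLS)                             # soft estimate
    var = np.sum(p_sym * np.abs(SYMBOLS - mean) ** 2)          # residual variance
    return mean, var

mean_uninformed, var_uninformed = soft_symbol([0.0, 0.0])      # no prior knowledge
mean_confident, var_confident = soft_symbol([30.0, -30.0])     # near-certain label 10
```

With zero LLRs the soft estimate is zero and the variance equals the symbol energy, so nothing is canceled. With reliable LLRs the variance shrinks toward zero and the interference is removed almost completely.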
5 Simulation results
We consider a WiMAX baseline antenna configuration, in which two MSs are grouped together and synchronized to form a MIMO channel between the BS and the MSs. We assume a six-path fading channel, and the channel matrix is normalized such that the average channel gain for each transmitted symbol is equal to unity. The fading coefficients for each path are modeled as independent identically distributed (i.i.d.) complex Gaussian random variables. The channel is assumed to be fully interleaved, to have a uniform power delay profile, and to be slowly time-varying, so that it remains static during the transmission of one frame of data but varies from one frame to another. The block size of the user data is 12, which is also the number of subcarriers in a resource block. The size of the FFT is 256, and the length of the cyclic prefix (CP) is 8. The power loss incurred by the insertion of the CP is taken into account in the SNR calculation.
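A channel of the kind described above can be generated as in the following sketch. It assumes that the unit-gain normalization applies to each transmit-receive link, i.e., each of the six taps has variance 1/6. The function name and the array layout (receive antenna, user, path) are hypothetical.

```python
import numpy as np

def six_path_channel(rng, n_rx=2, n_users=2, n_paths=6, fft_size=256):
    # i.i.d. complex Gaussian taps, uniform power delay profile, unit average gain per link.
    shape = (n_rx, n_users, n_paths)
    taps = (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2 * n_paths)
    return taps, np.fft.fft(taps, fft_size, axis=-1)   # time-domain taps, frequency response

rng = np.random.default_rng(0)
taps, freq = six_path_channel(rng)
# Average link gain over many independent frames; it should be close to one.
gains = [np.sum(np.abs(six_path_channel(rng)[0]) ** 2, axis=-1) for _ in range(2000)]
avg_gain = np.mean(gains)
```

A new realization would be drawn for every frame, which matches the assumption that the channel is static within a frame and independent across frames. The six taps fit inside the CP of length 8.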
Figure 4 shows the bit error rate (BER) performance comparison between the conventional and the improved receivers for 4ASK and OQPSK systems. The improved receiver scheme significantly outperforms its conventional counterpart, especially at high SNRs, where the gap can be 5-6 dB or more. The curve for a QPSK system with the conventional receiver is also provided as a baseline. Note that for the conventional receiver, the BER performance for an OQPSK system is the same as for a QPSK system [23]. The performance of the QPSK system is superior to that of the 4ASK system with the conventional receiver but is inferior to that of the 4ASK system with the improved equalizer at high SNRs. Although QPSK modulation itself is more power efficient than 4ASK because it uses a signal constellation of 2 dimensions instead of 1, the 4ASK system can exploit the pseudo-autocorrelation function in the receiver design, whereas the QPSK system does not have this property to utilize. The net effect favors the 4ASK system. Refer to Sterle [24] for a detailed and quantitative analysis of the performance gain that can be achieved by a widely linear transceiver.
Figure 5 shows the BER performance comparison between the conventional and the improved FDE for 16ASK and 16QAM systems. For the 16ASK system, the improved receiver significantly outperforms its conventional counterpart and the performance gain increases as the SNR increases. Figure 5 also shows that the 16ASK system with the improved FDE performs better than the 16QAM system when SNR > 40 dB.
In Figure 6, we compare the performance of the proposed iterative FDE introduced in Section 4 with the conventional iterative FDE. The curves are plotted at the second iteration, since it has been observed that the major gain from the iterative process can be achieved with two iterations. The conclusions from the previous experiments also hold here: the QPSK system performs better than the 4ASK system with the conventional iterative FDE, but it is inferior to the 4ASK system with the improved iterative FDE. The performance gain can be over 4 dB at high SNR. The gain achieved by the iterative process can be determined by comparing Figure 6 with Figure 4. For example, in order to achieve a target BER of 10^-3, an SNR of 28 dB is required for the 4ASK system with the proposed non-iterative FDE, while only 25 dB is required by the proposed iterative FDE at the second iteration.
6 Conclusion
In this paper, we derived an improved FDE algorithm for an OFDMA-based multiuser MIMO system with improper signal constellations. Our simulation results reveal that the proposed scheme has superior BER performance compared with the conventional FDE. We also presented a novel iterative FDE scheme, which utilizes the complete second-order statistics of the received signal. It is shown that this scheme significantly outperforms the conventional iterative FDE.
References
Etemad K: Overview of mobile WiMAX technology and evolution. IEEE Commun Mag 2008,46(10):31-40.
Wang F, Ghosh A, Sankaran C, Fleming P, Hsieh F, Benes S: Mobile WiMAX systems: performance and evolution. IEEE Commun Mag 2008,46(10):41-49.
3GPP TR 25.814 V7.0.0: Physical Layer Aspects for Evolved UTRA. Technical Report. 2006.
Pancaldi V, Bitetta GM: Block channel equalization in the frequency domain. IEEE Trans Commun 1995,53(1):110-121.
Benvenuto N, Tomasin S: On the comparison between OFDM and single carrier modulation with a DFE using a frequency domain feedforward filter. IEEE Trans Commun 2002,50(6):947-955. 10.1109/TCOMM.2002.1010614
Benvenuto N, Tomasin S: Iterative design and detection of a DFE in the frequency domain. IEEE Trans Commun 2005,53(11):1867-1875. 10.1109/TCOMM.2005.858666
Douillard C, Jézéquel M, Berrou C, Picart A, Didier P, Glavieux A: Iterative correction of inter-symbol interference: turbo-equalization. Eur Trans Telecommun 1995,6(5):507-511. 10.1002/ett.4460060506
Tuchler M, Koetter R, Singer A: Turbo equalization: principles and new results. IEEE Trans Commun 2002,50(5):754-767. 10.1109/TCOMM.2002.1006557
Tse D, Viswanath P: Fundamentals of Wireless Communications. Cambridge University Press, Cambridge; 2004.
Schreier P, Scharf L, Mullis C: Detection and estimation of improper complex random signals. IEEE Trans Inform Theory 2005,51(1):306-312. 10.1109/TIT.2004.839538
Neeser F, Massey J: Proper complex random processes with applications to information theory. IEEE Trans Inform Theory 1993,39(4):1293-1302. 10.1109/18.243446
Picinbono B, Chevalier P: Widely linear estimation with complex data. IEEE Trans Signal Process 1995,43(8):2030-2033. 10.1109/78.403373
Mattera D, Paura L, Sterle F: Widely linear MMSE transceiver for real-valued sequences over MIMO channel. Proceedings of the EUSIPCO 2006.
Sterle F: Widely linear MMSE transceivers for MIMO channels. IEEE Trans Signal Process 2007,55(8):4258-4270.
Chen S, Tan S, Hanzo L: Linear beamforming assisted receiver for binary phase shift keying modulation systems. Proc IEEE WCNC 2006, 1741-1746.
Paulraj AJ, Nabar R, Gore D: Introduction to Space-Time Wireless Communications. 1st edition. Cambridge University Press, Cambridge; 2003.
Kay S: Fundamentals of Statistical Signal Processing. Prentice Hall, NJ; 1998.
Golub GH, Van Loan CF: Matrix Computations. 3rd edition. Johns Hopkins University Press, Baltimore; 1996.
Wautelet X, Dejonghe A, Vandendorpe L: MMSE-based fractional turbo receiver for space-time BICM over frequency-selective MIMO fading channels. IEEE Trans Signal Process 2004,52(6):1804-1809. 10.1109/TSP.2004.827198
Wang J, Li S: Reliability based reduced-complexity MMSE soft interference cancellation MIMO turbo receiver. Proceedings of the PIMRC 2007, 1-4.
Poor V, Verdu S: Probability of error in MMSE multiuser detection. IEEE Trans Inform Theory 1997,43(3):858-871.
Dejonghe A, Vandendorpe L: Turbo-equalization for multilevel modulation: an efficient low-complexity scheme. Proc IEEE ICC 2002, 3: 1863-1867.
Simon M, Alouini MS: Digital Communication over Fading Channels: A Unified Approach to Performance Analysis. Wiley, New York; 2000.
Lipardi M, Mattera D, Sterle F: Constellation design for widely linear transceiver. EURASIP J Adv Signal Process 2010: 13. (Article ID 176587)
7 Competing interests
The authors declare that they have no competing interests.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Xiao, P., Lin, Z., Fagan, A. et al. Frequency-domain equalization for OFDMA-based multiuser MIMO systems with improper modulation schemes. EURASIP J. Adv. Signal Process. 2011, 73 (2011). https://doi.org/10.1186/1687-6180-2011-73
DOI: https://doi.org/10.1186/1687-6180-2011-73