1. Introduction
Boosting the data rate has been the main objective of wireless communication systems over this last decade. Long-Term Evolution (LTE) aimed at improving user data rate experience, and all the enhancements were centered on this main requirement, while the needs of services like virtual reality and the Internet of Things (IoT) were not properly taken into account. Currently, we are experiencing the 5th-Generation (5G) mobile era, where the objective is to allow the coexistence of different services and heterogeneous users on the same networking platform, besides looking for improvements of the data rate.
5G is a multi-use-case environment designed to reach almost all life aspects, from industry and health care to online games and vehicle-to-everything (V2X) communications. The current 5G is, however, just the beginning of more advanced communication systems. The service stack will continue to grow, and new applications are expected to be developed. It is no surprise that new requirements will also emerge in the near future, related to energy in particular, and more advanced designs will undoubtedly be needed [
1].
The current 5G specifications describe it as a multi-purpose communication ecosystem. That is, it will be an umbrella for several services where each service is associated with a bundle of requirements (see [
2] for further details). In order to support them, 5G is equipped with new features. Among them, flexibility is the most important one [
3]. Supporting multiple use-cases requires a flexible design not only on radio access methods and deployment options, but also on the frame structure and operations it supports. The corner stone to achieve such flexibility is the new design of the 5G new radio (NR).
The NR is built on four main pillars: (i) new spectrum; (ii) massive MIMO and beamforming; (iii) multi-connectivity; (iv) network flexibility and virtualization (numerology, slicing, NFV/SDN). The flexibility of NR spreads all over the system planes. Not only on the frame structure, but also on the operation and protocol stack logic. Furthermore, NR is user-centric, and bandwidth parts (BWPs) can be used to fit user equipment (UE) temporal requirements with dynamic transmission time interval (TTI) lengths and dynamic time division duplex (DTDD) [
4]. In short, the 5G NR is designed with components that are flexible, ultra-lean, and forward-compatible [
5].
The NR frame structure is the pivotal element for the flexible support of heterogeneous services, while allowing adaptation to different user channel conditions [
6]. Flexibility here comprises different orthogonal frequency division multiplexing (OFDM)-based waveforms, as well as a mixed numerology [
7]. The last term refers to different multicarrier modulation parameters with an impact on subcarrier spacing, cyclic prefix (CP) duration, and slot duration, as shown in
Table 1. The downlink and uplink transmissions are organized into 10ms frames, each having ten subframes of 1ms. In 5G, the 1ms subframe is then divided into one or more slots, depending on the numerology index in use. As shown in
Figure 1, the shortening of the slot duration is related to the shortening of the duration of the symbols.
On the one hand, this flexible design enables an efficient delivery of different qualities like low latency, guaranteed bit rate (GBR), reliability, and more [
8]. On the other hand, 5G resource management becomes more challenging. Mixed numerologies are prone to: (i) inter-numerology interference (INI); (ii) low spectral efficiency; (iii) signaling overhead; (iv) scheduling complexity. Therefore, the number of coexisting numerology indexes should be minimized.
A mixed numerology system where different BWPs co-exist demands special planning. Such a planning problem involves deciding on the numerology mix and BWPs that better explore network capacity given a set of users/services that may change in number, requirements, and impairments. Such planning should also be feasible for environments having a dynamic traffic pattern. Here in this article, the pre-planning of the numerology profiles is proposed to address this issue. The mathematical optimization models created to design such mixed numerology profiles allow for the optimization of mixed numerologies in future 5G systems, under any wireless communication scenario and traffic pattern. More clearly, the contributions of this article are the following:
A framework for the outline of mixed numerology profiles is proposed. These profiles are planned according to the k most antagonistic user/service requirements. The proposed framework allows a fast transition between profiles, performed in the case of traffic pattern changes.
A two-step approach is proposed for the planning of such numerology profiles, each having its own mixed numerology and BWPs. Mathematical optimization models are presented to solve both steps.
As technology evolves in 5G, operators will be able to deliver even more advanced and value-added services. Therefore, network planning and deployments must be done in such a way that they match the ambition of the services. This requires understanding users/services and taking into account their requirements in order to outline future mixed numerology profiles. The proposed framework allows this goal to be reached.
The remainder of this article is organized as follows.
Section 2 discusses related work, and then,
Section 3 clarifies several wireless communication and 5G related definitions, which are required to understand the following sections and addressed problem. In
Section 4, the motivation for the optimization of numerology profiles is presented, and a mathematical formulation of such an optimization problem is created, allowing it to be solved.
Section 5 presents the simulation setup and analysis of the results, and
Section 6 presents a final discussion on the results and conclusions, together with future work.
2. Related Work
The multiplexing of 5G services under a mixed numerology is now a hot topic. The mixture of OFDM-based numerology indexes was analyzed in [
6], where it was shown to be beneficial for the support of different services with different latencies. However, due to energy leaks, the INI problem may appear. The authors demonstrated a time-domain window filtering method to reduce such an effect. The authors in [
9] studied the multi-service support under different subcarrier spacing and highlighted the effect of the subcarrier spacing difference and the guard band size on the INI problem. They proposed a sub-band filtered transmission scheme and cancellation and equalization algorithms to reduce the effect of the INI problem. In [
10], the INI problem was also analyzed and shown to be even more severe on the edge sub-carriers of neighboring BWPs. Similar to [
9], the authors also showed that the effect of the INI is proportional to the difference in numerology indexes. In order to avoid the INI, an adaptive numerology selection approach was developed in [
11] based on the delay requirements of the service. This approach considers only the delay requirements to reduce the average scheduling latency.
Finding the minimum mixture of numerology indexes was studied in [
12], where a metric was developed to measure user satisfaction regarding the numerology assigned to deliver its services. Different scenarios were simulated and different numerology index sets considered. In order to select the desired number of numerologies, the authors proposed a greedy algorithm, which uses the metric to obtain a trade-off between scenario and user requirements. In [
13], service multiplexing using a predefined mixture of numerology indexes was modeled as an integer programming problem and shown to be NP-hard by constructing a polynomial-time reduction from the partition problem. For this reason, it was solved using a Lagrangian relaxation. However, this work did not include the INI problem in the model. Similar studies can be found in [
14,
15]. In [
14], a multi-user OFDM system was modeled using an integer program, and a relaxed version was developed to reduce the computational complexity. In [
15], an integer program was also developed to model the scheduling of a multi-user system while meeting users’ service requirements. This work also proved that the multi-user resource allocation under different service requirements is an NP-hard problem. Moreover, the authors developed two different algorithms: a resource portioning algorithm to decompose the allocation problem into a set of parallel small scale problems and an iterative greedy algorithm based on the resource assignment weight. This work, however, allocated physical resource blocks (PRBs) to users under the assumption that numerologies are preselected by each user. The energy efficiency optimization problem was studied in [
16]. Furthermore, a joint beamforming and power allocation scheme was proposed, which takes into consideration the intra- and inter-cell interference, but not the INI problem. In addition, resource scheduling of different services was studied in [
17], where different models were developed to join enhanced mobile broadband (eMBB) and ultra reliable low latency communications (URLLC) resource allocation. In [
18], the concern was to predict the outage probability in order to ensure efficient and stable service communications. New approaches using machine learning have also emerged, as in [
19].
Table 2 summarizes the differences among these research efforts.
The previously mentioned studies considered either a predefined mixed numerology and addressed resource allocation and service multiplexing problems or considered planning a mixed numerology that is more suitable for specific services. In [
20], we addressed the spectrum allocation problem. In this article, and contrary to such studies, the problem is to plan multiple numerological profiles that fit certain QoS requirements, which should be selected according to the presence or not of one or more of these QoS requirements over time. To our knowledge, this problem has not been addressed before.
3. Required Definitions
For readability, the notation used throughout the article is summarized in
Table 3.
For now, 5G systems will use a single waveform (CP-OFDM), and two large frequency ranges (FRs) are specified by 3GPP: sub-6 GHz (FR1) and millimeter wave (FR2). A subcarrier spacing of 15 and 30 kHz can be used in sub-6 GHz; a subcarrier spacing of 120 kHz can be used in the millimeter wave range; and a subcarrier spacing of 60 kHz can be used in both. A set of bands has been defined for each FR, by 3GPP, together with the available subcarrier spacings and supported UE channel bandwidths (maximum transmission bandwidth + guard bands). For example, a 5 MHz UE channel bandwidth is only supported in a 15 kHz subcarrier spacing [
23].
Definition 1 (Cell bandwidth). Let us assume a wireless communication scenario in which the service area is covered by a specific frequency band B, under a certain bandwidth denoted by W. Guard band penalties are assumed to be incorporated in W, and the set of allowed numerology indexes is denoted by .
The cell bandwidth is expected to be large, but the reception/transmission bandwidth of a UE is not necessarily the same as that of the cell bandwidth. That is, the reception/transmission bandwidth of a UE will be a subset of the total cell bandwidth and may decrease during low activity to save power. A UE can have at most four BWPs configured for downlink (similar for uplink; supplementary uplink is possible), and for now, just a single BWP will be active at a time (in Release 15). One of the BWPs will be similar for all users in the cell and used for initial access to the network.
Definition 2 (BWP). A bandwidth part is the frequency spectrum, within the carrier’s bandwidth, over which a device is currently operating. A BWP is a group of contiguous physical resource blocks (PRBs) where one PRB occupies 12 consecutive subcarriers (frequency domain), and it can be used in either direction (uplink or downlink). A BWP is associated with a numerology index, and only a single BWP can be active at any time. The set of BWPs for utilization by UE u is denoted by , and if is the bandwidth percentage assigned to , , then will be the number of its PRBs () if numerology index is used. A switch to short BWP allows energy saving, while BWPs at different numerology indexes allow for different services.
The control mechanism responsible for exchanging BWP information is the radio resource control (RRC) protocol. This protocol is understood by both the user NR and by the network gNB. The RRC can perform BWP reconfiguration or use downlink control information (DCI) messages for BWP switching. These consume a certain time, and for this reason, their use should be minimized. Since the RRC reconfiguration is the most demanding one, the assigned BWPs should fit not only current user/service needs (while being efficient regarding the use of physical resources), but also future needs so that RRC reconfigurations are avoided as much as possible.
Definition 3 (NR data rate). A user NR device is assumed to have a limitation on the speed at which it can transfer data, denoted by . It is assumed that such a limitation is per 66.67 μs (symbol duration in Numerology Index 0). Such a limitation influences the user BWP sizes, numerology, and modulation scheme. That is, for a given BWP of user u, , for numerology g and modulation m.
Moreover, NR devices also have a limitation on the speed at which they can transfer data, and impairments (e.g., Doppler spread, frequency offset) may exist.
Definition 4 (Impairments). Measurements/feedback are provided via channel quality information (CQI), or other similar systems. The overall set of feedback elements is denoted by , and it is assumed that specific wireless communication scenarios (e.g., mobile), served by a specific frequency band, usually lead to a set of channel and UE impairments. Such information should be used to improve the QoS.
ITU-Rstarted by defining three service types: enhanced mobile broadband (eMBB), ultra reliable low latency communications (URLLC), and massive machine type communications (mMTC) [
24]. Regarding these service types, it can be stated that:
eMBB: The focus is on supporting the ever-increasing end user data rate and system capacity.
URLLC: The focus is reliability and security required by mission-critical applications. High subcarrier spacing and mini-slots are the key enablers for this use case.
mMTC: The focus is energy efficiency and massive connectivity.
This classification has as a basis the technical viewpoint of network operators and service providers. More recently, an end user experience perspective was proposed to classify 5G services [
25,
26]. The authors proposed a classification based on five features: immersiveness, intelligence, omnipresence, autonomy, and publicness. Naturally, other kinds of services may arise, allowing the possibility to better serve existing and future applications. As the number of classes increases, so does the complexity of systems because services will have different kinds of requirements.
The service requirements are specified through 5G QoS class index (5QI) values [
27,
28], and these are expected to differ among wireless communication scenarios like indoor hotspot, dense urban, rural, urban macro, high speed, etc. [
29]. That is, wireless communication scenarios are expected to affect the service requirements, which is basically related to the impairments.
Definition 5 (Services). The main types of uses that 5G is expected to enable (overall set of services under consideration) are denoted by . The overall set of possible QoS requirements is denoted by , and it is assumed that requirement values change according to the wireless communication scenario.
Regarding the duration of TTIs, these will change according to the used numerology index because TTI = the number of symbols in time × symbol length. For services requiring lower latencies, a low number of symbols per TTI or a short symbol length can be used to obtain a shorter TTI. For higher spectral efficiency, a longer TTI allows for higher spectral efficiency (less % of downlink control channel overhead, used to carry scheduling decisions every TTI). Therefore, there is a tradeoff between system spectral efficiency and minimal latency.
In practice, the subcarrier spacing, TTI, and number of subcarriers are directly related. For a low number of subcarriers, large subcarrier spacing is used, and a lower TTI (due to short symbol duration) is obtained, leading to low latency (note, however, that TTI has to do also with the number of symbols in time). Users/services can be flexibly multiplexed over the available resources with different TTIs, which allows for the support of service-aware TTI multiplexing on the same frequency [
30]. The TTI can be adjusted according to the required latency and scheduling frequency [
31].
The radio medium access control (MAC) scheduler does the packet treatment separately for each data radio bearer (DRB) following a two-step mapping of end-to-end (E2E) session flows.
Definition 6 (Two-step mapping of E2E session flows).
The non-access stratum (NAS) filters the data packets in the UE, or 5G core network (CN), and associates the data packets with QoS flows. An E2E session can be associated with one or more QoS flows. The access stratum (AS) mapping in the UE (or 5G RAN) associates the QoS flows with the DRBs (see Figure 2). This mapping is based on 5QI in the transport header of the packets and on the corresponding QoS parameters that are signaled via the CN interface when a session is established. One or more QoS flows can be mapped to a DRB, and a UE can have a set of DRBs, denoted by . A DRB d has a vector of QoS requirement values denoted by , where . Table 4 shows some traffic types and corresponding requirements. If the system reaches congestion, then priorities can always be used (see [
28] for details). This two-step approach allows differentiating application/service flows and the adaptation of the DRB requirements to guide the radio scheduler. That is, the QoE manager can adaptively monitor and adjust the mapping of QoS flows to DRBs (reflective QoS). By adjusting the mapping of QoS flows to a DRB, there is an adjustment of the latency budget, packet loss rate tolerance, and GBR with the DRB, and this can be used to guide the lower-layer scheduler (although this should be done rarely, and priority should be changed to better serve an application). DRB data can be mapped to one or more BWPs and, consequently, TTI sizes.
Applications can have different requirements, and achieving the highest data rates may not always be the main requirement. Power consumption can be a critical issue and must also be considered. For most devices, the maximum throughput scenario is the one leading to the highest energy efficiency because the energy consumed per transferred bit is minimum (there is a power baseline that if distributed by a large number of bits, then the power efficiency is higher). However, it cannot be taken for granted that the power consumption will linearly map to a data rate improvement, especially if the device uses the full bandwidth to transmit lower data rate traffic (high baseline). That is, if the device uses a large bandwidth to transmit low data rate traffic, then there is energy inefficiency. In summary:
Uplink: Since power varies according to the distance, when transmit power is near its limit, then the only way to extend uplink coverage is to concentrate the same energy into fewer bits. If the data are not urgent, then the BWP should be small. For short distances, higher BWPs (fitting data rate) should be used for higher energy efficiency).
Downlink: High throughputs can be provided by high bandwidth carriers and MIMO layers, but this requires high processing capacity to deal with data rates and high power at maximum throughput because the device has to actively monitor wideband control channel across a large bandwidth, even when no data are present. For this reason, the BWP should be large in the case of high throughput, but small in the case of low throughput.
The power baseline also changes with the channel/BWP/modulation, and this will change the required energy per bit. Switching between BWPs can be done to keep the baseline low, if the throughput reduces, but the switching overhead must also be considered. In summary, the BWP and the consequent power consumption must be considered based on the traffic profile: service type and their requirements.
Definition 7 (NR energy). NR devices are assumed to have different hardware characteristics. In general, the higher the channel, BWP, and modulation of a channel, the higher the power baseline. This should follow service throughput needs for energy efficiency. Full spectrum and short TTI allow constrained devices to go to sleep mode. BWP planning should take the traffic profile into account.
Numerology indexes are not associated with specific service classes because there will be different communication scenarios and UE characteristics. For this reason, seven numerology indexes have been defined, and the BWP/numerology index in use by a UE may change at every TTI. This should be decided according to: (i) the requirements of services/DRBs in a communication scenario; (ii) NR features (bandwidth, data rate, and energy limitations); and (iii) impairments.
In general, few numerological indexes should be used when there is a need for greater spectral efficiency. This is because a mixture of numerology indexes requires guard bands to avoid the INI. In some cases, such spectral efficiency is more important than having multiple numerology indexes, which provides more flexibility, but at the expense of scheduling complexity and signaling overhead, leading to a waste of bandwidth resulting from the use of multiple guard bands.