Analysis of Fuzzy Time Series Forecasting for Migration Flows

Uzhga-Rebrov, Oleg; Grabusts, Peter

doi:10.3390/sym14071441

Open AccessArticle

Analysis of Fuzzy Time Series Forecasting for Migration Flows

by

Oleg Uzhga-Rebrov

^1,* and

Peter Grabusts

²

¹

Institute of Engineering, Rezekne Academy of Technologies, LV-4601 Rezekne, Latvia

²

Faculty of Engineering, Rezekne Academy of Technologies, LV-4601 Rezekne, Latvia

^*

Author to whom correspondence should be addressed.

Symmetry 2022, 14(7), 1441; https://doi.org/10.3390/sym14071441

Submission received: 8 June 2022 / Revised: 7 July 2022 / Accepted: 11 July 2022 / Published: 13 July 2022

Download

Browse Figures

Versions Notes

Abstract

:

The goal of this article is to forecast migration flows in Latvia. In comparison with many other countries with sufficiently symmetric emigration and immigration flows, in Latvia, migration flows are very asymmetric: the number of emigrants considerably exceed the number of immigrants. Since statistical data about migration are usually inaccurate, we employ fuzzy time series forecasting methods for prognosticating migration flows in Latvia forecasting. The use of this type of method is often useful not only for forecasting purposes. Three different methods for fuzzy time series forecasting are used. A detailed comparative analysis of the obtained results is given. Generalized forecasts of the expected net migration flow in the future are presented.

Keywords:

migration; net migration; fuzzy time series; fuzzy time series forecasting

1. Introduction

Time series forecasting problems often arise in different fields. In situations with a high uncertainty of historical data, fuzzy time series forecasting methods are more suitable. The initial data are transformed into a form of fuzzy linguistic categories and these categories are foundation for the forecasting.

Taking into account sufficiently high uncertainty of historical migration data for prospective evaluation of net migration in Latvia, three well-known fuzzy time series forecasting methods are used in this paper.

This paper’s basic purposes are:

Net migration fuzzy forecasting in Latvia using three selected methods.
The analysis of received results to determine the most suitable fuzzy forecasting method.
Practical recommendations to develop stabilization of migration flows in Latvia.

This paper’s novelty: according to the authors’ information, it is the first attempt to use fuzzy time series for migration flows forecasting.

This paper’s scientific and practical contributions are the expansion of fuzzy time series forecasting approaches to the new direction and using the received results to make practical recommendations concerning migration flow stabilization and reaching relative symmetry.

Research findings:

Fuzzy forecasting methods are suitable for this problem solution.
The method that shows the higher effectiveness may be recommended for another problem forecasting with fuzzy initial data.

The whole symmetry of immigration and emigration flows is an unattainable ideal. The situation in countries with prevalent emigration flows, which is currently typical for Latvia, leads to essential outflow of trained workers and creates worse conditions for business functioning. Countries with prevalent immigration flows of untrained workers can have problems with maintenance costs. Therefore, migration flows’ symmetry is an important factor for the successful progress of any state.

This article has the following structure. In Section 2, definitions of migration and migration flows are presented, and the state of migration flows in Latvia is described. Section 3 provides a review of the relevant literature on fuzzy time series forecasting methods. Section 4 discusses the conceptual foundations of fuzzy time series and common approaches to their prediction. Section 5 presents net migration in Latvia forecasts based on the general three fuzzy time series forecasting methods. In Section 6, the evaluation and analysis of results are given. In Section 7, the discussion of results is presented. Section 8 presents general conclusions and conclusions regarding the current state and possible future state of migration processes in Latvia.

2. Migration and Migration Flows

Individual and group migrations of people have happened throughout the history of human evolution. Migration processes have become more intense in the last decades of the 20th century and in the 21st century. One of the reasons for this was the intensive globalization of all economic processes, which caused the movement of large groups of people between countries and regions. Another reason was the wars in Iraq, Libya, Syria, and Afghanistan, which gave rise to intensity the flows of refugees from these countries. The third reason was the collapse of the USSR and the formation of many independent states on its former territory.

All these global processes have had and are having a significant impact on political and social processes in Latvia. In particular, during the years of its independence, the size and structure of migration flows in this country have changed significantly.

As a result, the following task arises: to analyze historical data on migration flows in Latvia and to predict the possible future state of these flows. This is necessary not only to get a clear idea of the current situation with migration, but also to develop reasonable recommendations for managing migration flows in the future.

Let us give some formal definitions from [1].

Migration is the movement of a person or a group of persons, either across an international border, or within a State. It is a population movement, encompassing any kind of movement of people, whatever its length, composition, and causes; it includes migration of refugees, displaced persons, economic migrants, and persons moving for other purposes, including family reunification.

Migrant. At the international level, no universally accepted definition for ʺmigrantʺ exists. The term migrant was usually understood to cover all cases where the decision to migrate was taken freely by the individual concerned for reasons of ʺpersonal convenienceʺ and used without intervention of an external factor; it therefore applied to persons and family members, moving to another country or region to better their material or social conditions and improve the prospect for themselves or their family.

The United Nations defines migrant as an individual who has resided in a foreign country for more than one year irrespective of the causes, voluntary or involuntary, and the means regular or irregular, used to migrate.

Total migration. The sum of entries and arrivals of immigrants, and of exists, or departures of emigrants, yields the total volume of migration, and is termed total migration, as distinct from net migration, or the migration balance, resulting from the difference between arrivals and departures.

Net migration. Difference between the number of persons entering the territory of a State and the number of persons who leave the territory in the same period, also called “migratory balance”. This balance is called net immigration when arrivals exceed departures, and net emigration when departures exceed arrivals.

There are various theoretical approaches to explain and analyze migration processes. A brief overview of such approaches is presented in [2]. Work [3] extensively analyzes the causes and consequences of migration flows to European countries. A broad literature review on the current and possible future situation in European countries is presented in [4]. In [5], the main theories of migration are presented: fundamentalist migration theories and conflict theory.

It has now become clear that the early theories of migration attempted to explain only certain aspects of migration processes. Therefore, there is an obvious need to develop new extended theories that more adequately take into account the influence of various factors (reasons) on migration processes. Reasonable criticism of existing migration theories and the definition of migration as an essential part of important socio-economic changes in modern society are widely presented in [6].

The main factors causing population migration are the following:

Economic factors. These factors play a decisive role in the formation of migration flows. Low level of income and high unemployment in some countries cause the migration of population to more developed countries. The obvious purpose of such migration is to ensure a higher standard of living for oneself and one’s family, to provide a good education for children, and to live in more comfortable social and cultural conditions.
Demographic factors. A high birth rate and rather low standard of living lead to overpopulation in some countries. The way out may be the migration of the most active part of population to other countries.
Socio-cultural factors. Underdeveloped infrastructure and lack of comfortable conditions for living and personal development force dissatisfied residents to move to countries where they can effectively meet their social and cultural needs.
Political factors. In countries with authoritarian systems, there are often restrictions on political and personal freedoms of citizens. This causes the migration of certain groups of citizens to countries with a more favorable political climate. Another significant factor from this group was the devastating wars that have taken place in recent decades in Iraq, Libya, Syria, and Afghanistan. These wars have caused large flows of refugees to other countries.

Currently, Latvia is an independent state—a member of the European Union. In this regard, the asymmetric migration flows in Latvia have a significant impact on the state and its development.

Some data, mostly of statistical nature, on current migration flows in Latvia are presented in [7]. The purpose of this article is to analyze and forecast net migration in Latvia. The values of net migration are taken as a basis, since they are a generalization of both the flow of people leaving the country and the flow of those entering the country. The specificity of migration flows in Latvia is that the flow of emigration significantly exceeds the flow of immigration. The values of emigration and net migration are highly correlated. At the same time, the values of immigration and net migration are practically not correlated.

The working material for this article is widely available statistical data. However, there is a problem with these data. A widely known fact is the relative inaccuracy of historical migration data. Therefore, in this article, we model the initial data in the form of a fuzzy time series, the elements of which are fuzzy linguistic categories. Prediction procedures are performed on this transformed dataset.

In addition to the main aim, another purpose of this article is to present the process of fuzzy time series forecasting in a broader context. To do this, we perform a historical data forecasting process using three different methods. This allows us to compare the performance of these methods and determine the method that gives the most accurate prediction results in our particular problem.

3. Literature Review about Fuzzy Time Series Forecasting

Time series can be defined as the results of observations, measurements, evaluations that are performed sequentially at certain points in time. There are a lot of literature sources regarding deterministic time series and their prediction. Here we will mention only monographs [8,9,10], in which all issues related to this topic are covered competently and in detail.

In his innovative works, L. Zadeh [11,12,13] introduced new concepts of fuzzy sets, fuzzy logic, and fuzzy linguistic variables. From that time, the rapid introduction of these new concepts in various theoretical and practical areas of human activity began.

The first successful attempts to introduce fuzziness in the time series and their prediction are presented in works [14,15,16]. The authors gave examples where the observed phenomena cannot be estimated using standard numbers in principle, since these phenomena are by their nature vague concepts. As an example, the authors used subjective assessments of weather conditions by an individual. Another even more illustrative example is the individual’s consistent subjective assessment of his mood.

The conceptual definitions of fuzzy time series and the proposed algorithm for their prediction in [14,15,16] were correct from a theoretical point of view, but the problem was the extraordinary number of calculations that had to be performed in the practical use of the proposed method.

To eliminate this shortcoming in [17], a modification of this algorithm was proposed, which gave satisfactory prediction results, while the number of required calculations was much lower. Further improvements to the original algorithm were proposed in [18,19]. Later, various options for improving the efficiency of fuzzy time series forecasting algorithms were proposed [20,21,22,23,24,25,26].

Among other problems associated with the fuzzy time series, we should mention the formation of intervals and related fuzzy linguistic categories in the range of changes in the original historical data. In the original version of the algorithm, the authors used 7 intervals. This number was taken as a basis on the grounds of subjective judgments. In this regard, the problem arises of developing a criterion for objectively assigning the number of relevant intervals. Attempts to resolve this problem are presented in [21,23,25,27,28,29].

At present, fuzzy time series forecasting is widely used for forecasting in various fields. In [29,30,31], forecasting of needs in tourism is presented. In [32], a fuzzy time series model is used for stock market forecasting. In [33], a fuzzy time series model is used for forecasting the amount of Taiwan exports.

In addition to the above approach to fuzzy time series forecasting, alternative approaches are used. Another approach is FARIMA—Fuzzy Autoregressive Integrated Moving Average. Examples of practical use of this approach can be found in [34,35,36]. One more approach is to use Artificial Neural Networks (ANNs) [37,38,39].

An additional extensive review of various applications of fuzzy time series forecasting is presented in [40].

4. Conceptual Foundations of Fuzzy Time Series Forecasting

In a strict mathematical form, the concepts and definitions of fuzzy time series are presented in [14]. Let us cite the main theses of this work.

Let

Y (t)

t = ...0, 1, 2, \dots

, a subset of

R^{1}

be the universe of discourse on with fuzzy sets

f_{i} (t)

i = 1, 2, \dots

are defined and

F (t)

is the collection of

f_{i} (t)

. Then,

F (t)

is called a fuzzy time series on

Y (t)

t = \dots, 1, 2, \dots

.

In this definition,

F (t)

can be understood as a linguistic variable and

f_{i} (t)

as the possible linguistic values of

F (t)

. Because at different times, the values of

F (t)

can be different,

F (t)

is a function of time

t

.

Suppose

I

and

J

are indices sets for

F (t - 1)

and

F (t)

, respectively. Then, if for any

f_{j} (t)

where

j \in J

there exist an

f_{i} (t - 1) \in F (t)

where

i \in I

such that there exist a fuzzy relation

R_{i j} (t, t - 1)

and

f_{j} (t) = f_{i} (t) \circ R_{i j} (t, t - 1)

where

\circ

is the max-min composition, then

F (t)

is said to be caused by

f (t - 1)

only.

Fuzzy relation

R_{i j}

can be extended in two alternative ways:

If for any $f_{j} (t)$ , there exist an integer $m > 0$ and fuzzy relation $R_{a}^{p} (t, t - m)$ such that

f_{j} (t) = (f_{i 1} (t - 1) \times f_{i 2} (t - 2) \times \dots \times f_{i m} (t - m)) \circ R_{a}^{p} (t, t - m)

then

F (t)

is said to be caused by

F (t - 1)

,

F (t - 2)

, ..., and

F (t - m)

simultaneously then define the following fuzzy relational equation:

F (t) = (F (t - 1) \times F (t - 2) \times \dots \times F (t - m)) \circ R_{a} (t, t - m)

(1)

2.: If there exist a fuzzy relation $R_{o}^{p} (t, t - m)$ such that

f_{j} (t) = (f_{i 1} (t - 1) \cup f_{i 2} (t - 2) \cup \dots \cup f_{i m} (t - m)) \circ R_{a}^{p} (t, t - m)

then

F (t)

is said to be caused by either

F (t - 1)

or

F (t - 2)

, or … or

F (t - m)

. We have the following fuzzy relational equation

F (t) = (F (t - 1) \cup F (t - 2) \cup \dots \cup F (t - m)) \circ R_{o} (t, t - m)

(2)

where

R_{o} (t, t - m) = \cup_{p} R_{o}^{p} (t, t - m)

.

If the fuzzy relation

R (t, t - m)

or

R_{a} (t, t - m)

, or

R_{o} (t, t - m)

of

F (t)

is independent of time

t

, then

F (t)

is called a time-invariant time fuzzy series. Otherwise, it is called a time-variant fuzzy time series.

In [14], the authors firstly proposed an approach to predict the time-invariant fuzzy time series. In [16], they extended this approach to predict the time-variant fuzzy time series. They used the data of enrolment to Alabama University from 1971 to 1990 as historical data. The forecasting process includes the following procedures:

The interval [max enrolments–min enrolments] is divided into 7 intervals of the same length $u_{1}$ , $u_{2}$ , ..., $u_{7}$ .
Fuzzy linguistic categories are formed subjectively. Each such category is a qualitative expression of sets of values for the predicted quantity. For each linguistic category, the degrees of belonging of each of the initial intervals to the corresponding linguistic category are determined.
Data fuzzification, that is, finding for the reception of each year membership to each of the fuzzy categories $A_{j}$ , $j = 1, \dots, 7$ . If the maximum membership of a reception in some year corresponds to the fuzzy category $A_{j}$ , then it is assumed that the fuzzy value of the reception in this year is equal to $A_{j}$ .
Using historical data, a lot of logical connections are formed between the next two years. For example, if in the previous year the reception was equal $A_{j}$ and in the current year the reception is equal to $A_{k}$ , then we have the following logical connection $A_{j} \to A_{k}$ .
On the basis of the obtained set of logical connectives, the set of corresponding relations is determined. For example, for a connection $A_{j} \to A_{k}$ , the relation $R_{j}$ is defined as $R_{j} = A_{j}^{T} \times A_{k}$ . The generalized relation is defined as $R (t, t - 1) = R = \cup_{i = 1}^{10} R_{j}$ , since in the presented example there are 10 logical connectives.
Fuzzy forecasting. Calculation of fuzzy predicted values is made according to the expression

A_{i} = A_{i - 1} \circ R

(3)

where

A_{i - 1}

—fuzzy reception category per year

i - 1

;

A_{i}

—predicted fuzzy reception category per year

i

.

7.: Defuzzification. In practice, the prediction results do not require defuzzification since they are presented in a deterministic form. In work [16], the authors proposed similar calculation procedures for time-variant fuzzy time series.

The undoubted merit of authors of papers [14,15,16] is that they were the first who formally determined the fuzzy time series and proposed a mathematically correct approach for their prediction.

On the other hand, a significant drawback of the proposed approach is the extremely large number of calculations required in the practical use of this approach.

To overcome this significant shortcoming of the original approach, the following modification of this approach was proposed in [17]. All preliminary steps before fixing the logical connections between the previous and current time moments are performed in exactly the same way as in the original approach. Then, using the formed set of fuzzy logical connections, these connections are combined into groups, each of which contains one fuzzy category and all other categories logically related to this category. The further forecasting process will be presented in detail in the next section.

The approach presented in [17] gives results close to those obtained using the original approach but requires a significantly smaller number of calculations.

In subsequent years, many other variants of fuzzy time series forecasting algorithms were proposed, but the conceptual foundations of this direction were laid in the works [14,15,16].

5. Forecasting Net Migration in Latvia Based on Fuzzy Time Series Forecasting Methods

In this section, we will perform fuzzy forecasting of net migration in Latvia using three alternative methods, which we will call Method1 [17], Method2 [23], and Method3 [25].

5.1. Forecasting Based on Method1

Source [7] presents statistical data on the flows of people who entered Latvia, those who left Latvia, and the value of net migration (balance) for the period from 1990 to 2020. From 1990 to 2011, the values of net migration changed very chaotically, reaching peak values in some years that are very different from its values in other years.

Taking into account this circumstance, we use reduced historical data for the period from 2012 to 2020. Over the years, the values of net migration have changed quite smoothly and did not have bursts, which greatly complicate the forecasting process.

Historical data on migration processes in Latvia are presented in the 2nd, 3rd, and 4th columns of Table 1.

We take net migration as the basis for forecasting because (1) it is a derived indicator dependent on both emigration and immigration flows and reflects the asymmetry of these flows; (2) net migration values are strongly correlated with the number of people leaving Latvia. Let us perform fuzzy forecasting procedures according to Method1 [17].

Establishing a working range (interval) value of net migration.
From Table 1, we find that the minimum net migration value is −3150 and its maximum value is −14,262. Hence, the actual range of net migration values is $[- 3150, - 14,262]$ . According to the recommendations in the literature and to simplify further calculations, we expand this range and as a result we have the following operating range: $[- 3000, - 15,000]$ .
Formation of intervals in the operating range.

In this work, we use 6 intervals of the same length. These intervals are displayed at the bottom of Figure 1.

3.: Formation of fuzzy linguistic categories.

We form 5 fuzzy linguistic categories

A_{1} - A_{5}

in the operating range. Graphs of membership functions of net migration values to these categories are presented in the upper part of Figure 1.

In terms of the qualitative meanings of net migration, the fuzzy linguistic categories defined above are interpreted as follows:

A_{1}

—low;

A_{2}

—medium-low;

A_{3}

—medium;

A_{4}

—medium-high;

A_{5}

—high.

4.: Fuzzification of deterministic net migration values.

To perform this procedure for each historical net migration value, using the membership function plots in Figure 1, we define that fuzzy category, the degree of belonging to which this historical value is maximum. For example, the net migration value −11,860 has the maximum degree of membership to the fuzzy linguistic category

A_{4}

. Therefore, the fuzzy category

A_{4}

is taken as the fuzzified value for the given net migration value.

The fuzzified net migration values thus determined are presented in the

5 th

column of Table 1.

Relationships between fuzzy linguistic categories and intervals can be represented in the following form:

\begin{matrix} A_{1} = (\frac{1}{u_{1}} + \frac{0.35}{u_{2}} + \frac{0}{u_{3}} + \frac{0}{u_{4}} + \frac{0}{u_{5}} + \frac{0}{u_{6}}); \\ A_{2} = (\frac{0.65}{u_{1}} + \frac{1}{u_{2}} + \frac{0.35}{u_{3}} + \frac{0}{u_{4}} + \frac{0}{u_{5}} + \frac{0}{u_{6}}); \\ A_{3} = (\frac{0}{u_{1}} + \frac{0.35}{u_{2}} + \frac{1}{u_{3}} + \frac{1}{u_{4}} + \frac{0.35}{u_{5}} + \frac{0}{u_{6}}); \\ A_{4} = (\frac{0}{u_{1}} + \frac{0}{u_{2}} + \frac{0}{u_{3}} + \frac{0.65}{u_{4}} + \frac{1}{u_{5}} + \frac{0.65}{u_{6}}); \\ A_{5} = (\frac{0}{u_{1}} + \frac{0}{u_{2}} + \frac{0}{u_{3}} + \frac{0}{u_{4}} + \frac{0.35}{u_{5}} + \frac{1}{u_{6}}) . \end{matrix}

5.: Set formation of fuzzy logical connectives between fuzzified values of pure migration. These connectives display each consecutive pair of fuzzy linguistic categories from the 5th column of Table 1.

$A_{4} \to A_{5}, A_{5} \to A_{3}, A_{3} \to A_{3}, A_{3} \to A_{4}, A_{4} \to A_{3}, A_{3} \to A_{2}, A_{2} \to A_{1}, A_{1} \to A_{1}$
6.: Grouping fuzzy logical connectives.

Let us group the fuzzy logical connectives in such a way that each group has connections the predecessor of which is one fuzzy logical connective. We have:

\begin{matrix} A_{1} \to A_{1}; \\ A_{2} \to A_{1}; \\ A_{3} \to A_{2}, A_{3} \to A_{3}, A_{3} \to A_{4}; \\ A_{4} \to A_{3}, A_{4} \to A_{5}; \\ A_{5} \to A_{3} \end{matrix}

7.: Forecasting values at relevant points in a fuzzy time series.

In [17], the following algorithm for calculating the forecasted values in the fuzzy time series is proposed.

If the fuzzy historical value at the moment of time $i$ is $A_{j}$ , and there is only one logical connective, for example $A_{j} \to A_{k}$ , and the maximum membership value belongs to the interval $u_{k}$ , then the value at the midpoint of the interval is taken as the forecasted value at the time $i + 1$ .
If there are $l$ logical connectives in the group that connect the fuzzy category $A_{j}$ with the categories $A_{k 1}$ , $A_{k 2}$ ,... $A_{k l}$ , and the maximum membership values for these categories refer to the intervals $u_{k 1}$ , $u_{k 2}$ , ..., $u_{k l}$ , respectively, and the average values of these intervals are $m_{1}$ , $m_{2}$ , ..., $m_{l}$ , then the forecasted value at the moment of time is equal to the average of these values: $\frac{m_{1} + m_{2} + \dots + m_{l}}{l}$ .
Let the fuzzy value at the time $i$ be $A_{j}$ , and there is no logical connection for $A_{j}$ . If the maximum membership value for $A_{j}$ belongs to the interval $u_{j}$ , then the forecasted value at the moment of time $i + 1$ is taken as the value of the midpoint $m_{i}$ of the interval $u_{j}$ .

As an example, let us perform calculations for fuzzy linguistic categories

A_{1}

and

A_{3}

.

-
fuzzy linguistic category $A_{1}$ .

There is only one fuzzy logical connective

A_{1} \to A_{1}

in the corresponding group. Since the fuzzy linguistic category

A_{1}

has the maximum degree of membership in the interval

u_{1}

, then the forecasted value at the time point

i + 1

following the time point with the linguistic value of net migration equal to

A_{1}

, should be taken as the midpoint of the interval

u_{1} = - 4000

:

A_{1} \to - 4000

.

-
fuzzy linguistic category $A_{3}$ .

There are three fuzzy logical connectives

A_{3} \to A_{2}

,

A_{3} \to A_{3}

,

A_{3} \to A_{4}

in the corresponding group. The fuzzy category

A_{2}

has the maximum degree of membership on the interval

u_{2}

with the midpoint

m_{2} = - 6000

. The fuzzy category

A_{3}

has the maximum degree of membership exactly on the boundary of the intervals

u_{3}

and

u_{4}

, therefore, we take the midpoint of the common interval as the midpoint:

u_{3} + u_{4}

:

m_{34} = - 9000

. The fuzzy category

A_{4}

has the maximum degree of membership on the interval

u_{4}

with the midpoint

m_{4} = - 12, 000

. The forecasted value at the time point

i + 1

following the time point with the net migration linguistic value equal to

A_{3}

, is calculated as:

\frac{m_{2} + m_{34} + m_{4}}{3} = \frac{- 6000 + (- 9000) + (- 12, 000)}{3} = - 9000 : A_{3} \to - 9000

The remaining calculations are performed by analogy. The calculation results are presented in the 6-th column of Table 1.

In the situation under consideration, a fuzzy linguistic category

A_{1}

in 2020 has only one fuzzy logical connective

A_{1} \to A_{1}

, so the forecast for 2021 will be a fuzzy category

A_{1}

. As this category corresponds to the range of net migration values

[- 3000, - 6000]

we can forecast that the actual value of net migration in 2021 will be in this interval.

5.2. Forecasting Based on Method2

The second algorithm we use in this article is the one presented in [23]. A distinctive feature of this method is that it uses a specific partition of the working range of values of the forecasted value into intervals, resulting in intervals of unequal length.

The method presented below involves performing the following procedures:

Operating range of values determination of the predicted value.

Using the previously obtained results, we have the following operating range:

[- 3000, - 15,000]

.

2.: Division of the operating range into a certain number of intervals of the same length.

Set the initial number of intervals to 2. We have the following intervals:

u_{1} = [- 3000, - 9000]

;

u_{2} = [- 9000, - 15,000]

.

3.: Determination the number of data points that fall within each initial interval.

Using the historical data in the 4th column of Table 1, we have the following distribution of data points over intervals:

interval

u_{1}

: 6 data points;

interval

u_{2}

: 3 data points.

4.: Division of initial intervals into subintervals.

The original paper [23] recommended dividing the initial interval with the largest number of data points into 4 subintervals of the same length and the interval with the second largest number of data points into 3 subintervals of the same length. Let us divide the first initial interval into 4 subintervals of the same length and the second initial interval into 3 subintervals of the same length. As a result, we have the following set of operating intervals:

u_{1} = [- 3000, - 4500]

with the midpoint

m_{1} = - 3750

;

u_{2} = [- 4500, - 6000]

with the midpoint

m_{2} = - 5250

;

u_{3} = [- 6000, - 7500]

with the midpoint

m_{3} = - 6750

;

u_{4} = [- 7500, - 9000]

with the midpoint

m_{4} = - 8250

;

u_{5} = [- 9000, - 11,000]

with the midpoint

m_{5} = - 10,000

;

u_{6} = [- 11,000, - 13,000]

with the midpoint

m_{6} = - 12,000

;

u_{7} = [- 13,000, - 15,000]

with the midpoint

m_{7} = - 14,000

.

5.: Formation of fuzzy categories.
Let us form fuzzy categories on the formed intervals. Let us represent these categories in the form of triangular fuzzy numbers, the graphs of membership functions of which are shown in Figure 2.

In our case, these fuzzy categories are formally formed on the intervals generated above and have nothing to do with subjective verbal assessments. Therefore, we will simply call them fuzzy categories. If desired, linguistic labels can be assigned to these categories, but this will not change the essence of the fuzzy forecasting method under consideration.

6.: Fuzzification of historical data.

Let us restore the historical data on net migration in Latvia in the 2nd column of Table 2. For each data point, we determine its membership to the corresponding interval. Assign this data point a fuzzy category for this interval. Fuzzified historical data values are presented in the 3rd column of Table 2.

7.: Formation of fuzzy categories groups.

For each of the groups of fuzzy categories in the 3rd column of Table 2, we form the corresponding group of fuzzy categories based on the following rules:

-
for an intermediate fuzzy category $A_{j}$ , the group includes fuzzy categories $A_{j - 1}$ , $A_{j}$ , $A_{j + 1}$ ;
-
for the first fuzzy category $A_{1}$ , the group includes fuzzy categories $A_{1}$ , $A_{2}$ ;
-
for the last fuzzy category $A_{n}$ , the group includes fuzzy categories $A_{n - 1}$ , $A_{n}$ .

Groups of fuzzy categories formed on the base of these rules are presented in the 4th column of Table 2.

8.: Calculation of predicted values.

To calculate the forecasted values of fuzzy time series in [23], possible calculation expressions are used:

f_{j} = {\begin{matrix} \frac{1 + 0.5}{\frac{1}{m_{1}} + \frac{0.5}{m_{2}}}, j = 1; \\ \frac{0.5 + 1 + 0.5}{\frac{0.5}{m_{j - 1}} + \frac{1}{m_{j}} + \frac{0.5}{m_{j + 1}}}, 2 \leq j \leq n - 1; \\ \frac{0.5 + 1}{\frac{0, 5}{m_{n - 1}} + \frac{1}{m_{n}}}, j = n . \end{matrix} j = 1;

(4)

where

m_{j - 1}

,

m_{j}

,

m_{j + 1}

interval midpoints for fuzzy categories

A_{j - 1}

,

A_{j}

,

A_{j + 1}

, respectively.

As an example, let us calculate the forecasted value for the fuzzy category

A_{6}

in the first row of Table 2.

\begin{matrix} f_{6} = \frac{0.5 + 1 + 0.5}{\frac{0.5}{m_{5}} + \frac{1}{m_{6}} + \frac{0.5}{m_{7}}} = \frac{2}{\frac{0.5}{(- 10,000)} + \frac{1}{(- 12,000)} + \frac{1}{(- 14,000)}} = \\ = \frac{2}{- 0.00005 + (- 0.00008) + (- 0.00004)} = \frac{2}{(- 0.00017)} = - 11765 \end{matrix}

The remaining calculations are performed by analogy. The calculation results are presented in the 5th column of Table 2.

A feature of this method is that the calculation of the forecasted value is performed at the current point in the time sequence. Therefore, we cannot formally calculate the predicted value of net migration in 2021. Considering that in years 2019 and 2020 the values of net migration were estimated by the fuzzy category

A_{1}

, we predict that in 2021 the expected value will also be estimated by the fuzzy category

A_{1}

. Turning to numerical values, we can expect a net migration value in the interval

[- 3000, - 4500]

.

5.3. Forecasting Based on Method3

This method is proposed in [25]. The calculation procedures using this method are like to the calculation procedures using Method2. The difference between these methods is as follows: Method2 uses the original historical data as input data, Method3 uses the relative changes in the forecast value between two successive time periods as input data points, expressed as a percentage.

For historical data on net migration in Latvia, these relative changes are presented in the 3rd column of Table 3. Changes are calculated as follows. Let in a year

i

the value of net migration be

a_{i}

, and in a year

(i + 1)

−

a_{i + 1}

. Then

percent change = (\frac{a_{i + 1} - a_{i}}{a_{i}}) * 100 %

(5)

Calculation of forecasted values of net migration includes the following sequence of procedures.

Determination of the operating range of percentage changes.

The maximum negative value of percent change is −39.3, the maximum positive value is 23.0. We have the following initial range:

[- 39.3, 23.0]

. To simplify subsequent calculations, we form the following operating range:

[- 40, 24]

.

2.: The initial division of the operating range into intervals.

At this step, we divide the operating range into two initial intervals:

\begin{matrix} u_{1} = [- 40, 0]; \\ u_{2} = [0, 24] \end{matrix}

Taking into account the specifics of new forecasted value and the large spread of its values, in order to increase the accuracy of the forecasting results, we form 8 subintervals of the same length in the initial interval

u_{1}

and 4 subintervals of the same length in the initial interval

u_{2}

. As a result, we have the following set of working intervals:

u_{1} = [- 40, - 35]

with the midpoint

m_{1} = - 37.5

;

u_{2} = [- 35, - 30]

with the midpoint

m_{2} = - 32.5

;

u_{3} = [- 30, - 25]

with the midpoint

m_{3} = - 27.5

;

u_{4} = [- 25, - 20]

with the midpoint

m_{4} = - 22.5

;

u_{5} = [- 20, - 15]

with the midpoint

m_{5} = - 17.5

;

u_{6} = [- 15, - 10]

with the midpoint

m_{6} = - 12.5

;

u_{7} = [- 10, - 5]

with the midpoint

m_{7} = - 7.5

;

u_{8} = [- 5, 0]

with the midpoint

m_{8} = - 2.5

;

u_{9} = [0, 6]

with the midpoint

m_{9}^{} = 3

;

u_{10} = [6, 12]

with the midpoint

m_{10} = 9

;

u_{11} = [12, 18]

with the midpoint

m_{11} = 15

;

u_{12} = [18, 24]

with the midpoint

m_{12} = 21

.

3.: Formation of fuzzy categories.

We form fuzzy categories on each of the intervals of values of new forecasted variable. Let us represent each fuzzy category in the form of a triangular fuzzy number. Graphs of membership functions for all fuzzy categories are shown in Figure 3.

4.: Fuzzification of initial data.

Let us attribute each value of the percentage change to the fuzzy category, in the interval of which the given value falls. Fuzzified change values are presented in the 5th column of Table 3.

5.: Formation of fuzzy categories groups.

This procedure is exactly the same as when using Method2. The formed groups of categories are presented in the 6th column of Table 3.

6.: Forecasting of percentage change values.

The calculation of the relevant predicted values is performed according to the expression (4). As an example, let us provide the calculation of the forecasted value

f_{1}

for the fuzzy category

A_{12}

in the first row of Table 3.

f_{1} (%) = \frac{0.5 + 1}{\frac{0.5}{m_{11}} + \frac{1}{m_{12}}} = \frac{1.5}{\frac{0.5}{15} + \frac{1}{21}} = \frac{1.5}{0.03333 + 0.04762} = \frac{1}{0.08095} = 18.6

The remaining calculations are performed by analogy. The calculation results are presented in the 7th column of Table 3.

7.: Prediction of net migration values.

For the current value of net migration in a year

i

, this value is calculated based on the historical value of net migration in a year

(i - 1)

and forecasted percentage change in net migration in a year

(i - 1)

relative to the year

i

. The forecasted net migration values are presented in the 8th column of Table 3.

How can we forecast the expected value of net migration in 2021? To do this, we assume that the percentage change in net migration relative to 2020 will belong to the fuzzy category

A_{7}

, that is, it will be in the interval

[- 10, - 5] (%)

. From here, we can expect that the actual value of net migration in 2021 will be in the interval

[- 2992, - 2835]

.

6. Evaluation and Analysis of Results

Let us evaluate the accuracy of the forecasting results. To do this, for each of the algorithms used, we calculate the absolute deviations

| a_{i} - f_{i} |

and relative absolute deviations

\frac{| a_{i} - f_{i} |}{| a_{i} |}

of the predicted values from the corresponding historical values. The results are presented in Table 4, Table 5 and Table 6.

The largest average relative error occurs when using Method1: 0.31105 or 31%. When using Method2, this error is 0.04502 or 4.5%. The smallest average relative error occurs when using Method3: 0.01841 or 1.8%.

The large value of the average relative error when using Method1 is due to the underlying principle of calculating forecasted values. Let the fuzzy linguistic category of net migration be equal A_j in a year

i

and in a year

(i + 1)

the fuzzy linguistic category of net migration be equal to

A_{k 1}

. If, in addition to a fuzzy connection

A_{j} \to A_{k 1}

, a fuzzy linguistic category

A_{j}

has fuzzy logical connectives with fuzzy linguistic categories

A_{k 2}, A_{k 2}, \dots

, then the forecasted value of net migration in a year

(i + 1)

will depend on the midpoints of all intervals corresponding to fuzzy linguistic categories

A_{k 2}, A_{k 2}, \dots

. If these fuzzy linguistic categories are far from the fuzzy linguistic category

A_{k 1}

on the measurement scale of the forecasted variable, then the estimated forecasted value in the year

(i + 1)

may be significantly less or greater than its historical value. This leads to large forecasting errors.

This method is very sensitive to sudden changes in historical data values at close time points. The statistics on net migration in Latvia until 2012 are of just such a nature, so in this article, we limited ourselves to data on net migration in 2012–2020.

Chen’s algorithm [17] was once proposed as an alternative to the original algorithm in [15,16,17] in order to get rid of extremely large calculations. Although this goal has been achieved, the results of forecasting based on this algorithm are rather inaccurate. We used this archaic algorithm in this work only for the purpose of comparing its results with the results given by modern, more accurate algorithms.

It should be noted that Chen’s algorithm is indeed a forecasting method, since a fuzzy linguistic category in a year

i

and its fuzzy logical connections with other fuzzy linguistic categories are used to calculate the predicted value in a year

(i + 1)

.

When using Method2 [23], fuzzy categories of forecasted value are formed not in isolation from the intervals defined in the operating range of changes in this value, but directly on these intervals. This greatly simplifies the practical use of the method.

In general, fuzzy linguistic categories, as in Chen’s algorithm, appear to be rather artificial constructions since their main purpose is to link with operating intervals. The relevant calculations use the midpoints of these intervals. The use of fuzzy linguistic categories makes sense if the subjective assessment of historical data is possible only in terms of such categories, for example, the subjective assessment of the quality of weather or the mood of an individual. However, even in such very uncertain conditions, it is necessary to introduce some, for example, a point scale, for measuring a historical variable. This is required by the very essence of Chen’s algorithm.

An essential feature of Method2 is that each forecasted value is calculated at the current time point, so long as in essence, this method is a specific analogue of the method of smoothing deterministic time sequences based on the moving average. Therefore, to forecast the value of the relevant quantity outside the time sequence, some heuristic approach must be used.

The highest accuracy of forecasting results was achieved using Method3 [25]. The following explanation can be offered for this. At the very beginning of the forecasting process, the original time sequence is transformed into another time sequence, the elements of which are net migration changes at two consecutive time points, expressed as a percentage. The new scale explicitly reflects exactly the successive changes in the original historical data at all time points. This transformation of historical data makes it possible to achieve high forecasting accuracy. Implicitly ignoring such changes in Method1 leads to inaccurate forecasting results.

Except the sensitivity to sudden changes in historical data values at close time points, all three methods give once relatively short-term reliable forecasts. Forecasting results using the first time points of prospect interval leadlead to fast coming the only one value.

In a broader context, all versions of the fuzzy time series forecasting methods proposed in literature use historical enrolment data from Alabama University. The results obtained, which are more accurate than those obtained by other methods, are interpreted by the authors as an advantage of their proposed method.

However, there are relatively few analyses in literature of forecasting parameters influence on the accuracy of its results. These analyses are mainly concerned with the effect of interval lengths on the accuracy of the results. Unfortunately, the authors of the article did not find any source in which a detailed analysis of the impact of sharp changes in the predicted value at various time points on the forecasting accuracy would be given. This problem may become a direction for further research.

7. Discussion

In this article, the fuzzy time series forecasting methods are used to forecast net migration in Latvia. The reason is the high degree of uncertainty in the initial data. These initial data are transformed in the form of linguistic categories. The forecasting procedures are implemented using these linguistic categories. The output data are deterministic numbers. These numbers can be transformed into intervals accordingly to relevant fuzzy categories for input data.

To evaluate and compare the forecasting results, the only criterion used was relative forecasting error; as far as computational complexity of all three methods, it is roughly identical.

Fuzzy time series forecasting methods have undeniable advantages when initial data have a high degree of uncertainty.

However, these methods have an important drawback: they give only short-term forecasts. For migration problems in Latvia, a short-term forecast is sufficient in order to make decisions for migration flow stabilization.

We can use traditional well-known deterministic forecasting methods for migration forecasting and receive deterministic results. These results have low reliability. Therefore, the use of effective fuzzy time series forecasting method for migration forecasting in Latvia seems theoretically and practically well founded.

8. General Conclusions

Using the results received in this work, it can be asserted that the fuzzy methods are suitable for migration flow forecasting in Latvia. This is due to the high degree of uncertainty in migration statistical data. In addition to official migration registration, there are considerable latent migration flows that are not reflected in statistical data. Therefore, the use of fuzzy time series forecasting for migration flows in Latvia seems completely well founded.

This article presents three alternative methods for predicting net migration in Latvia based on a fuzzy time series. In modern historical conditions, negative net migration in Latvia takes place from 1992 to 2020. Until 2012, changes in net migration were irregular and convulsive. In some years, the value of net migration was several times higher than its value in the previous year.

In the last decade, there has been some order in the successive values of net migration. In the last two years, there has been a steady decrease. It can be expected that negative net migration will change its sign to positive in the coming years. However, for this, the following conditions must be met: (1) an increase in the well-being of population in Latvia, which will lead to an increase in the flow of re-emigrants to Latvia; (2) for migration flow, the government needs to provide favorable conditions for residents of Latvia which should guarantee a high living level; and (3) more active involvement of workers and specialists from other countries.

Author Contributions

Conceptualization, O.U.-R.; methodology, O.U.-R.; validation, O.U.-R. and P.G.; formal analysis, O.U.-R. and P.G.; investigation, O.U.-R. and P.G.; resources, P.G.; data curation, P.G.; writing—original draft preparation, O.U.-R. and P.G.; writing—review and editing O.U.-R. and P.G., supervision O.U.-R. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Perrochoud, R.; Jyllane, R.-C. (Eds.) Glossary on Migration, 2nd ed.; International Organization for Migration: Grand-Saconnex, Switzerland, 2011. [Google Scholar]
Bonfiglio, A. New Approaches for Researching the Determinants of Migration Processes: ESF Strategic Workshop on Migration Research; European Science Foundatio: Strasbourg, France, 2011; 19p. [Google Scholar]
Madison, G. Existential Migration. Existent. Anal. 2006, 17, 238–260. [Google Scholar]
Sohst, R.R.; Tjaden, J.; de Valk, H.; Susanne, H. The Future of Migration to Europe. In A Systematic Review of the Literature on Migration Scenarios and Forecasts; International Organization for Migration (IOM): Grand-Saconnex, Switzerland, 2020; 84p. [Google Scholar]
De Haas, H. The determinants of international migration. In Conceptualizing Policy, Origin and Destination Effects; The IMI Working Papers Series; University of Oxford: Oxford, UK, 2011; 35p. [Google Scholar]
De Haas, H. A theory of migration: The aspirations–capabilities. Comp. Migr. Stud. 2021, 2121, 35. [Google Scholar]
Rufa, M. Migration Process in Latvia: A Brief History and Driving Forces. 2020. Available online: https://stat.gov.lv/en/statistics-themes/population/migration (accessed on 11 April 2022).
Montgomery, D.C.; Jennings, C.L.; Murat, K. Introduction to Time Series Analysis and Forecasting; John Wiley & Sons: Hoboken, NJ, USA, 2008. [Google Scholar]
Box, G.E.P.; Jenkins, G.M.; Reinsel, G.C.; Greta, M.L. Time Series Analysis Forecasting and Control, 5th ed.; John Wiley & Sons: Hoboken, NJ, USA, 2016. [Google Scholar]
Brockwell, P.J.; Richard, A.D. Introduction to Time Series and Forecasting, 2nd ed.; Springer: Berlin/Heidelberg, Germany, 2017. [Google Scholar]
Zadeh, L.A. Fuzzy sets. Inf. Control. 1965, 8, 338–353. [Google Scholar] [CrossRef] [Green Version]
Zadeh, L.A. The concept of a linguistic variable and its application to approximate reasoning. Part I-III. Inf. Sci. 1975, 9, 43–80. [Google Scholar] [CrossRef]
Zadeh, L.A. Outline of a new approach to the analysis of complex systems and decision process. IEEE Trans. Syst. Man Cybern. 1975, 3, 28–44. [Google Scholar]
Song, Q.; Chissom, B.S. Forecasting Enrollments with Fuzzy Time Series. Part I. Fuzzy Sets Syst. 1993, 54, 1–9. [Google Scholar] [CrossRef]
Song, Q.; Chissom, B.S. Fuzzy time series and its models. Fuzzy Sets Syst. 1993, 54, 269–277. [Google Scholar] [CrossRef]
Song, Q.; Chissom, B.S. Forecasting Enrollments with Fuzzy Time Series. Part II. Fuzzy Sets Syst. 1994, 62, 1–8. [Google Scholar] [CrossRef]
Chen, S.-M. Forecasting Enrollments Based on Fuzzy Time Series. Fuzzy Sets and Systems 1996, 81, 311–319. [Google Scholar] [CrossRef]
Chen, S.-M.; Hsu, C.-C. A New Method to Forecast Enrollments Using Fuzzy Time Series. Int. J. Appl. Sci. Eng. 2004, 3, 234–244. [Google Scholar]
Hwang, J.-R.; Chen, S.-M.; Chia-Hoang, L. Handling forecasting problems using fuzzy time series. Fuzzy Sets Syst. 1998, 100, 217–228. [Google Scholar] [CrossRef]
Lee, M.H.; Efendi, R.; Zuhaieny, I. Modified Weighted Enrollment Forecasting Based on Fuzzy Time Series. Matematika 2009, 25, 67–78. [Google Scholar]
Huarng, K.-Y. Effective lengths of intervals to improve forecasting in fuzzy time series. Fuzzy Sets Syst. 2001, 123, 387–394. [Google Scholar] [CrossRef]
Huarng, K.; Tiffany, H.-K.Y. Ratio-Based Length of Intervals to Improve Fuzzy Time Series Forecasting. IEEE Trans. Syst. Man Cybern. Part B Cybern. 2006, 36, 328–340. [Google Scholar] [CrossRef]
Jilani, T.A.; Burney, S.M.A.; Ardil, C. Fuzzy Metric Approach for Fuzzy Time Series Forecasting based on Frequency Density Based Partitioning. Int. J. Comput. Inf. Eng. 2010, 4, 1194–1199. [Google Scholar]
Li, S.-T.; Cheng, Y.-C. Deterministic fuzzy time series model for forecasting enrollments. Comput. Math. Appl. 2007, 53, 1904–1920. [Google Scholar] [CrossRef] [Green Version]
Stewenson, M.; John, E.P. Fuzzy TimeSeries Forecasting Using Percentage Change as the Universe of Discourse. Int. J. Math. Comput. Sci. 2009, 3, 464–467. [Google Scholar]
Vovan, T. An improved fuzzy time series forecasting model using variations of data. Fuzzy Optim. Decis. Mak. 2019, 18, 151–173. [Google Scholar] [CrossRef]
Yolcu, U.; Egrioglu, E.; Vedide, R.; Basaran, M.A.; Cagdas, H.A. A new approach for determining the length of interval for fuzzy time series. Appl. Soft Comput. 2009, 9, 647–651. [Google Scholar] [CrossRef]
Pathak, H.K.; Prachi, S. A new Bandwidth Interval Based Forecasting Method for Enrollments Using Fuzzy Time Series. Appl. Math. 2011, 2, 504–507. [Google Scholar] [CrossRef] [Green Version]
Wang, C.-H.; Hsu, L.-C. Constructing and applying an improved fuzzy time series model: Taking the tourism industry for example. Expert Syst. Appl. 2008, 24, 2732–2738. [Google Scholar] [CrossRef]
Huarng, K.-Y.; Yu, T.H.-K.; Montinho, L.; Wang, Y.-C. Forecasting tourism demand by fuzzy time series model. Int. J. Cult. Tour. Hosp. Res. 2012, 6, 377–388. [Google Scholar] [CrossRef]
Aladag, C.H.; EgriogluErol, Y.U.; Vedide, R.U. A high order seasonal fuzzy time series model and application to international tourism demand of Turkey. J. Intell. Fuzzy Syst. 2014, 26, 295–302. [Google Scholar] [CrossRef]
Jilani, T.A.; Syed, M.A.B. A refined fuzzy time series model for stock market forecasting. Phys. A Stat. Mech. Its Appl. 2008, A387, 2857–2862. [Google Scholar] [CrossRef]
Wong, H.-L.; Tu, Y.-H.; Wang, C.-C. Application of fuzzy time series models for forecasting the amount of Taiwan export. Expert Syst. Appl. 2011, 37, 1465–1470. [Google Scholar] [CrossRef]
Tseng, F.M.; Tzeng, G.H.; Yu, H.C.; Yuan, B.J. Fuzzy ARIMA model for forecasting the foreign exchange market. Fuzzy Sets Syst. 2001, 168, 9–19. [Google Scholar] [CrossRef]
Xie, Y.; Zhang, P.; Yanyi, C. A Fuzzy ARIMA Correction Model for Transport Volume Forecast. Math. Probl. Eng. 2021, 6655102, 10. [Google Scholar] [CrossRef]
Kannan, K.; Senthamarai, B.M.S.; Fathima, S.S.A. Comparison of Fuzzy Time Series and ARIMA Model. Int. J. Sci. Technol. Res. 2019, 8, 1872–1876. [Google Scholar]
Rahman, N.H.A.; Lee, M.H.; Latif, M.T. Artificial neural networks and fuzzy time series forecasting: An application to air quality. Qual. Quant. 2015, 49, 2633–2647. [Google Scholar] [CrossRef]
Duru, O.; Butler, M. Modelling and Forecasting with Fuzzy Time Series and Artificial Neural Networks. Adv. Bus. Manag. Forecast. 2017, 12, 155–180. [Google Scholar] [CrossRef]
Huarng, K.; Tiffany, H.-K.Y. The application of neural networks to forecast fuzzy time series. Phys. A Stat. Mech. Its Appl. 2006, 363, 481–495. [Google Scholar] [CrossRef]
Sahin, A.; Dodurka, M.F.; Kumbassar, T.; Yesil, E.; Siradag, S. Review Study on Fuzzy Time Series and Their Application in the Last Fifteen. In Proceedings of the International Fuzzy Systems Symposium (FUZZYSS’15), Istanbul, Turkey, 5–6 November 2015. [Google Scholar]

Figure 1. Graphical representation of intervals and fuzzy linguistic categories in the problem of fuzzy forecasting of net migration in Latvia according to Method1.

Figure 2. Graphical representation of intervals and fuzzy categories in the problem of fuzzy forecasting of net migration in Latvia according to Method2.

Figure 3. Graphical representation of intervals and fuzzy categories of percent changes in the fuzzy forecasting problem of net migration in Latvia according to Method3.

Table 1. Historical data on migration flows in Latvia in 2012–2020 and the results of their processing according to Method1.

Year	Number of People Who Left Latvia	Number of People Who Entered Latvia	Net Migration (Balance) $(a_{i})$	Fuzzy Categories of Net Migration	Forecasted Values of Net Migration $(f_{i}^{})$
1	2	3	4	5	6
2012	25,163	13,303	−11,860	$A_{4}$	-
2013	22,561	8299	−14,262	$A_{5}$	−11,500
2014	19,017	10,365	−8652	$A_{3}$	−9000
2015	20,119	9479	−10,640	$A_{3}$	−9000
2016	20,574	8345	−12,279	$A_{4}$	−9000
2017	17,724	9916	−7808	$A_{3}$	−11,500
2018	15,814	10,909	−4905	$A_{2}$	−9000
2019	14,583	11,233	−3360	$A_{1}$	−4000
2020	11,992	8840	−3150	$A_{1}$	−4000

Table 2. Historical data on migration flows in Latvia in 2012–2020 and the results of their processing according to Method2.

Year	Net Migration (Balance) $(a_{i})$	Fuzzy Categories of Net Migration	Groups of Fuzzy Categories	Forecasted Values of Net Migration $(f_{i})$
1	2	3	4	5
2012	−11,860	$A_{6}$	$A_{5}, A_{6}, A_{7}$	−11,765
2013	−14,262	$A_{7}$	$A_{6}, A_{7}$	−13,636
2014	−8652	$A_{4}$	$A_{3}, A_{4}, A_{5}$	−8333
2015	−10,640	$A_{5}$	$A_{4}, A_{5}, A_{6}$	−10,000
2016	−12,279	$A_{6}$	$A_{5}, A_{6}, A_{7}$	−11,765
2017	−7808	$A_{4}$	$A_{3}, A_{4}, A_{5}$	−8333
2018	−4905	$A_{2}$	$A_{1}, A_{2}, A_{3}$	−5128
2019	−3360	$A_{1}$	$A_{1}, A_{2}$	−3261
2020	−3150	$A_{1}$	$A_{1}, A_{2}$	−3261

Table 3. Historical data on migration flows in Latvia in 2012–2020 and the results of their processing according to Method3.

Year	Net Migration $(a_{i})$	Sequences of Years	Migration Changes $(%)$	Fuzzy Categories	Groups of Fuzzy Categories	Forecasted Changes $f_{i} (%)$	Forecasted Values of Net Migration $(f_{i})$
1	2	3	4	5	6	7	8
2012	−11,860	2012–2013	20.3	$A_{12}$	$A_{11}, A_{12}$	18.6	-
2013	−14,762	2013–2014	−39.3	$A_{1}$	$A_{1}, A_{2}$	−37.5	−14,067
2014	−8652	2014–2015	23.0	$A_{12}$	$A_{11}, A_{12}$	18.6	−9045
2015	−10,640	2015–2016	15.4	$A_{11}$	$A_{10}, A_{11}, A_{12}$	13.7	−10,727
2016	−12,279	2016–2017	−36.4	$A_{1}$	$A_{1}, A_{2}$	−35.7	−12,197
2017	−7808	2017–2018	−37.2	$A_{1}$	$A_{1}, A_{2}$	−35.7	−7924
2018	−4905	2018–2019	−31.5	$A_{2}$	$A_{1}, A_{2}, A_{3}$	−32.1	−4979
2019	−3368	2019–2020	−6.2	$A_{7}$	$A_{6}, A_{7}, A_{8}$	−5.4	−3395
2020	−3150						−3186

Table 4. Deviations of forecasted values of net migration from its historical values when using Method1.

Year	$a_{i}$	$f_{i}$	$\| a_{i} - f_{i} \|$	$\frac{\| a_{i} - f_{i} \|}{\| f_{i} \|}$
2012	−11,860	-	-	-
2013	−14,262	−11,500	2762	0.19366
2014	−8652	−9000	348	0.4022
2015	−10,640	−9000	1640	0.15414
2016	−12,279	−9000	3279	0.26704
2017	−7808	−11,500	3692	0.47285
2018	−4905	−9300	4395	0.89602
2019	−3360	−4000	640	0.19048
2020	−3150	−4000	850	0.26984
Mean relative error				0.31105

Table 5. Deviations of forecasted values of net migration from its historical values when using Method2.

Year	$a_{i}$	$f_{i}$	$\| a_{i} - f_{i} \|$	$\frac{\| a_{i} - f_{i} \|}{\| f_{i} \|}$
2012	−11,860	−11,765	95	0.00801
2013	−14,262	−13,636	626	0.04389
2014	−8652	−8333	319	0.03687
2015	−10,640	−10,000	640	0.06015
2016	−12,279	−11,765	514	0.04186
2017	−7808	−8333	525	0.06724
2018	−4905	−5128	223	0.04546
2019	−3360	−3261	99	0.02946
2020	−3150	−3261	111	0.03524
Mean relative error				0.04502

Table 6. Deviations of forecasted values of net migration from its historical values when using Method3.

Year	$a_{i}$	$f_{i}$	$\| a_{i} - f_{i} \|$	$\frac{\| a_{i} - f_{i} \|}{\| f_{i} \|}$
2012	−11,860	-	-	-
2013	−14,262	−14,067	195	0.01367
2014	−8652	−9045	393	0.04542
2015	−10,640	−10,727	87	0.00818
2016	−12,279	−12,197	82	0.00668
2017	−7808	−7524	284	0.03637
2018	−4905	−4979	74	0.01509
2019	−3360	−2395	35	0.01042
2020	−3150	−3186	36	0.01143
Mean relative error				0.01841

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Uzhga-Rebrov, O.; Grabusts, P. Analysis of Fuzzy Time Series Forecasting for Migration Flows. Symmetry 2022, 14, 1441. https://doi.org/10.3390/sym14071441

AMA Style

Uzhga-Rebrov O, Grabusts P. Analysis of Fuzzy Time Series Forecasting for Migration Flows. Symmetry. 2022; 14(7):1441. https://doi.org/10.3390/sym14071441

Chicago/Turabian Style

Uzhga-Rebrov, Oleg, and Peter Grabusts. 2022. "Analysis of Fuzzy Time Series Forecasting for Migration Flows" Symmetry 14, no. 7: 1441. https://doi.org/10.3390/sym14071441

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Analysis of Fuzzy Time Series Forecasting for Migration Flows

Abstract

1. Introduction

2. Migration and Migration Flows

3. Literature Review about Fuzzy Time Series Forecasting

4. Conceptual Foundations of Fuzzy Time Series Forecasting

5. Forecasting Net Migration in Latvia Based on Fuzzy Time Series Forecasting Methods

5.1. Forecasting Based on Method1

5.2. Forecasting Based on Method2

5.3. Forecasting Based on Method3

6. Evaluation and Analysis of Results

7. Discussion

8. General Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI