Sales Forecasting Methods
Sales Forecasting Methods
Sales Forecasting Methods
Statistical Techniques
These statistical techniques include various well-known models that have formal statistical
foundations:
Moving Average
Weighted Moving Average
Least Square Regression
Linear Approximation
Second Degree Approximation
Linear Smoothing
Exponential Smoothing
Holt Winters
BOX & Jenkins Model
Regression Models
ARIMA & ARIMAX
These methods have been implemented in different areas and they provide satisfactory results.
However, their efficiency strongly depends of the field of application, the forecast goal (especially the
horizon) or the user experience.
The market is strongly impacted by numerous factors which make the sales very fluctuated. These
factors, also called explanatory variables, are sometimes not controlled and even unknown. Some of
them involve an increase of the purchase decision, others modify the store traffic. Hence, the difficulty
to exactly identify them and to quantify their impact.
The impact of these variable could be very dissimilar on sales. Indeed, some variables generate
punctual fluctuations without significantly affecting the overall volume of sales, for instance a
temporal price discount produces peaks of sales.
Others impact more globally the sales such as macroeconomic data or strategy of retail. For instance,
sales can show an unexpected decline which could be explained from these kinds of factors.
Consequently, these methods are not easily and not efficiently implemented, they require large
historical data sets, a complex optimization of their parameters, a certain experience of the operator,
and they are limited to linear structure.
Data needed-
Item Descriptions
Store Details (Size, Employees)
Distribution Hierarchy
Customer Details (Type, Buying pattern)
Customer Demographics
Promotion Details
Discounts Details
Return/Cancellation
Items Association
Reviews (Sentiment)
Sales, Profit/Cost
Approach
We can develop an ensemble of models where we divided all the styles into few buckets of
High volume – high duration
High volume – low duration
Low volume – high duration
Low volume – low duration and completely new launches.
For the styles that have high volume and high duration, we can still use a time-series or a
regression technique but for all the others these traditional methods will have limitations.
Hence, we needed to apply ML techniques for these styles.
For the styles with low duration, we can use Random forest and XGB methods to arrive at the
predictions. Also for these styles what was more important was to get a proper demand
prediction rather than identify the drivers of sales and their impact, hence ML techniques.