\( \newcommand{\mathbbm}[1]{\boldsymbol{\mathbf{#1}}} \)

9.4 ETS + ARIMA

Coming back to the topic of ETS and ARIMA, we can now look at their combination from the point of view of the SSOE state space model.

9.4.1 Pure additive models

A pure additive ETS + ARIMA model can be formulated in the general form, which we have already discussed several times in this monograph (Section 5.1): \[\begin{equation*} \begin{aligned} &{y}_{t} = \mathbf{w}^\prime \mathbf{v}_{t-\boldsymbol{l}} + \epsilon_t \\ &\mathbf{v}_{t} = \mathbf{F} \mathbf{v}_{t-\boldsymbol{l}} + \mathbf{g} \epsilon_t \end{aligned}, \end{equation*}\] but now the matrices and vectors of the model contain ETS and ARIMA components, stacked one after another.

For example, if we want to construct ETS(A,N,A)+ARIMA(2,0,0), we can formulate this model as: \[\begin{equation} \begin{aligned} &{y}_{t} = l_{t-1} + s_{t-m} + v_{1,t-1} + v_{2,t-2} + \epsilon_t \\ &l_t = l_{t-1} + \alpha \epsilon_t \\ &s_t = s_{t-m} + \gamma \epsilon_t \\ &v_{1,t} = \phi_1 v_{1,t-1} + \phi_1 v_{2,t-2} + \phi_1 \epsilon_t \\ &v_{2,t} = \phi_2 v_{1,t-1} + \phi_2 v_{2,t-2} + \phi_2 \epsilon_t \end{aligned}, \tag{9.30} \end{equation}\] where \(\phi_i\) is the \(i\)-th parameter of the AR(2) part of the model. Represented in the conventional additive SSOE state space form, this model leads to the following matrices and vectors: \[\begin{equation} \begin{aligned} \mathbf{w} = \begin{pmatrix} 1 \\ 1 \\ 1 \\ 1 \end{pmatrix}, & \mathbf{F} = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & \phi_1 & \phi_1 \\ 0 & 0 & \phi_2 & \phi_2 \end{pmatrix}, & \mathbf{g} = \begin{pmatrix} \alpha \\ \gamma \\ \phi_1 \\ \phi_2 \end{pmatrix}, \\ \mathbf{v}_{t} = \begin{pmatrix} l_t \\ s_t \\ v_{1,t} \\ v_{2,t} \end{pmatrix}, & \mathbf{v}_{t-\boldsymbol{l}} = \begin{pmatrix} l_{t-1} \\ s_{t-m} \\ v_{1,t-1} \\ v_{2,t-2} \end{pmatrix}, & \boldsymbol{l} = \begin{pmatrix} 1 \\ m \\ 1 \\ 2\end{pmatrix} \end{aligned}. \tag{9.31} \end{equation}\] So, in this formulation, the states of ETS and ARIMA are independent and are combined only in the measurement equation. In a way, this model is similar to fitting ETS to the data and then ARIMA to its residuals sequentially, but with both elements estimated at the same time. This simultaneous estimation is supposed to remove the bias in the estimates of parameters that might otherwise appear in the sequential procedure.
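To make the mechanics of (9.30) and (9.31) more tangible, here is a minimal R sketch that builds the matrices above and runs the recursion to simulate a series. All parameter values, initial states, and object names (`matF`, `vecG`, etc.) are arbitrary choices for illustration and do not correspond to the internal implementation of ADAM.

```r
# A minimal sketch of the ETS(A,N,A)+ARIMA(2,0,0) recursion in (9.30)-(9.31),
# simulating data for illustration; all values below are arbitrary.
set.seed(41)
m <- 4                                  # seasonal periodicity
alpha <- 0.3; gamma <- 0.1              # ETS smoothing parameters
phi1 <- 0.5; phi2 <- -0.2               # AR(2) parameters
nObs <- 40

# Measurement vector, transition matrix, persistence vector and lags from (9.31)
vecW <- c(1, 1, 1, 1)
matF <- rbind(c(1, 0, 0,    0),
              c(0, 1, 0,    0),
              c(0, 0, phi1, phi1),
              c(0, 0, phi2, phi2))
vecG <- c(alpha, gamma, phi1, phi2)
lagsVector <- c(1, m, 1, 2)

# States are stored row-wise; the first m rows hold the initial values
matVt <- matrix(NA, nObs + m, 4,
                dimnames=list(NULL, c("level","seasonal","arima1","arima2")))
matVt[1:m, ] <- cbind(100, c(-5, 5, 10, -10), 0, 0)
y <- vector("numeric", nObs)
epsilon <- rnorm(nObs, 0, 5)

for(t in 1:nObs){
    # Extract v_{t-l}: each state is read with its own lag
    vLagged <- matVt[cbind(t + m - lagsVector, 1:4)]
    y[t] <- vecW %*% vLagged + epsilon[t]
    matVt[t + m, ] <- matF %*% vLagged + vecG * epsilon[t]
}
```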

ADAM introduces the flexibility necessary for fitting any ETS+ARIMA combination, but not all combinations make sense. For example, here is how ETS(A,N,N)+ARIMA(0,1,1) would look: \[\begin{equation} \begin{aligned} &{y}_{t} = l_{t-1} + v_{1,t-1} + \epsilon_t \\ &l_t = l_{t-1} + \alpha \epsilon_t \\ &v_{1,t} = v_{1,t-1} + (1+\theta_1) \epsilon_t \end{aligned}. \tag{9.32} \end{equation}\] In the transition part of the model (9.32), the two equations duplicate each other because they have exactly the same mechanism of updating the states. In fact, as we know from Section 8.4.1, ETS(A,N,N) and ARIMA(0,1,1) are equivalent when \(\alpha=1+\theta_1\). If we estimate this model, then we duplicate the state, splitting it into two parts with some arbitrary weights. This becomes apparent if we insert the transition equations into the measurement one, obtaining: \[\begin{equation} \begin{aligned} {y}_{t} = & l_{t-2} + \alpha \epsilon_{t-1} + v_{1,t-2} + (1+\theta_1) \epsilon_{t-1} + \epsilon_t =\\ & l_{t-2} + v_{1,t-2} + (1+\theta_1+\alpha) \epsilon_{t-1} + \epsilon_t \end{aligned}, \tag{9.33} \end{equation}\] which leads to an infinite number of combinations of the parameters \(\theta_1\) and \(\alpha\) that produce exactly the same model fit. So, the model (9.32) does not have unique parameters and is thus unidentifiable. This means that we cannot reach the “true model” based on ETS(A,N,N)+ARIMA(0,1,1), and thus model selection via information criteria becomes inappropriate. Furthermore, the estimates of parameters of such a model might become biased, inefficient, and inconsistent due to the “infinite number of combinations” issue mentioned above.
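The unidentifiability can also be illustrated numerically. The following R sketch (with a hypothetical helper function and arbitrary parameter values) applies the recursion of (9.32) to the same series of errors with two different splits of \(\alpha + (1+\theta_1)\) and shows that the resulting one-step-ahead point forecasts coincide:

```r
# A small numerical illustration of the identifiability problem in (9.32)
set.seed(42)
nObs <- 30
epsilon <- rnorm(nObs, 0, 1)

# The recursion of (9.32) for given parameters and initial states;
# the function name is hypothetical and used only for this illustration
etsANNplusARIMA011 <- function(alpha, theta1, l0, v0){
    l <- l0; v <- v0
    mu <- vector("numeric", nObs)
    for(t in 1:nObs){
        mu[t] <- l + v                       # one-step-ahead point forecast
        l <- l + alpha * epsilon[t]          # ETS(A,N,N) state
        v <- v + (1 + theta1) * epsilon[t]   # ARIMA(0,1,1) state
    }
    return(mu)
}

# Two parameter sets with the same alpha + (1 + theta1) = 0.3 and
# the same sum of initial states l0 + v0 = 100
fit1 <- etsANNplusARIMA011(alpha=0.25, theta1=-0.95, l0=100, v0=0)
fit2 <- etsANNplusARIMA011(alpha=0.05, theta1=-0.75, l0=60,  v0=40)

max(abs(fit1 - fit2))
# effectively zero: the two parameterisations produce exactly the same fit
```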

In other cases, only some parts of the model might be duplicated (rather than the whole model), still making the overall model unidentifiable, so it makes sense to switch to either pure ETS or pure ARIMA, depending on the circumstances. For example, if we have ETS(A,A,N)+ARIMA(0,2,3), then some parts of the models will be duplicated (because ETS(A,A,N) is equivalent to ARIMA(0,2,2)), so it would be more reasonable to switch to the pure ARIMA(0,2,3) instead. On the other hand, if we deal with ETS(A,Ad,N)+ARIMA(0,1,2), then dropping the ARIMA part would be more appropriate (due to the relation between ETS(A,Ad,N) and ARIMA(1,1,2)).

These examples show that, when using ETS+ARIMA, model building needs to be done with care in order not to end up with an unreasonable model that cannot be identified. As a general recommendation, keep the connection between ETS and ARIMA in mind (see Section 8.4) when deciding what to construct. Here is a short list of guidelines for some special cases (an illustrative R sketch is provided after the list):

  1. For ETS(A,N,N)+ARIMA(0,1,q):
    • use ARIMA(0,1,q) in case of \(q >1\);
    • use ETS(A,N,N) in case of \(q\leq 1\);
  2. For ETS(A,A,N)+ARIMA(0,2,q):
    • use ARIMA(0,2,q) in case of \(q >2\);
    • use ETS(A,A,N) in case of \(q \leq 2\);
  3. For ETS(A,Ad,N)+ARIMA(p,1,q):
    • use ARIMA(p,1,q), when either \(p>1\) or \(q>2\);
    • use ETS(A,Ad,N), when \(p \leq 1\) and \(q \leq 2\).
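To see how these guidelines translate into practice, here is an illustrative sketch, assuming the adam() function from the smooth package (the model and orders arguments follow the package documentation; the dataset is chosen arbitrarily):

```r
library(smooth)

# ETS(A,N,N)+ARIMA(0,1,1): unidentifiable, as discussed above
adam(BJsales, model="ANN", orders=c(0,1,1))

# Guideline 1 with q > 1: drop the ETS part and use pure ARIMA(0,1,q),
# which in adam() corresponds to model="NNN" with the respective orders
adam(BJsales, model="NNN", orders=c(0,1,2))

# Guideline 3 with p <= 1 and q <= 2: keep ETS(A,Ad,N) and drop the ARIMA part
adam(BJsales, model="AAdN")
```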

Regarding seasonal models, the relation between ETS and ARIMA is more complex. It is highly improbable to end up with equivalent ARIMA models in the seasonal case, so we can neglect the rules for the seasonal part and focus on making sure that the three rules above hold for the non-seasonal part of the model.

9.4.2 Pure multiplicative models

In the most general case, the pure multiplicative ETS+ARIMA model can be written as (based on (6.1) and (9.16)): \[\begin{equation} \begin{aligned} &{y}_{t} = \exp \left( \mathbf{w}_{E}^\prime \log \mathbf{v}_{E,t-\boldsymbol{l}_E} + \mathbf{w}_{A}^\prime \log \mathbf{v}_{A,t-\boldsymbol{l}_A} + \log(1+\epsilon_t) \right) \\ &\log \mathbf{v}_{E,t} = \mathbf{F}_{E} \log \mathbf{v}_{E,t-\boldsymbol{l}_E} + \log(\mathbf{1}_k + \mathbf{g}_E \epsilon_t) \\ &\log \mathbf{v}_{A,t} = \mathbf{F}_{A} \log \mathbf{v}_{A,t-\boldsymbol{l}_A} + \mathbf{g}_A \log(1+\epsilon_t) \end{aligned} , \tag{9.34} \end{equation}\] where the subscripts indicate which part corresponds to which model: “E” for ETS and “A” for ARIMA. The formulation (9.34) demonstrates that the multiplicative ETS+ARIMA might not have the same issues as the pure additive one. This is because the multiplicative ETS (Section 6.1) and the multiplicative ARIMA (Subsection 9.1.4) are formulated differently. Consider the example of ETS(M,N,N)+Log-ARIMA(0,1,1), which is formulated as: \[\begin{equation} \begin{aligned} &{y}_{t} = l_{t-1} v_{1,t-1} (1 + \epsilon_t) \\ &l_t = l_{t-1}(1 + \alpha \epsilon_t) \\ &\log v_{1,t} = \log v_{1,t-1} + (1+\theta_1) \log (1 + \epsilon_t) \end{aligned}. \tag{9.35} \end{equation}\] The last equation in (9.35) can be rewritten as \(v_{1,t} = v_{1,t-1} (1 + \epsilon_t)^{(1+\theta_1)}\), demonstrating the difference between the transition equation of ETS(M,N,N) and that of the multiplicative ARIMA(0,1,1). Still, the two models will be similar when \(\alpha\) is close to zero and, correspondingly, \(\theta_1\) is close to -1 (so that \(1+\theta_1 \approx \alpha\)). So this combination of models should be treated with care, along with other potentially similar combinations. The following combinations of the two models can be considered potentially unidentifiable under some conditions (a numerical sketch of the first case is provided after the list):

  1. ETS(M,N,N)+Log-ARIMA(0,1,1);
  2. ETS(M,M,N)+Log-ARIMA(0,2,2);
  3. ETS(M,Md,N)+Log-ARIMA(1,1,2).
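As a numerical illustration of the first case in the list, the following R sketch (with arbitrary parameter values) compares the two transition mechanisms from (9.35) and shows that they almost coincide when \(\alpha\) is small and \(1+\theta_1=\alpha\):

```r
# Comparing the ETS(M,N,N) level update with the Log-ARIMA(0,1,1) state update:
# (1 + epsilon)^alpha ~ 1 + alpha * epsilon when alpha is close to zero
set.seed(33)
nObs <- 100
epsilon <- rnorm(nObs, 0, 0.05)
alpha <- 0.05
theta1 <- alpha - 1                     # so that 1 + theta1 = alpha

lETS <- lARIMA <- vector("numeric", nObs)
lETS[1] <- lARIMA[1] <- 100
for(t in 2:nObs){
    lETS[t]   <- lETS[t-1]   * (1 + alpha * epsilon[t])
    lARIMA[t] <- lARIMA[t-1] * (1 + epsilon[t])^(1 + theta1)
}

# Maximum relative difference between the two state trajectories
max(abs(lETS / lARIMA - 1))
# a tiny number, showing that the two components effectively duplicate each other
```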

In addition, the recommendations discussed for the pure additive ETS+ARIMA can be applied here for the pure multiplicative ETS+ARIMA to guarantee that the resulting model is identifiable in all cases.

Finally, mixing additive ETS with multiplicative ARIMA or multiplicative ETS with additive ARIMA does not make sense from the modelling point of view. It only complicates the model building process, so we do not consider these exotic cases in this book, although they are theoretically possible.