1.School of Meteorology, University of Oklahoma, Norman, OK 73072, USA 2.College of Global Change and Earth System Science (GCESS), Beijing Normal University, Beijing 100875, China 3.Laboratory for Regional Oceanography and Numerical Modeling, Qingdao National Laboratory for Marine Science and Technology, Qingdao 266237, China 4.Cooperative Institute for Research in the Atmosphere, GSD/ESRL/OAR/NOAA, Boulder, CO 80305, USA 5.Fujian Meteorological Observatory, Fuzhou 350001, China 6.Wuyishan National Park Meteorological Observatory, Wuyishan 354306, China 7.State Key Laboratory of Numerical Modeling for Atmospheric Sciences and Geophysical Fluid Dynamics, Institute of Atmospheric Physics, Chinese Academy of Sciences, Beijing 100029, China 8.Plateau Atmosphere and Environment Key Laboratory of Sichuan Province, Chengdu University of Information Technology, Chengdu 610225, China Manuscript received: 2018-06-06 Manuscript revised: 2018-09-07 Manuscript accepted: 2018-10-10 Abstract:It has been demonstrated that ensemble mean forecasts, in the context of the sample mean, have higher forecasting skill than deterministic (or single) forecasts. However, few studies have focused on quantifying the relationship between their forecast errors, especially in individual prediction cases. Clarification of the characteristics of deterministic and ensemble mean forecasts from the perspective of attractors of dynamical systems has also rarely been involved. In this paper, two attractor statistics——namely, the global and local attractor radii (GAR and LAR, respectively)——are applied to reveal the relationship between deterministic and ensemble mean forecast errors. The practical forecast experiments are implemented in a perfect model scenario with the Lorenz96 model as the numerical results for verification. The sample mean errors of deterministic and ensemble mean forecasts can be expressed by GAR and LAR, respectively, and their ratio is found to approach $\sqrt{2}$ with lead time. Meanwhile, the LAR can provide the expected ratio of the ensemble mean and deterministic forecast errors in individual cases. Keywords: attractor radius, ensemble forecasting, ensemble mean, forecast error saturation 摘要:前人研究表明集合平均预报在大样本平均的情况下比确定性(或单一)预报有更高的预报技巧。然而,很少研究关注它们预报误差之间的定量关系,尤其在一些个例预报中。同时,从动力系统吸引子的角度对确定性和集合平均预报的特征进行的研究也很少。本文利用吸引子的两个统计量即全局和局部吸引子半径来揭示确定性和集合平均预报误差的关系。基于Lorenz96模型的完美模式情景下的实际预报试验结果用来作为理论的检验。确定性预报和集合平均预报的样本平均误差可以分别用全局和局部吸引子半径来表达,它们的比值随着预报时间接近$\sqrt{2}$。同时,局部吸引子半径提供了确定性和集合平均预报误差在不同个例中的期望比值。 关键词:吸引子半径, 集合预报, 集合平均, 预报误差饱和
HTML
--> --> --> -->
4.2. Evolution of ensemble mean and deterministic forecast states
The differences between deterministic and ensemble mean forecast errors are essentially associated with their differing forecast states. Therefore, the statistical characteristics of deterministic and ensemble mean forecast states are analyzed before comparing their forecast errors. Each panel of Fig. 3 illustrates the probability distribution of the deterministic (black line) and ensemble mean (blue line) forecast states over all cases at the same lead time. It shows that the probability distribution of deterministic forecasts is always consistent with that of the reference (red line) from 0.5 to 6 tu, since they are from the same attractor. In contrast, the probability distribution of ensemble mean states appears to have a narrower range and a higher peak as time increases. In other words, the ensemble mean forecasts tend to, on the whole, move toward the climatic mean value (2.22) with lead time because of the nonlinear smoothing effect of the arithmetic mean of the forecast ensemble (Toth and Kalnay, 1997). Finally, when all forecast members become chaotic with sufficiently long lead time, their ensemble mean without exception would equal the climatic mean in any individual case. It indicates that the ensemble mean reduces the forecast error compared to deterministic forecasts, but at the expense of losing information and variability in forecasts. On the other hand, according to the characteristics of forecast states, it could be expected that the saturation value of sample mean ensemble mean forecast errors would be consistent with the attractor radius, while deterministic forecast errors will saturate at the level of GAR. The conclusion is verified through the results of forecast experiments in section 4.3. Figure3. Probability (%) distribution of the ensemble mean (blue line; right-hand scale) and deterministic (black line; left-hand scale) forecast states and true states (red line; left-hand scale) over all 2× 105 samples as a function of lead time.
2 4.3. Sample mean forecast errors -->
4.3. Sample mean forecast errors
Figure 4 shows the RMS error of x1 for deterministic and ensemble mean forecasts as a function of lead time averaged over all cases. Within the initial 1 tu, the deterministic and ensemble mean forecasts have similar errors due to the offset of the approximate linear growth of the positive and negative initial ensemble perturbations. After 1 tu, ensemble mean forecasts retain smaller errors compared to deterministic ones, and their difference continuously increases with lead time. Finally, deterministic and ensemble mean forecast errors both enter the nonlinear saturation stage and reach 5.13 and 3.63, respectively. The former is the same as the GAR and the latter equals the attractor radius. Their ratio of the saturation values is $\sqrt 2$, as is derived in section 4.2. It is also consistent with the conclusions in (Leith, 1974) and (Kalnay, 2003). Figure4. RMS error averaged over 2× 105 samples for the deterministic (black solid line) and ensemble mean (red solid line) forecasts as a function of time. The dashed lines are the saturation values, 5.13 and 3.63, of deterministic and ensemble mean forecasts errors, respectively.
2 4.4. Forecast errors in individual cases -->
4.4. Forecast errors in individual cases
In comparison with the sample mean forecasts, the forecasts of a specific weather or climate event is strongly influenced by the evolving dynamics (Ziehmann et al., 2000; Corazza et al., 2003), and it is thus difficult to estimate the expected values of both deterministic and ensemble mean forecast errors. LAR is a feasible statistic to estimate the expected value of deterministic and ensemble mean forecast errors in individual cases without running practical forecasts. As the nonlinearity in forecasts intensifies, the ensemble mean approaches the mean state (see Fig. 3), while the deterministic forecast tends to be a random state on the attractor. Referring to the definition of LAR in Eq. (2), the ratio r of the expected values of deterministic and ensemble mean forecast errors for a specific predicted state xi can be expressed by: \begin{equation} r=\frac{R_{{\rm L},i}}{\|{\textbf{x}}_i-{\textbf{x}}_{\rm E}\|}=\frac{\sqrt{\|{\textbf{x}}_i-{\textbf{x}}_{\rm E}\|^2+R_{\rm E}^2}}{\|{\textbf{x}}_i-{\textbf{x}}_{\rm E}\|} . \ \ (7)\end{equation} Figure 5 shows the variation of r as a function the true state x1. It can be seen that the ensemble mean has the maximum advantage over the deterministic forecast if the truth (or the observed state) is close to the climatic mean state. When the truth gradually deviates from the mean state, the superiority of the ensemble mean over deterministic forecasts diminishes fast. For an event within 1 to 2 SD, r ranges approximately from 0.7 to 0.9. Once the event is out of 2 SD, r is almost 0.95, which means the ensemble mean and deterministic forecasts perform very similarly. This indicates that the ensemble mean has no advantage over deterministic forecasts in predicting the variabilities of extreme events, and the overall better performance of the former (see Fig. 4) originates from its higher skill for neutral events. With a long-term series of a variable, its distribution of r can be estimated in advance and used as a reference for deterministic and ensemble mean forecast skill in individual cases, especially for long-range forecasts. Figure5. Ratio between the expected values of the ensemble mean (e_EM) and deterministic (e_Det) forecast errors of x1 as a function of the observed value of x1. The red dashed line represents the mean state and the black dashed lines are 1 and 2 SD, respectively.
To verify the above result, the practical errors of deterministic and ensemble mean forecasts are compared. The forecast skills are assessed against the truth divided into three categories——namely, the neutral (within 1 climatic SD), weak extreme (within 1-2 SD), and strong extreme (beyond 2 SD) events. Figure 6 compares the deterministic and ensemble mean forecast errors for the three groups of events at lead times of 1, 2, 3 and 4 tu. It can be seen that at 1 tu the deterministic and ensemble mean forecast errors are within similar ranges; at later times, the range of the ensemble mean errors, due to the nonlinear filtering, is evidently smaller than that of the deterministic forecast errors. After 1 tu, for both the deterministic and ensemble mean forecasts, the forecast errors of an extreme event are overall larger than those of a neutral event at the same lead time, as shown in Table 1, which is essentially related to the distribution of LAR on an attractor. At long lead time (4 tu), the ratios between the average ensemble mean and deterministic forecast errors are 0.54 (1.69 vs 3.11), 0.87 (4.19 vs 4.79) and 0.99 (7.23 vs 7.33) for neutral, weak and strong extreme events, respectively, which are within the range of the expected ratio in Fig. 5. At shorter lead times, the errors of deterministic and ensemble mean forecasts become closer for neutral and weak extreme events, but the ensemble mean performs much worse (about a 20% error increase at 1 and 2 tu) for strong extreme events. For more extreme events at a given lead time, the ensemble mean forecasts are less likely to have small RMS errors, especially for longer lead times (see Figs. 6c, f, i and l). Figure6. RMS error of the ensemble mean and the deterministic forecasts for neutral events within one SD at time (a) 1 tu, (d) 2 tu, (g) 3 tu and (j) 4 tu. The second and third columns are same as the first, but for weak extreme states within 1-2 SD and strong extreme states out of 3 SD, respectively.
-->
APPENDIX
This appendix shows the processes to prove Theorem 1. R L,i and R L,j are the local attractor radii of the compact attractor $\mathcal{A}$ at state xi and xj, respectively. x E and R E are the mean state and attractor radii of $\mathcal{A}$. Based on Eq. (2), the expression of R L,i can be derived as follows: \begin{eqnarray*} R_{{\rm L},i}^2&=&E(\|{x}_i-{x}\|^2) ,\quad {x}_i,{x}\in \mathcal{A} ,\\ &=&E({x}^2-2{x}{x}_i+{x}_i^2) ,\\ &=&E({x}^2)-2{x}_iE({x})+{x}_i^2 ,\\ &=&{x}_i^2-2{x}_{\rm E}{x}_i+({x}_{\rm E}^2+R_{\rm E}^2) ,\\ &=&({x}_i-{x}_{\rm E})^2+R_{\rm E}^2 . \end{eqnarray*} R L,i reaches the minimal value R E, i.e., the attractor radius, when xi=x E; and if di>dj, R L,i>R L,j, where di and dj denote the RMS distances of xi and xj from the mean state x E.