1. State Key Laboratory of Numerical Modeling for Atmospheric Sciences and Geophysical Fluid Dynamics, Institute of Atmospheric Physics, Chinese Academy of Sciences, Beijing 100029, China
2. College of Earth Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
3. Institute of Space Weather, School of Math and Statistics, Nanjing University of Information Science and Technology, Nanjing 210044, China
4. College of Physics and Optoelectronic Engineering, Nanjing University of Information Science and Technology, Nanjing 210044, China
5. College of Global Change and Earth System Sciences, Beijing Normal University, Beijing 100875, China
Manuscript received: 2019-02-26
Manuscript revised: 2019-04-18
Manuscript accepted: 2019-04-30
Abstract: A new method to quantify the predictability limit of ensemble forecasting is presented using the Kullback-Leibler (KL) divergence (also called the relative entropy), which provides a measure of the difference between the probability distributions of ensemble forecasts and local reference (true) states. The KL divergence is applicable to a non-normal distribution of ensemble forecasts, which is a substantial improvement over the previous method using the ensemble spread. An example from the three-variable Lorenz model illustrates that the KL divergence can effectively quantify the predictability limit of ensemble forecasting. On this basis, the KL divergence is used to investigate the dependence of the predictability limit of ensemble forecasting on the initial states and on the magnitude of initial errors. The local predictability limit of ensemble forecasting varies considerably with the initial states, as well as with the magnitude of initial errors. Further research is needed to examine real-world applications of the KL divergence in measuring the predictability of ensemble weather forecasts.
Keywords: predictability, ensemble forecasting, Kullback-Leibler divergence
2.1. KL divergence
The KL divergence measures the difference between two probability distributions P and Q (Kullback and Leibler, 1951). For discrete probability distributions P and Q, the KL divergence from Q to P is defined as \begin{equation} \label{eq1} D_{\rm KL}(P\|Q)=\sum_i{P(i)\log\frac{P(i)}{Q(i)}} , \ \ (1)\end{equation} where "\|" denotes "relative to", and Eq. (1) is equivalent to \begin{equation} \label{eq2} D_{\rm KL}(P\|Q)=-\sum_i {P(i)\log\frac{Q(i)}{P(i)}} . \ \ (2)\end{equation} For distributions P and Q of a continuous random variable x, the KL divergence is defined as \begin{equation} \label{eq3} D_{\rm KL}(P\|Q)=\int_{-\infty}^\infty p(x)\log\frac{p(x)}{q(x)}{\rm d}x , \ \ (3)\end{equation} where p and q represent the probability densities of P and Q. The KL divergence is always non-negative, and $D_{\rm KL}(P\|Q)$ is zero if and only if P=Q.
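As a concrete illustration of Eq. (1), the Python sketch below estimates the discrete KL divergence from two sets of samples via histograms on shared bins. The sample data, bin edges, and the small floor used to avoid division by zero are illustrative choices, not part of the original formulation.

import numpy as np

def kl_divergence(p, q, eps=1e-12):
    """Eq. (1): D_KL(P||Q) = sum_i P(i) log[P(i)/Q(i)] for discrete P, Q."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    p = p / p.sum()
    q = q / q.sum()
    mask = p > 0                          # terms with P(i) = 0 contribute nothing
    q_safe = np.clip(q[mask], eps, None)  # floor to avoid division by zero
    return float(np.sum(p[mask] * np.log(p[mask] / q_safe)))

# Example: estimate P and Q from samples by binning them on shared edges.
rng = np.random.default_rng(0)
samples_p = rng.normal(0.0, 1.0, 10000)
samples_q = rng.normal(0.5, 1.2, 10000)
bins = np.linspace(-6.0, 6.0, 61)
p_hist, _ = np.histogram(samples_p, bins=bins)
q_hist, _ = np.histogram(samples_q, bins=bins)
print(kl_divergence(p_hist, q_hist))   # non-negative; zero only if P equals Q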
2.2. Local attractor radius
Let xi be a specific state on a compact attractor Ω; then the local attractor radius (LAR, RL) with respect to the state xi is defined by Li et al. (2018) as \begin{equation} \label{eq4} R_L({x}_i)=\sqrt{E(\|{x}_i-{x}\|^2)} ,\quad {x}_i,{x}\in\Omega , \ \ (4)\end{equation} where the norm $\| \|$ represents the L2-norm and E denotes the expectation. The LAR measures the root-mean-square distance between a specific state xi and all other states on an attractor. In terms of the LAR, the local attractor with respect to the state xi can be defined as the subset of all states on the attractor whose distance to xi is less than the LAR. Li et al. (2018) showed that the LAR can be used as an objective metric to quantify the local predictability limit of forecast models. In the present study, the LAR is used to define the local attractor with respect to a specific reference state and to construct the probability distributions of local reference (true) states. An example from the three-variable Lorenz system illustrates the spatial structure of the LAR over the Lorenz attractor. The three-variable Lorenz system is \begin{equation} \label{eq5} \left\{ \begin{array}{l} \dfrac{{\rm d}X}{{\rm d}t}=-\sigma X+\sigma Y \\ \dfrac{{\rm d}Y}{{\rm d}t}=rX-Y-XZ\\ \dfrac{{\rm d}Z}{{\rm d}t}=XY-bZ \end{array} \right., \ \ (5)\end{equation} where σ=10, r=28, and b=8/3, for which the system exhibits chaotic behavior (Lorenz, 1963). Figure 1 shows a projection of the LAR over the Lorenz attractor in the x-y plane. The LAR varies widely over the attractor, with a minimum of ~15 and a maximum exceeding 35. The LAR is not randomly distributed but exhibits a distinct organization in phase space, consistent with the results of Li et al. (2018). The LAR is antisymmetric with respect to the x- or y-axis, with minimum values at the intersection of the two wings and maximum values at the outermost rims. As the LAR varies over the attractor, the local attractor with respect to a specific state also changes with that state.
Figure 1. Projection of the LAR over the Lorenz attractor in the x-y plane.
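For readers wishing to reproduce a figure like Fig. 1, the sketch below estimates the LAR of Eq. (4) by integrating the Lorenz system (5) and computing the root-mean-square distance from a chosen state to states sampled on the attractor. The integration scheme (fourth-order Runge-Kutta), time step, spin-up length, and sample size are assumptions made for illustration.

import numpy as np

SIGMA, R, B = 10.0, 28.0, 8.0 / 3.0    # parameters of Eq. (5)

def lorenz(state):
    x, y, z = state
    return np.array([-SIGMA * x + SIGMA * y,
                     R * x - y - x * z,
                     x * y - B * z])

def rk4_step(state, dt=0.01):
    """One fourth-order Runge-Kutta step of the Lorenz system."""
    k1 = lorenz(state)
    k2 = lorenz(state + 0.5 * dt * k1)
    k3 = lorenz(state + 0.5 * dt * k2)
    k4 = lorenz(state + dt * k3)
    return state + dt / 6.0 * (k1 + 2.0 * k2 + 2.0 * k3 + k4)

def sample_attractor(n_states, spinup=5000, dt=0.01):
    """Discard a transient, then collect n_states points on the attractor."""
    state = np.array([-5.76, -0.29, 30.5])
    for _ in range(spinup):
        state = rk4_step(state, dt)
    states = np.empty((n_states, 3))
    for i in range(n_states):
        state = rk4_step(state, dt)
        states[i] = state
    return states

def local_attractor_radius(x_i, attractor_states):
    """Eq. (4): R_L(x_i) = sqrt(E ||x_i - x||^2) over attractor states x."""
    sq_dist = np.sum((attractor_states - x_i) ** 2, axis=1)
    return np.sqrt(sq_dist.mean())

attractor = sample_attractor(50000)
print(local_attractor_radius(attractor[0], attractor))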
2.3. Calculation of the KL divergence in ensemble forecasting
The KL divergence defined in Eqs. (1)-(3) quantifies the difference between two probability distributions, P and Q. To compute the KL divergence in ensemble forecasting, it is necessary to estimate the probability distribution of local reference (true) states (hereafter P) and the probability distribution of ensemble forecasts (hereafter Q). For a specific reference state xi, we first calculate the LAR of xi. We can then obtain the subset of all states on the attractor whose distance to the reference state is less than the LAR. Finally, the probability distribution P of local reference (true) states is obtained from this subset of states on the local attractor. When N random perturbations are added to or subtracted from the reference state, N ensemble forecasts can be generated by the prediction model, from which the probability distribution Q of ensemble forecasts is obtained. Once both P and Q are available, the KL divergence can be computed directly. Because the reference state and the ensemble forecasts change with the forecast time, the KL divergence also varies with the forecast time. By examining the evolution of the KL divergence with the forecast time, we can quantitatively estimate the predictability limit of ensemble forecasting.
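The procedure above can be sketched as follows in Python, reusing the kl_divergence, rk4_step, sample_attractor, and local_attractor_radius helpers from the earlier sketches. The ensemble size, perturbation amplitude, and the choice to histogram only the X variable on shared bins are illustrative simplifications, not prescriptions from the text.

import numpy as np

def kl_vs_forecast_time(x0, attractor, n_members=1000, eps=1e-3,
                        n_steps=2000, dt=0.01,
                        bins=np.linspace(-25.0, 25.0, 51)):
    """Evolution of D_KL(P||Q) with forecast time for one reference state x0."""
    rng = np.random.default_rng(0)
    # Random initial perturbations of amplitude eps superimposed on x0.
    pert = rng.normal(size=(n_members, 3))
    pert *= eps / np.linalg.norm(pert, axis=1, keepdims=True)
    members = x0 + pert
    truth = np.array(x0, dtype=float)
    kl_curve = np.empty(n_steps)
    for t in range(n_steps):
        truth = rk4_step(truth, dt)
        members = np.array([rk4_step(m, dt) for m in members])
        # P: states on the local attractor of the current reference state.
        r_l = local_attractor_radius(truth, attractor)
        local = attractor[np.linalg.norm(attractor - truth, axis=1) < r_l]
        p_hist, _ = np.histogram(local[:, 0], bins=bins)
        # Q: the ensemble forecasts at the same forecast time.
        q_hist, _ = np.histogram(members[:, 0], bins=bins)
        kl_curve[t] = kl_divergence(p_hist, q_hist)
    return kl_curve

# Usage: kl = kl_vs_forecast_time(attractor[0], attractor)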
2.4. Nonlinear local Lyapunov exponent method
The nonlinear local Lyapunov exponent (NLLE), which is a nonlinear extension of the existing linear finite-time or local Lyapunov exponents (Yoden and Nomura, 1993; Boffetta et al., 1998; Ziehmann et al., 2000), measures the mean growth rate of initial errors in nonlinear dynamical systems without linearizing the nonlinear equations of motion (Ding and Li, 2007; Ding et al., 2008a; Li and Ding, 2011). The NLLE and its derivative (i.e., the mean relative growth of the initial error) have been widely applied to quantitatively determine the limit of dynamic predictability of weather and climate variables (Ding et al., 2008b, 2010, 2011, 2015), exhibiting superior performance to the existing linear finite-time or local Lyapunov exponents. A brief description of the NLLE method is given in Appendix A. Note that the NLLE method is based on nonlinear error dynamics, whereas the KL divergence is based on probability and information theory, so some differences exist between the two methods. For example, the NLLE method uses the root-mean-square error as the measure of error and therefore depends on the dimension of the variables. In contrast, the KL divergence uses the difference between two probability distributions as the measure of uncertainty and therefore does not depend on the dimension of the variables. This may be one advantage of the KL divergence over the NLLE method. Nevertheless, although the NLLE method determines the predictability limit from the evolution of initial errors while the KL divergence determines it from the evolution of forecast probability distributions, the predictability limit is an intrinsic property of a given dynamical system that does not depend on the specific method used (Lorenz, 1969; Mu et al., 2017), so the predictability limits of ensemble forecasting derived from the KL divergence and from error evolution should be consistent (see Fig. 2). We therefore compare the predictability limits of ensemble forecasting derived from the KL divergence and the NLLE; their consistency would support the effectiveness of the KL divergence in measuring the predictability of ensemble forecasting.
Figure 2. For the initial state on the Lorenz attractor x01 (-5.76, -0.29, 30.5): (a) the KL divergence and (b) the mean error growth obtained using the NLLE method with ε = 10^-3, as functions of time t. In (a), the time at which the KL divergence reaches its maximum value is indicated by the red dashed line. In (b), the average value of the nonlinear stochastic fluctuation states of the mean error is indicated by the black dashed line, and the time at which the error growth enters the nonlinear stochastic fluctuation states is indicated by the red dashed line.
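A minimal sketch of the consistency check in Fig. 2, assuming a KL divergence curve and a mean error-growth curve sampled at a constant interval dt: the local predictability limit is read off (a) as the time at which the KL divergence reaches its maximum and (b) as the time at which the mean error growth first reaches the average value of its nonlinear stochastic fluctuation states. Treating the last 20% of the record as the saturated segment is an assumption made for illustration.

import numpy as np

def limit_from_kl(kl_curve, dt=0.01):
    """Time at which the KL divergence curve reaches its maximum (Fig. 2a)."""
    return np.argmax(kl_curve) * dt

def limit_from_error_growth(mean_error, dt=0.01, saturated_fraction=0.2):
    """Time at which the mean error first reaches its saturation level (Fig. 2b)."""
    mean_error = np.asarray(mean_error, dtype=float)
    n_sat = max(1, int(len(mean_error) * saturated_fraction))  # assumed saturated tail
    saturation_level = mean_error[-n_sat:].mean()
    return np.argmax(mean_error >= saturation_level) * dt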
APPENDIX A
Introduction to the NLLE method
Consider a general n-dimensional nonlinear dynamical system whose evolution is governed by \begin{equation} \frac{{\rm d}{x}}{{\rm d}t}={F}({x}) , \ \ (A1)\end{equation} where x=[x1(t),x2(t),…,xn(t)]T is the state vector at time t, the superscript T denotes the transpose, and F represents the dynamics. The evolution of a small error δ=[δ1(t),δ2(t),…,δn(t)]T, superimposed on a state x, is governed by the following nonlinear equation: \begin{equation} \frac{{\rm d}{\delta}}{{\rm d}t}={J}({x}){\delta}+{G}({x},{\delta}) , \ \ (A2)\end{equation} where J(x)δ are the tangent linear terms and G(x,δ) are the high-order nonlinear terms of the error δ. Without a linear approximation, the solutions of Eq. (A2) can be obtained by numerical integration along the reference solution x from t=t0 to t0+τ: \begin{equation} {\delta}_1={\eta}({x}_0,{\delta}_0,\tau){\delta}_0 , \ \ (A3) \end{equation} where δ1=δ(t0+τ), x0=x(t0), δ0=δ(t0), and η(x0,δ0,τ) is the nonlinear propagator. The NLLE is then defined as \begin{equation} \lambda({x}_0,{\delta}_0,\tau)=\frac{1}{\tau}\ln\frac{\|{\delta}_1\|}{\|{\delta}_0\|} , \ \ (A4)\end{equation} where λ(x0,δ0,τ) depends in general on the initial state x0 in phase space, the initial error δ0, and the time τ. The NLLE differs from the existing local or finite-time Lyapunov exponents defined from linear error dynamics, which depend solely on the initial state x0 and the time τ, and not on the initial error δ0. Assuming that all initial perturbations have amplitude ε and random directions, i.e., that they lie on an n-dimensional spherical surface centered at the initial point x0, we have \begin{equation} {\delta}_0^{\rm T}{\delta}_0=\varepsilon^2 . \ \ (A5) \end{equation} The local ensemble mean of the NLLE over a large number of random initial perturbations is given by \begin{equation} \bar{\lambda}({x}_0,\tau)=\langle{\lambda({x}_0,{\delta}_0,\tau)}\rangle_N , \ \ (A6)\end{equation} where $\langle\ \rangle_N$ denotes the local ensemble average over samples of sufficiently large size N ($N\to\infty$). Here, $\bar\lambda(x_0,\tau)$ characterizes the average growth rate of random perturbations superimposed on x0 within a finite time τ. For a fixed time τ, $\bar\lambda(x_0,\tau)$ depends on x0 and reflects the local error growth dynamics of the attractor. The mean local relative growth of the initial error can be obtained by \begin{equation} \bar{E}({x}_0,\tau)=e^{[\bar{\lambda}({x}_0,\tau)\tau]} . \ \ (A7) \end{equation} For a given initial state x0, $\bar{E}(x_0,\tau)$ initially increases with time τ and eventually reaches a state of nonlinear stochastic fluctuation, which means that the error growth has saturated at a constant average value. At that point, almost all information on the initial state is lost and the prediction becomes meaningless. If the local predictability limit is defined as the time at which the error reaches the average value of the nonlinear stochastic fluctuation states, the predictability limit of the system at x0 can be quantitatively determined.
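Under the same illustrative assumptions as the earlier sketches (and reusing rk4_step), the Python code below estimates Eqs. (A4)-(A7) for the Lorenz system by integrating both the reference and the perturbed trajectories with the full nonlinear model; the ensemble size, perturbation amplitude, and time step are assumptions.

import numpy as np

def mean_relative_growth(x0, eps=1e-3, n_members=1000, n_steps=2000, dt=0.01):
    """Mean relative growth E_bar(x0, tau) of Eq. (A7) as a function of tau."""
    rng = np.random.default_rng(0)
    # Eq. (A5): random perturbations of amplitude eps on a sphere around x0.
    pert = rng.normal(size=(n_members, 3))
    pert *= eps / np.linalg.norm(pert, axis=1, keepdims=True)
    ref = np.array(x0, dtype=float)
    perturbed = x0 + pert
    e_bar = np.empty(n_steps)
    for t in range(n_steps):
        ref = rk4_step(ref, dt)
        perturbed = np.array([rk4_step(m, dt) for m in perturbed])
        tau = (t + 1) * dt
        # Eq. (A4) for each member, then the local ensemble mean, Eq. (A6).
        lam = np.log(np.linalg.norm(perturbed - ref, axis=1) / eps) / tau
        e_bar[t] = np.exp(lam.mean() * tau)          # Eq. (A7)
    return e_bar

# Usage: e_bar = mean_relative_growth(np.array([-5.76, -0.29, 30.5]))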