删除或更新信息,请邮件至freekaoyan#163.com(#换成@)

Some methods of parameter estimation for stochastic differential equations

本站小编 Free考研考试/2021-12-25

蔡昕芮, 王丽瑾
中国科学院大学数学科学学院, 北京 100049
摘要: 提出3种基于离散观测数据的随机微分方程参数估计的方法。第1种方法应用于线性随机微分方程。推导出这类方程的真解的相关运算服从的分布,使观测数据的运算也服从此分布,由此来估计漂移系数与扩散系数中的未知参数。第2种方法用于It?型随机微分方程。推导出Euler-Maruyama格式的数值解的相关运算服从的分布,使观测数据的运算服从此分布,由此来估计参数。第3种方法用于Stratonovich型随机微分方程。推导出中点格式的数值解的相关运算服从的分布,使观测数据的运算服从此分布,以此来估计参数。数值实验验证了这3种方法的有效性。数值实验显示,Euler-Maruyama格式参数估计的误差约为Oh0.5)阶,中点格式参数估计的误差约为Oh)阶,其中h是数值方法的时间步长。我们提出的3种估计方法均比文献中已有的EM-MLE方法更精确。
关键词: 随机微分方程参数估计Euler-Maruyama格式中点格式
Consider the stochastic differential equation (SDE)
$\begin{array}{*{20}{c}}{{\rm{d}}X\left( t \right) = f\left( {X\left( {\omega, t} \right), \theta } \right){\rm{d}}t + \sigma \left( {X\left( {\omega, t} \right), \gamma } \right){\rm{d}}B\left( t \right), }\\{X\left( {{t_0}} \right) = {X_0}, }\end{array}$ (1)
where t0tT, and ω is an element of the underlying probability space Ω. f(·) and σ(·) are known functions of X, and B(t) is a standard Brownian motion. In addition, θ and γ are unknown parameters, which will be estimated. To simplify the exposition, we first assume that the parameters θ and γ are of one-dimension, and will present the discussion of estimating multiple unknown parameters in the last section. Moreover, we divide the time interval [t0, T] into N pieces of equal length h, such that ti+1-ti=h, i=0, …, N-1, tN=T.
The problem of parameter estimation in diffusion processes based on discrete observations has been studied by some authors. Morel et al.[1] first studied estimation in discretely observed diffusions. Breton[2] used approximate maximum likelihood estimation (MLE), and studied asymptotic properties of the descritized versions of MLE for linear parametric stochastic differential equations. Dorogovcev[3] used conditional least squares estimation, and gave sufficient conditions for weak consistency of the estimator. Prakasa[4] showed that Bayes estimators and MLE estimators of SDEs have the same asymptotic properties and asymptotic distributions for suitable class of loss functions of some special types. Combining statistical methods with numerical methods, Pedersen[5] used numerical approximationsbased on iterations of the Gaussian transition densities emanating from Euler-Maruyama scheme and studied approximate maximum likelihood. Papaspiliopoulos and Beskos[6] deduced the distribution of certain operation of the numerical solution of the stochastic differential equations by Euler-Maruyama scheme, and then used the maximum likelihood method to estimate the parameters based on observations, which we call the EM-MLE estimator. Inspired by their work, we propose estimators by deducing the distributions of certain relevant operations of the exact solution, as well as numerical solutions resulted from the Euler-Maruyama and the midpoint schemes of the SDEs. Instead of the maximum likelihood method in Ref.[6], we use directly the deduced distributions together with tree-like observations to estimate the parameters. Errors of our estimators are compared with that of the EM-MLE estimator in the numerical experiments.
The contents are arranged as follows. In section 1, we derive the distribution of an operation of the exact solution of constant-coefficient homogeneous linear SDE, and deseribe the method of parameter estimation based on this distribution. In section 2 we propose two methods of parameter estimation for SDEs of It${\hat o}$ and Stratonovich senses, respectively, based on the distribution of corresponding operations of the numerical solutions produced by the Euler-Maruyama scheme and the midpoint scheme, respectively. In section 3, we perform numerical experiments to test these methods, and estimate the order of errors arising from the latter two methods. Error comparison with the EM-MLE estimator is also performed. Section 4 is devoted to the discussion of estimating multiple unknown parameters for SDEs, based on the Euler-Maruyama numerical scheme.
1 Method based on the exact solutionConsider the constant-coefficient homogeneous linear SDE
$\begin{array}{*{20}{c}}{{\rm{d}}X\left( t \right) = aX\left( t \right){\rm{d}}t + bX\left( t \right){\rm{d}}B\left( t \right), }\\{X\left( 0 \right) = {X_0} \ne 0, }\end{array}$ (2)
where a and b are unknown parameters. The exact solution of this SDE has the form
$X\left( t \right) = {X_0}\exp \left[{\left( {a-\frac{1}{2}{b^2}} \right)t + bB\left( t \right)} \right].$ (3)
Obviously,
$X\left( {{t_i}} \right) = {X_0}\exp \left[{\left( {a-\frac{1}{2}{b^2}} \right){t_i} + bB\left( {{t_i}} \right)} \right], $ (4)
$X\left( {{t_{i + 1}}} \right) = {X_0}\exp \left[{\left( {a-\frac{1}{2}{b^2}} \right){t_{i + 1}} + bB\left( {{t_{i + 1}}} \right)} \right].$ (5)
From (4) and (5), it follows
$\begin{array}{l}X\left( {{t_{i + 1}}} \right) = X\left( {{t_i}} \right)\exp \left[{\left( {a-\frac{1}{2}{b^2}} \right)h + } \right.\\\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\left. {b\left( {B\left( {{t_{i + 1}}} \right)-B\left( {{t_i}} \right)} \right)} \right], \end{array}$ (6)
where h=ti+1-ti, and
$B\left( {{t_{i + 1}}} \right) - B\left( {{t_i}} \right)\tilde{\ }N\left( {0, h} \right).$
Therefore,
$\begin{array}{l}\;\;\;\;\;\;\;\;\ln\left| {X\left( {{t_{i + 1}}} \right)} \right| - \ln\left| {X\left( {{t_i}} \right)} \right|\\ = \left( {a - \frac{1}{2}{b^2}} \right)h + b\left( {B\left( {{t_{i + 1}}} \right) - B\left( {{t_i}} \right)} \right)\\ \sim N\left( {\left( {a - \frac{1}{2}{b^2}} \right)h, {b^2}h} \right).\end{array}$ (7)
Suppose we have observations
$\begin{array}{*{20}{c}}{X\left( {{\omega _1}, {t_0}} \right), X\left( {{\omega _1}, {t_1}} \right), \cdots, X\left( {{\omega _1}, {t_N}} \right), }\\ \vdots \\{X\left( {{\omega _m}, {t_0}} \right), X\left( {{\omega _m}, {t_1}} \right), \cdots, X\left( {{\omega _m}, {t_N}} \right), }\end{array}$
where ωj ∈ Ω, X(ωj, t0)=X0, for j=1, …, m.
On the first subinterval [t0, t1], we calculate
$\begin{array}{*{20}{c}}{\ln \left| {X\left( {{\omega _1}, {t_1}} \right)} \right| - \ln \left| {X\left( {{\omega _1}, {t_0}} \right)} \right|, }\\{\ln \left| {X\left( {{\omega _2}, {t_1}} \right)} \right| - \ln \left| {X\left( {{\omega _2}, {t_0}} \right)} \right|, }\\ \vdots \\{\ln \left| {X\left( {{\omega _m}, {t_1}} \right)} \right| - \ln \left| {X\left( {{\omega _m}, {t_0}} \right)} \right|.}\end{array}$ (8)
According to the method of moments, we can approximate the total moments by the moments of the samples. Therefore, let
$\begin{array}{*{20}{l}}{\left( {a - \frac{1}{2}{b^2}} \right)h = \frac{1}{m}\sum\limits_{j = 1}^m {\ln \left| {X\left( {{\omega _1},{t_1}} \right)} \right|} - }\\{\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\ln \left| {(X\left( {{\omega _j},{t_0}} \right)} \right|,}\\{{b^2}h = \frac{1}{{m - 1}}\sum\limits_{j = 1}^m {(\ln \left| {X\left( {{\omega _i},{t_1}} \right)} \right|} - }\\{\;\;\;\;\;\;\;\;{{\left. {\ln \left| {X\left( {{\omega _j},{t_0}} \right)} \right| - \left( {a - \frac{1}{2}{b^2}} \right)h} \right)}^2}}\end{array}$
to get the estimator of a and b on the first subinterval [t0, t1], denoted by a ${\hat a_1}$ and ${\hat b_1}$.
By performing the same procedure on the subsequent small intervals [t1, t2], …, [tN-1, tN], we get a sequence of estimators $ {\hat a_2}, \cdots, {\hat a_N}$ and ${\hat b_2}, \cdots, {\hat b_N}$. Then, we estimate the parameter a by the average of $ {\hat a_1}, \cdots, {\hat a_N}$, and estimate b by the average of ${\hat b_1}, \cdots, {\hat b_N}$, that is
$\hat a = \frac{1}{N}\sum\limits_{i = 1}^N {{{\hat a}_i}}, \hat b = \frac{1}{N}\sum\limits_{i = 1}^N {{{\hat b}_i}} .$ (9)
2 Methods based on the numerical solutionsIn this section, we will give parameter estimators based on two kinds of numerical discretizations of the SDEs containing unknown parameters.
2.1 Method using the Euler-Maruyama schemeConsider the SDE of It${\hat o}$ sense (1). Denote by Xi the numerical solution at time ti. The Euler-Maruyama scheme[7] applied to (1) reads
${X_{i + 1}} - {X_i} = hf\left( {{X_i}, \theta } \right) + \left[{B\left( {{t_{i + 1}}} \right)-B\left( {{t_i}} \right)} \right]\sigma \left( {{X_i}, \gamma } \right).$ (10)
Given Xi, we know that the distribution of the increment is
${X_{i + 1}} - {X_i} \sim N\left( {hf\left( {{X_i}, \theta } \right), h{\sigma ^2}\left( {{X_i}, \gamma } \right)} \right).$ (11)
Suppose we have m observations X(ω10, t1), X(ω20, t1), …, X(ωm0, t1), each appearing after evolution of a time period h from the given initial value X0. Then due to (11), we can let
$\frac{1}{m}\sum\limits_{j = 1}^m {X\left( {\omega _j^0, {t_1}} \right)} = hf\left( {{X_0}, \theta } \right) + {X_0}$
and
$\frac{1}{{m - 1}}\sum\limits_{j = 1}^m {{{\left( {X\left( {\omega _j^0, {t_1}} \right) - hf\left( {{X_0}} \right)} \right)}^2}} = h{\sigma ^2}\left( {{X_0}, \gamma } \right), $
from which we get an estimate ${{\hat \theta }_1}$ and ${{\hat \gamma }_1}$.
Then, we fix an arbitrary X(ωk0, t1)=:X1 (k ∈ {1, …, m}), and observe other m values X(ω11, t2), X(ω21, t2), …, X(ωm1, t2), each resulting from evolving time h from the given value X(ωk0, t1)=X1. Let
$\frac{1}{m}\sum\limits_{j = 1}^m {X\left( {\omega _j^1, {t_2}} \right)} = hf\left( {{X_1}, \theta } \right) + {X_1}, $
and
$\frac{1}{{m - 1}}\sum\limits_{j = 1}^m {{{\left( {X\left( {\omega _j^1, {t_2}} \right) - hf\left( {{X_1}} \right)} \right)}^2}} = h{\sigma ^2}\left( {{X_1}, \gamma } \right).$
We obtain another estimate ${{\hat \theta }_2}$ and ${{\hat \gamma }_2}$.
Analogously, we can get successively a sequence of estimates ${{\hat \theta }_3}, \cdots, {{\hat \theta }_N}$ and ${{\hat \gamma }_3}, \cdots, {{\hat \gamma }_N}$. Then we use the average of $ {{\hat \theta }_i}$, i=1, …, N, to estimate the parameter θ and use the average of ${{\hat \gamma }_i}$, i=1, …, N, to estimate the parameter γ. That is
$\hat \theta = \frac{1}{N}\sum\limits_{i = 1}^N {{{\hat \theta }_i}}, \hat \gamma = \frac{1}{N}\sum\limits_{i = 1}^N {{{\hat \gamma }_i}} .$ (12)
2.2 Method using the midpoint schemeFor the SDE of Stratonovich sense
$\begin{array}{*{20}{c}}{{\rm{d}}X\left( t \right) = f\left( {X\left( {\omega, t} \right), \theta } \right){\rm{d}}t + \sigma \left( {X\left( {\omega, t} \right), \gamma } \right) \circ {\rm{d}}B\left( t \right), }\\{X\left( {{t_0}} \right) = {X_0}, }\end{array}$ (13)
the midpoint scheme is a kind of consistent numerical method (see Ref.[7]). If θ and γ are unknown parameters, we apply the midpoint scheme to (13), and simulate B(ti+1)-B(ti) by ${\xi _i}\sqrt h $, where ξi is a random variable of standard normal distribution. We get
$\begin{array}{l}{X_{i + 1}} - {X_i} = f\left( {\frac{{{X_{i + 1}} + {X_i}}}{2}, \theta } \right)h + \\\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\sigma \left( {\frac{{{X_{i + 1}} + {X_i}}}{2}, \gamma } \right){\xi _i}\sqrt h .\end{array}$ (14)
Since scheme (14) is usually implicit, we may need to use predictor-corrector type iterations, as well as proper truncation of the ξi to compute an approximation ${{\tilde X}_{i + 1}}$ of Xi+1. Then we have
$\frac{{{{\tilde X}_{i + 1}} - {X_i} - f\left( {\frac{{{{\tilde X}_{i + 1}} + {X_i}}}{2}, \theta } \right)h}}{{\sigma \left( {\frac{{{{\tilde X}_{i + 1}} + {X_i}}}{2}, \gamma } \right)\sqrt h }} \approx {\xi _i} \sim N\left( {0, 1} \right), $ (15)
where Xi is given by last iteration step.
We use the same observation as described in section 2.1. According to (15), for the first time interval [t0, t1], we can let
$\begin{array}{*{20}{c}}{\frac{1}{m}\sum\limits_{j = 1}^m {\frac{{X\left( {\omega _j^0, {t_1}} \right) - {X_0} - f\left( {\frac{{X\left( {\omega _j^0, {t_1}} \right) + {X_0}}}{2}, \theta } \right)}}{{\sigma \left( {\frac{{X\left( {\omega _j^0, {t_1}} \right) + {X_0}}}{2}, \gamma } \right)\sqrt h }} = 0, } }\\{\frac{1}{{m - 1}}\sum\limits_{j = 1}^m {{{\left( {\frac{{X\left( {\omega _j^0, {t_1}} \right) - {X_0} - f\left( {\frac{{X\left( {\omega _j^0, {t_1}} \right) + {X_0}}}{2}, \theta } \right)h}}{{\sigma \left( {\frac{{X\left( {\omega _j^0, {t_1}} \right) + {X_0}}}{2}, \gamma } \right)\sqrt h }}} \right)}^2}} }\\{ = 1, }\end{array}$
from which we can get an estimate ${{\hat \theta }_1}$ and ${{\hat \gamma }_1}$. Performing the same procedure for the time intervals [t1, t2], …, [tN-1, tN], we obtain a sequence of estimates ${{\hat \theta }_2}, \cdots, {{\hat \theta }_N}$ and $ {{\hat \gamma }_2}, \cdots, {{\hat \gamma }_N}$. We estimate θ and γ by
$\hat \theta = \frac{1}{N}\sum\limits_{i = 1}^N {{{\hat \theta }_i}}, \hat \gamma = \frac{1}{N}\sum\limits_{i = 1}^N {{{\hat \gamma }_i}} .$ (16)
Alternatively, we can transform the SDE of Stratonovich sense (13) into its equivalent It${\hat o} $ form
$\begin{array}{*{20}{c}}{{\rm{d}}X\left( t \right) = \left( {f\left( {X\left( {\omega, t} \right), \theta } \right) + \frac{1}{2}\frac{{\partial \sigma }}{{\partial X}} \cdot } \right.}\\{\left. {\sigma \left( {X\left( {\omega, t} \right), \gamma } \right)} \right){\rm{d}}t + \sigma \left( {X\left( {\omega, t} \right), \gamma } \right){\rm{d}}B\left( t \right), }\\{X\left( {{t_0}} \right) = {X_0}, }\end{array}$ (17)
and then use the method given in section 2.1 to estimate the parameters. We will compare the results of the two approaches based on Stratonovich and It${\hat o} $ equations, respectively, by numerical tests as follows.
3 Numerical tests3.1 Estimation based on the exact solutionFor the linear SDE (2), let t0=0, T=1, X0=1. Suppose a and b are given. a and b will be estimated by using the method given in section 1 to test the ability of our estimator.We set the number of subintervals of time to be N=500 and the number of sample path observations to be m=100, and we perform five tests for five different groups of a and b values. The observation data are simulated by the exact solution (3). Our five group of numerical results ${\hat a}$ and ${\hat b}$, with the corresponding a and b, are listed in Table 1.
Table 1
Table 1 Estimates based on the exact solution
a b ${\hat a}$ ${\hat b}$
1 0.5 0.984 2 0.497 7
2 2 2.018 9 1.994 7
3 1 2.984 7 1.004 1
4 2 4.025 5 1.992 3
5 3 5.010 9 2.995 8

Table 1 Estimates based on the exact solution

In Fig. 1(a), we plot the points (a, b) denoted by 'o', and the corresponding estimates ($\hat a, \hat b$) denoted by '*' in the five tests. Fig. 1(a) shows very good coincidence between the exact value and the estimates.
Fig. 1
Download: JPG
larger image


Fig. 1 Exact values and estimates of the parameter (a, b)

3.2 Estimation based on the numerical solutionsTo test the effect of the parameter estimations based on the Euler-Maruyama and the midpoint schemes, we first consider the linear SDE of Stratonovich sense
$\begin{array}{*{20}{c}}{{\rm{d}}X\left( t \right) = aX\left( t \right){\rm{d}}t + bX\left( t \right) \circ {\rm{d}}B\left( t \right), }\\{X\left( 0 \right) = {X_0} \ne 0, }\end{array}$ (18)
and its equivalent It${\hat o}$ equation
$\begin{array}{*{20}{c}}{{\rm{d}}X\left( t \right) = \left( {a + \frac{1}{2}{b^2}} \right)X\left( t \right){\rm{d}}t + bX\left( t \right){\rm{d}}B\left( t \right), }\\{X\left( 0 \right) = {X_0} \ne 0.}\end{array}$ (19)
We apply the midpoint scheme to (18) and the Euler-Maruyama scheme to (19). Then we compare their effects.
The data are set as t0=0, T=1, X0=1, N=100, and m=500, and the observation data are generated by the exact solution of (18) and (19)
$X\left( t \right) = {X_0}\exp \left( {at + bB\left( t \right)} \right).$ (20)
Again, we perform five tests with five different groups (a, b), and denote the estimates by the Euler-Maruyama scheme with (${{\hat a}_e}, {{\hat b}_e}$) and that by the midpoint scheme with ($ {{\hat a}_m}, {{\hat b}_m}$). The numerical results are listed in Table 2.
Table 2
Table 2 Estimates based on the numerical solutions
a b ${{\hat a}_e}$ ${{\hat b}_e}$ $ {{\hat a}_m}$ ${{\hat b}_m}$
1 0.5 1.045 1 0.506 2 1.040 7 0.499 9
2 2 1.927 8 2.097 6 2.035 6 1.975 6
3 1 3.039 3 1.034 7 3.007 7 0.994 5
4 2 3.989 3 2.140 8 4.063 1.974 6
5 3 4.579 8 3.391 7 5.206 6 2.937 8

Table 2 Estimates based on the numerical solutions

In Fig. 1(b) we plot the five points (a, b) denoted by 'o', and the corresponding estimates (${{\hat a}_e}, {{\hat b}_e}$) denoted by '*' and (${{\hat a}_m}, {{\hat b}_m}$) denoted by '△'. It can be seen that the midpoint scheme applied to the Stratonovich SDE gives better estimates than the Euler-Maruyama scheme applied to the It${\hat o}$ SDE.
3.3 Error analysisIn this section, we analyze the relation between the error of estimate $e: = \sqrt {{{\left( {a-\hat a} \right)}^2} + {{\left( {b-\hat b} \right)}^2}} $ and the time step size h of the numerical solutions using the Euler-Maruyama scheme and the midpoint scheme. Consider the linear SDE of It${\hat o}$ sense (19) and its equivalent SDE of Stratonovich sense (18), for which we apply the Euler-Maruyama scheme and the midpoint scheme, respectively.
Set X0=1, t0=0, T=1, a=3, b=1, and the number of observed sample paths m=500. We test four different values of the time step size of the numerical solutions, namely, h=0.02, 0.04, 0.05, and 0.1. In Table 3 listed are the estimated values (${{\hat a}_e}, {{\hat b}_e}$) resulted from the Euler-Maruyama scheme and (${{\hat a}_m}, {{\hat b}_m}$) resulted from the midpoint scheme, together with the corresponding time step sizes h.
Table 3
Table 3 Estimates and the corresponding h values
h a b $ {{\hat a}_e}$ $ {{\hat b}_e}$ $ {{\hat a}_m}$ $ {{\hat b}_m}$
0.02 3 1 3.062 6 1.075 1 3.001 6 0.990 9
0.04 3 1 3.060 7 1.173 5 2.952 4 0.993 3
0.05 3 1 3.055 6 1.182 8 2.913 8 0.974 2
0.1 3 1 3.125 1.476 7 2.911 0.966 6

Table 3 Estimates and the corresponding h values

In Table 3, we observe an inereasing tendency of errore with h for both numerical schemes. For a clearer demonstration, we plot lne against lnh for the four different values of h and the corresponding errors e in Fig. 2, with Fig. 2(a) for the Euler-Maruyama scheme and Fig. 2(b) for the midpoint scheme. In Fig. 2, we can observe the order of the error with respect to h. The dashed reference line in Fig. 2(a) is of slope 0.5. The near parallelism between the reference line and the plotted lne-lnh line indicates that the order of the error e for the Euler-Maruyama scheme is about O(h0.5). Analogously, it can be inferred from Fig. 2(b) that the error e for the midpoint scheme is of order O(h), since the dashed reference line in Fig. 2(b) is of slope 1.
Fig. 2
Download: JPG
larger image


Fig. 2 Error orders with respect to h

Meanwhile, it is interesting to note that the Euler-Maruyama method applied to the SDE (19) is of root mean-square convergence order 0.5, while the midpoint scheme applied to the SDE (18) is of root mean-square convergence order 1 ([7, 8]). Three facts imply that the estimate errors based on the numerical solutions may mainly result from the error of the numerical methods themselves. More accurate estimators might be obtained by applying higher-order numerical methods.
3.4 Error comparison with the EM-MLE estimatorFor the linear SDE (2), we apply our estimator based on the exact solution, which we call the exact solution estimator, and the EM-MLE estimator [6] to estimate the parameters (a, b), which are given as a=3, b=1. Moreover, set X0=1, t0=0, and T=1. We perform four tests with four different values of h, namely, h=0.02, 0.04, 0.05, and 0.1, and observe m=100 sample paths for each. Denote the estimate arising from our method by ($\hat a, \hat b$), and its error by $e: = \sqrt {{{\left( {a-\hat a} \right)}^2} + {{\left( {b-\hat b} \right)}^2}} $, and those from the EM-MLE estimator by (${{\hat a}_M}, {{\hat b}_M}$) and ${e_M}: = \sqrt {{{\left( {a-{{\hat a}_M}} \right)}^2} + {{\left( {b-{{\hat b}_M}} \right)}^2}} $, respectively. In Table 4 listel are the numerical results.
Table 4
Table 4 Exact solution and EM-MLE estimators
h ${\hat a}$ ${\hat b}$ ${{{\hat a}_M}}$ ${{{\hat b}_M}}$ e ${{{\hat e}_M}}$
0.02 3.048 1.101 3.329 1.122 0.049 0.351
0.04 2.925 1.006 2.505 1.049 0.074 0.497
0.05 2.901 0.985 2.492 1.238 0.099 0.560
0.1 3.109 6 1.017 4.578 1.044 0.110 1.579

Table 4 Exact solution and EM-MLE estimators

Table 4 shows that our exact solution estimator is more accurate than the EM-MLE estimator.
To compare our numerical solution estimators with the EM-MLE estimator, we consider the linear SDE of Stratonovich sense (18), to which we apply the midpoint scheme estimator, and the equivalent It${\hat o}$ SDE (19), for which we use the Euler-Maruyama estimator and the EM-MLE estimator. Set a=3, b=1, X0=1, t0=0, and T=1. Again we perform four tests with h=0.02, 0.04, 0.05, and 0.1, respectively, and m=100 for each.
In Table 5, (${{\hat a}_e}, {{\hat b}_e}$), (${{\hat a}_m}, {{\hat b}_m}$), and (${{\hat a}_M}, {{\hat b}_M}$) denote the estimates arising from the Euler-Maruyama, midpoint, and the EM-MLE estimators, respectively, and ee, em, and eM represent their errors, respectively. It is obvious that our numerical solution estimators are more accurate than the EM-MLE estimator as well.
Table 5
Table 5 Numerical solution and EM-MLE estimators
h ${{\hat a}_e}$ ${{\hat b}_e}$ ${{\hat a}_m}$ ${{\hat b}_m}$ ${{\hat a}_M}$ ${{\hat b}_M}$ ee em eM
0.02 3.007 1.134 3 2.972 7 0.981 7 2.990 8 0.889 5 0.134 4 0.032 9 0.110 9
0.04 3.113 1.075 1 3.05 0.990 8 3.414 1.006 9 0.135 7 0.050 9 0.414
0.05 3.103 9 1.214 7 2.893 9 0.973 8 2.250 9 0.984 2 0.238 5 0.109 3 0.749 3
0.1 2.955 6 1.5093 2.829 9 0.968 4.414 3 1.094 9 0.511 2 0.173 1 1.417 5

Table 5 Numerical solution and EM-MLE estimators

3.5 Numerical tests on non-linear SDEsTo test the effect of our estimators on non-linear SDEs, we consider the following SDE
$\begin{array}{*{20}{c}}{X\left( t \right) = a{X^2}\left( t \right){\rm{d}}t + b{X^2}\left( t \right) \circ {\rm{d}}B\left( t \right), }\\{X\left( {{t_0}} \right) = {X_0}, }\end{array}$ (21)
and its equivalent It${\hat o}$ form
$X\left( t \right) = \left( {a{X^2}\left( t \right) + {b^2}{X^3}\left( t \right)} \right){\rm{d}}t + b{X^2}\left( t \right){\rm{d}}B\left( t \right), $ (22)
where a and b are unknown parameters to be estimated.
We apply the midpoint estimator to (21), and the Euler-Maruyama estimator to (22). Set X0=0.5, t0=0, T=1, and h=0.02. We use the midpoint scheme with tiny step size h=0.001 to (21) to simulate the true solution of the system which serves as observation data for our tests, and we conduct m=500 samples. The estimation results with different sets of a and b are listed in Table 6, where ${{\hat a}_e} $ and ${{\hat b}_e}$ are from the Euler-Maruyama estimator and ${{\hat a}_m}$ and $ {{\hat b}_m}$ are from the midpoint estimator.
Table 6
Table 6 Estimates with different a and b values
a b ${{\hat a}_e}$ ${{\hat b}_e}$ ${{\hat a}_m}$ ${{\hat b}_m}$
1 0.5 1.103 0.511 8 0.992 4 0.510 2
2 2 1.917 3 1.965 3 2.045 1 1.976 7
3 1 2.858 1.072 8 3.146 3 0.992 2
4 2 3.881 4 2.103 8 4.104 5 1.978 1
5 3 4.627 7 3.260 2 5.179 4 2.893 6

Table 6 Estimates with different a and b values

Again, we plot the five groups of points (a, b), (${{\hat a}_e}, {{\hat b}_e}$), and (${{\hat a}_m}, {{\hat b}_m}$) in Fig. 3. It can be seen that the midpoint estimator is more accurate than the Euler-Maruyama estimator.
Fig. 3
Download: JPG
larger image


Fig. 3 Exact values and estimates of the parameter (a, b) in the nonlinear SDE (21) or (22)

Then, we fixa=1 and b=2, but take h=0.02, 0.04, 0.05, and 0.1, to observe the order of the error $e: = \sqrt {{{\left( {a-\hat a} \right)}^2} + {{\left( {b-\hat b} \right)}^2}} $ of the estimators with respect to h. Once more we set X0=0.5, t0=0, T=1, and m=500. The numerical results are given in Table 7, where the items with lower-index e are from the Euler-Maruyama estimator, while those with m are from the midpoint estimator.
Table 7
Table 7 Estimates and errors
h ${{\hat a}_e}$ ${{\hat b}_e}$ ${{\hat a}_m}$ ${{\hat b}_m}$ ee em
0.02 1.048 9 2.060 4 0.998 2 1.986 5 0.077 7 0.013 6
0.04 1.076 2 2.098 5 0.981 4 1.960 2 0.124 5 0.043 9
0.05 1.105 3 2.185 6 0.974 2 1.897 8 0.213 4 0.105 4
0.1 1.174 2.260 2 1.158 3 2.144 7 0.313 0.214 5

Table 7 Estimates and errors

Figure 4(a) and Fig. 4(b) illustrate the ln|error| ~ ln h relationships arising from the Euler-Maruyama and the midpoint estimator, respectively.
Fig. 4
Download: JPG
larger image


Fig. 4 Error orders with respect to h of non-linear SDE estimators

It is indicated that for the non-linear SDE (22) or its equivalent equation (21), the order of the error ee produced by the Euler-Maruyama estimator is about O(h0.5) while that by the midpoint estimator is about O(h). This is the same as the indication for the numerical results for the linear SDE in the previous subsections.
3.6 A remarkIt is observed from the above-performed numerical tests that, with the growth of the order of the numerical method, the estimation accuracy is improved. It is then naturally expected that higher order methods give better estimations. However, in high-order stochastic numerical schemes such as the weak order 2 It${\hat o}$ -Taylor scheme (see e.g., Ref.[7]), high order multiple stochastic integrals such as $\int\limits_{{t_n}}^{{t_{n + 1}}} {\int_{{t_n}}^s {{\rm{d}}B\left( u \right){\rm{d}}s} } $ will appear, together with B(tn+1)-B(tn), where B(t) denotes the standard Brownian motion. It therefore becomes complicated to derive an appropriate calculation with certain distribution from the numerical scheme. Nevertheless, this could be an interesting topic deserving further investigations.
4 Estimating multiple parametersUp to now we have considered estimation of 2 unknown parameters appearing in the drift and diffusion coefficients of a SDE. Actually, our methods can be extended to estimate multiple unknown parameters. Consider the following SDE
$\begin{array}{*{20}{c}}{{\rm{d}}X\left( t \right) = f\left( {X\left( {\omega, t} \right), {\theta _1}, \cdots, {\theta _p}} \right){\rm{d}}t + }\\{\sigma \left( {X\left( {\omega, t} \right), {\gamma _1}, \cdots, {\gamma _q}} \right){\rm{d}}B\left( t \right), }\\{X\left( {{t_0}} \right) = {X_0}, }\end{array}$ (23)
where θ1, …, θp, γ1, …, γq are (p+q) unknown parameters to be estimated. Applying the Euler-Maruyame scheme to the SDE (23), we have
${X_{i + 1}} - {X_i} \sim N\left( {hf\left( {{X_i}, \Theta } \right), h{\sigma ^2}\left( {{X_i}, \Gamma } \right)} \right), $ (24)
where Θ:=(θ1, …, θp), Γ:=(γ1, …, γq).
Given m observations at time $ {t_1} = {t_0} + h, X\left( {\omega _1^0, {t_1}} \right), X\left( {\omega _2^0, {t_1}} \right), \cdots, X\left( {\omega _m^0, {t_1}} \right)$, we let
$\begin{array}{*{20}{c}}{\frac{1}{m}\sum\limits_{j = 1}^m {X\left( {\omega _j^0, {t_1}} \right)} = hf\left( {{X_0}, \Theta } \right) + {X_0}, }\\{\frac{1}{{m - 1}}\sum\limits_{j = 1}^m {{{\left( {X\left( {\omega _j^0, {t_1}} \right) - {X_0} - hf\left( {{X_0}, \Theta } \right)} \right)}^2}} }\\{ = h{\sigma ^2}\left( {{X_0}, \Gamma } \right), }\\{\frac{1}{m}\sum\limits_{j = 1}^m {X{{\left( {\omega _j^0, {t_1}} \right)}^3}} = E\left( {X_1^3} \right) = \int\limits_{ - \infty }^{ + \infty } {{x^3}{\rho _1}\left( x \right){\rm{d}}x}, }\\ \vdots \\{\frac{1}{m}\sum\limits_{j = 1}^m {X{{\left( {\omega _j^0, {t_1}} \right)}^{p + q}}} = E\left( {X_1^{p + q}} \right) = \int\limits_{ - \infty }^{ + \infty } {{x^{p + q}}{\rho _1}\left( x \right){\rm{d}}x}, }\end{array}$ (25)
where ${\rho _1}\left( x \right) = \frac{1}{{\sqrt {2{\rm{\pi }}} {d_1}}}\exp \left( {-\frac{{{{\left( {x-{c_1}} \right)}^2}}}{{2d_1^2}}} \right) $, with c1:=hf(X0, Θ)+X0, d12:=h σ2(X0, Γ). The E(X1k) (k=3, …, p+q) can be calculated in terms of the characteristic function of X1, that is,
$\begin{array}{l}g\left( t \right) = E{{\rm{e}}^{{\rm{i}}t{X_1}}} = \frac{1}{{\sqrt {2{\rm{\pi }}} {d_1}}}\int\limits_{ - \infty }^{ + \infty } {{{\rm{e}}^{{\rm{i}}tx}}\exp \left( { - \frac{{{{\left( {x - {c_1}} \right)}^2}}}{{2d_1^2}}} \right){\rm{d}}x} \\\;\;\;\;\;\;\; = {{\rm{e}}^{{\rm{i}}{c_1}t - \frac{{d_1^2{t^2}}}{2}}}, \end{array}$ (26)
with the property that the k-th derivative of g at t=0 is g(k)(0)=ikE(X1k).
There are (p+q) unknowns and (p+q) equations in the equations system (25), which is usually non-linear and might need iteration methods for solving the unknowns $\hat \theta _1^1, \cdots, \hat \theta _p^1, \hat \gamma _1^1, \cdots, \hat \gamma _q^1$.
Given observations at time t2=t0+2h, …, tN=t0+Nh, we write similar equations systems regarding the p+q moments of Xi with probability density function ${\rho _i}\left( x \right) = \frac{1}{{\sqrt {2{\rm{\pi }}} {d_i}}}\exp \left( {-\frac{{{{\left( {x-{c_i}} \right)}^2}}}{{2d_i^2}}} \right) $, where ci=hf(Xi-1, Θ)+Xi-1, di2=2(Xi-1, Γ), for i=2, …, N, and get the corresponding estimations $\hat \theta _1^2, \cdots, \hat \theta _p^2, \hat \gamma _1^2, \cdots, \hat \gamma _q^2, \cdots, \hat \theta _1^N, \cdots, \hat \theta _p^N, \hat \gamma _1^N, \cdots, \hat \gamma _q^N $. Then we set
${{\hat \theta }_i} = \frac{1}{N}\sum\limits_{l = 1}^N {\hat \theta _i^l}, {{\hat \gamma }_i} = \frac{1}{N}\sum\limits_{l = 1}^N {\hat \gamma _i^l}, $ (27)
for i=1, …, p, j=1, …, q.
Note that the observation data are the same tree-like data as described in section 3.1, wherefore the Xi is known when we calculate the moments of Xi+1, for i=0, …, N-1.
References
[1] Morel J M, Takens F, Teissier B. Parameter estimation in stochastic differential equations[M].Berlin Heidelberg: Springer-Verlag, 2008: 6-10.
[2] Breton A L. On continuous and discrete sampling for parameter estimation in diffusion type processes[M].Berlin Heidelberg: Springer, 1976: 124-144.
[3] Dorogovcev A J. The asymptotic properties of least square estimators for regression coefficients[J].Problemy Peredai Informacii, 1973, 4:49–57.
[4] Prakasa R. On Bayes estimation for diffusion fields[M].North Holland: Statistical Publishing Society, 1984: 575-590.
[5] Pedersen A R. A new approach to maximum likelihood estimation for stochatic differential equations based on discrete observations[J].Scandinavian Journal of Statistics, 1995, 22(1):55–71.
[6] Papaspiliopoulos O, Beskos A. An introduction to modelling and likelihood inference with stochastic differential equations. Great Britain: Warwick, 2011.
[7] Kloeden P E, Platen E. Numerical solution of stochastic differential equations[M].Berlin Heidelberg: Springer-Verlag, 1992: 305.
[8] Milstein G N, Tretyakov M V. Stochastic numerics for mathematical physics[M].Berlin Heidelberg: Springer, 2004: 9-10.


相关话题/图片 观测 数据 实验 科学学院

  • 领限时大额优惠券,享本站正版考研考试资料!
    大额优惠券
    优惠券领取后72小时内有效,10万种最新考研考试考证类电子打印资料任你选。涵盖全国500余所院校考研专业课、200多种职业资格考试、1100多种经典教材,产品类型包含电子书、题库、全套资料以及视频,无论您是考研复习、考证刷题,还是考前冲刺等,不同类型的产品可满足您学习上的不同需求。 ...
    本站小编 Free壹佰分学习网 2022-09-19
  • 咸淡水界面空间分布特征演变的野外观测
    周晓妮,石建省,马荣,苗青壮,刘少玉,刘鹏飞,张薇,王哲中国地质科学院水文地质环境地质研究所,石家庄0500612016年11月07日收稿;2017年02月16日收修改稿基金项目:中国地质科学院水文地质环境地质研究所所控基金项目(SK201602)和中国地质调查局地质调查项目(DD20160241) ...
    本站小编 Free考研考试 2021-12-25
  • 砂岩吸水过程与吸水特性的核磁共振实验研究
    张倩1,2,董艳辉1,2,童少青1,21.中国科学院地质与地球物理研究所中国科学院页岩气与地质工程重点实验室,北京100029;2.中国科学院大学地球科学学院,北京1000492016年04月12日收稿;2016年09月23日收修改稿基金项目:北京市自然科学基金(8162042)和中国科学院青年创新 ...
    本站小编 Free考研考试 2021-12-25
  • 全球变化领域研究现状与趋势的大数据分析
    毛萍,黄东晓,王芋华,周华,王海燕,陈槐中国科学院成都生物研究所,成都6100412016年05月30日收稿;2016年08月29日收修改稿基金项目:中国科学院“****”项目、四川省“****”项目和四川省青年科技创新团队项目(2015TD0026)资助通信作者:陈槐,E-mail:chenhua ...
    本站小编 Free考研考试 2021-12-25
  • 基于卫星重力、卫星测高和温盐度综合数据的中国近海各区域海平面变化
    常乐1,钱安2,易爽1,徐长仪3,孙文科11.中国科学院大学,北京100049;2.防灾科技学院,北京101601;3.中国地震局地震预测研究所,北京1000362016年08月04日收稿;2016年11月17日收修改稿基金项目:国家自然科学基金(41331066,41474059),中国科学院科学 ...
    本站小编 Free考研考试 2021-12-25
  • 基于Hadoop的邮政寄递大数据分析系统设计与实现
    王卫锋,杨林中国科学院大学计算机与控制学院信息动态学与工程应用实验室,北京1000492016年09月14日收稿;2016年11月18日收修改稿通信作者:王卫锋,E-mail:ry_009@126.com摘要:面对海量邮政寄递数据,现有的构建于关系数据库上的数据仓库系统在做数据分析时具有建设成本高、 ...
    本站小编 Free考研考试 2021-12-25
  • NaCl溶液静态闪蒸瞬态传热特性的实验研究
    王朝阳,张丹,杨庆忠,王宇,严俊杰西安交通大学动力工程多相流国家重点实验室,西安7100492016年04月22日收稿;2016年07月14日收修改稿基金项目:国家自然科学基金(51306148,51436006)资助通信作者:严俊杰,E-mail:yanjj@xjtu.edu.cn摘要:对NaCl ...
    本站小编 Free考研考试 2021-12-25
  • 空气-水段塞流冷却传热与相界面分布实验研究
    王鑫1,董传帅2,张晓凌1,岳晓庆1,何利民11.中国石油大学(华东)储运与建筑工程学院储运系,山东青岛266580;2.香港理工大学,香港9990772016年04月28日收稿;2016年07月22日收修改稿基金项目:国家自然科学基金(51376197)和山东省自然基金(ZR2011EEM029) ...
    本站小编 Free考研考试 2021-12-25
  • 气液两相间歇流管道蜡沉积实验研究
    高歌,吴海浩,全青,宫敬,王玮中国石油大学(北京)油气管道输送安全国家工程实验室,北京1022492016年05月25日收稿;2016年11月22日收修改稿基金项目:国家自然科学基金重点基金(51534007)和国家科技十三五重大专项(2016ZX05028-004-001)资助通信作者:宫敬,E- ...
    本站小编 Free考研考试 2021-12-25
  • 微小颗粒在多孔介质中运移的实验研究
    雷海燕1,崔明杰1,戴传山1,李琪21.天津大学中低温热能高效利用教育部重点实验室,天津300072;2.东北电力大学,吉林吉林1320122016年04月18日收稿;2016年09月28日收修改稿基金项目:国家自然科学基金(51306130和41574176)资助通信作者:雷海燕,E-mail:l ...
    本站小编 Free考研考试 2021-12-25
  • SAR原始数据压缩的自适应比特分配BAQ算法
    潘志刚,王小龙,李志勇中国科学院电子学研究所,北京1001902016年03月16日收稿;2016年05月16日收修改稿基金项目:国家自然科学基金(61101201)资助通信作者:潘志刚,E-mail:zgpan@mail.ie.ac.cn摘要:针对SAR原始数据压缩,在传统BAQ算法基础上,提出一 ...
    本站小编 Free考研考试 2021-12-25