Reconstruction of dynamical forecasting model between

1
Reconstruction of dynamical forecasting model between
2
Western Pacific subtropical high area index and its
3
summer monsoon impact factors based on the improved
4
self-memorization principle
5
6
Mei Hong 1 ,Ren Zhang 1 , Xi Chen1, Shanshan Ge 1 , Chengzu Bai1, Vijay P. Singh 2, 3
7
8
1. Research Center of Ocean Environment Numerical Simulation, Institute of Meteorology
9
and oceanography, PLA University of Science and Technology, Nanjing 211101,China
10
2.Department of Biological and Agricultural Engineering, Texas A & M University, College
11
Station, TX77843, USA
12
3.Zachry Department of Civil Engineering, Texas A & M University, College Station, TX
13
77843, USA
14
15
Corresponding author address: Mei Hong
16
Institute of Meteorology,
17
PLA University of Science and Technology,
18
P.O. Box 003, No. 60,Shuanglong Road,
19
Nanjing 211101, China.
20
Telephone and Fax number: 0086-025-80831626
21
E-mail: flowerrainhm@126.com
22
23
24
25
1
26
Abstract. W ith the objective of tackling the problem of inaccurate long -term Western Pacific
27
Subtropical High(WPSH) forecasts,based on the concept of dynamical model reconstruction and
28
improved self-memorization principle, a new dynamical forecasting model of WPSH area (SI) index is
29
developed. To overcome the problem of single in itial pred iction value, the largest Lyap unov exponent
30
is introduced to imp rove the traditional self-memorization function, making it mo re appropriate to
31
describe the chaotic systems, such as WPSH; and the equation reconstruction by actual data is used as
32
its dynamical core to overco me the proble m of relatively simp le dynamical core. The developed
33
dynamical forecasting model of SI index is used to predict WPSH strength in long -term. Through 10
34
experiments of the WPSH abnormal years, forecast results within 25 days are found to be good, with a
35
correlation coefficient of about 0.80 and root mean square error under 8%, showing that the improved
36
model has satisfactory long-term forecasting results. Especially the aberrance of the subtropical high
37
can be draw and forecast. It is acknowledged that mechanism for the occurrence and development of
38
WPSH is co mplex, so the discussion in this paper is therefore exp loratory.
39
40
Key words: The subtropical high; Improved self-memorization function; Dynamical forecasting model
41
reconstruction; Long-term forecast; Largest Lyapunov exponent
42
43
44
45
46
47
2
48
1. Introduction
49
The Western Pacific Subtropical High (WPSH) is one of the most important components
50
of the East Asian Summer Monsoon (EASM) system [1]. The intensity and position of WPSH
51
show complex seasonal evolutions and the changes in them greatly affect the occurrence of
52
rainy season in China, including floods, droughts, heavy rains, etc[1]. For example, when
53
WPSH reaches the northernmost position, especially in summer, it significantly influences
54
rainfall over China and Japan[2].
55
Owing to its dominance on the East Asian climate, WPSH has become one of the leading
56
topics of interest in atmospheric sciences[3-5]. Over the past decades, much effort has gone
57
into to uncovering the forecast of the WPSH[6], especially the forecast of abnormal
58
WPSH[7-8]. Current forecasts for the WPSH can be divided into two categories: numerical
59
forecasts and statistical forecasts. Numerical forecasts are widely used throughout the world;
60
examples include the numerical forecast products of the European Centre for Medium-Range
61
Weather Forecasts Model [9], the Japanese FUFE502 numerical forecast products, etc.
62
However, numerical forecasts require field boundaries and the complex computations and low
63
efficiency mean that the results are very unstable. Numerical forecast products have better
64
results for large-scale weather systems, but for mesoscale weather systems, such as the WPSH,
65
the results are less good and the forecast time is short.
66
Statistical methods, in contrast, have achieved some success in forecasting the WPSH.
67
These methods can use historical data and the computation is simple. However, there are
68
some inherent flaws in statistical methods. Using neural networks as an example, it is difficult
69
to objectively determine the number of hidden layer neurons and the training process tends to
3
70
predict a local optimum, which will limit the forecast accuracy [10]. Moreover, the reliability
71
of all these methods is gradually reduced with increasing forecast time, so the forecast results
72
and credibility become very low after two weeks [11]. Statistical forecasting products and
73
numerical forecasting products both have some degrees of bias. Especially, error is more
74
obvious in WPSH anomalies and long-term forecasting [3]. So the prediction of unusual
75
activities of WPSH within season and the long-term trend forecast of WPSH has become
76
difficult problems in recently years.
77
A statistical-dynamical model of a weather system is reconstructed from actual data and
78
can be used to describe the physical mechanisms of a complex weather system. Concerning
79
the questions of local convergence of errors, Zhang et al. [12] introduced genetic algorithms
80
(GA), which is widely used in a lot of field [13-14], to improve the determination of root
81
efficiency of model parameters. On that basis, Mei et al. [15-16] carried out research on the
82
reconstruction of a nonlinear statistical-dynamical forecast model of the WPSH, and achieved
83
good results.
84
However, the dynamical prediction equations derived by Zhang et al.[12] and Mei et
85
al.[15-16] greatly depend on the initial value, so the long-term forecast over 15 days diverged
86
significantly and the results were not satisfactory. For the long-term forecast, the model
87
should be improved. Cao [17] proposed the self-memorization principle, transforming the
88
dynamical equation into memorization equation in a
89
differential-integral equation, wherein the memory coefficients can also be determined by
90
actual data. This method has been widely used in prediction problems in meteorology,
91
hydrology and environmental field [18-20]. Because this method avoids the problem of initial
broader sense, named a
4
92
condition in differential equations, it can be introduced to improve the proposed dynamical
93
forecast model.
94
The set of self-memory function is relatively simple [17] and is suitable for cyclical
95
and linear systems. For nonlinear systems, especially chaotic systems, forecast results are
96
unsatisfactory [20].As the atmosphere and ocean are nonlinear systems, the self-memory
97
function is needed to be modified for nonlinear-system, modeling. The largest Lyapunov
98
exponent is introduced to improve the traditional self-memorization function. Finally, the
99
improved dynamical forecasting model of WPSH with a new self-memorization function is
100
developed. The improved function not only takes into account the chaotic characteristics of
101
the nonlinear system, but also absorbs the information of past observations.
102
In our study, we firstly define the WPSH area index (SI) as a measurement of the
103
scope and form of the WPSH. The rest of this paper is organized as follows: Section 2
104
introduces data and factors. Three summer monsoon factors which have higher correlation
105
with SI are chosen. The dynamical model of SI and three factors is reconstructed in section 3.
106
In section 4, self-memorization dynamics is introduced to improve the reconstructed model.
107
The improved self-memorization functions for chaotic systems are investigated and discussed
108
in section 5. Forecast experiment of 2010 and further forecast tests of an additional nine
109
WPSH abnormal years are described in section 6, and results are summarized in section 7.
110
2. Research data and factors
111
2.1.Data
112
The daily data from May to October of the past 30 years from 1982 to 2011 were
113
obtained from the National Centers for Environmental Prediction (NECP) Climate Forecast
5
114
System Reanalysis (CFSR), including: horizontal wind field and geopotential height field at
115
850 hPa and 200 hPa; geopotential height field at 500 hPa; sea level pressure field with a
116
resolution of 0.5° × 0.5°; and sensible heat flux, latent heat flux, and precipitation rate in the
117
Gaussian grid.
118
Observed long-wave radiation (OLR) from May to October of the 30 years from 1982 to
119
2011 of National Oceanic and Atmospheric Administration (NOAA) satellites, with a
120
resolution of 0.5° × 0.5°. Unit is W m−2 .
121
These data are primarily used to calculate the SI and summer monsoon impact factors in
122
section 2.3.
123
2.2 The basic facts of WPSH activity in the summer of 2010
124
Change of WPSH within seasons in various years is very different compared with the
125
average, especially the “abnormal” activities of the WPSH, which often lead to East Asia
126
subtropical circulation anomalies and extreme weather events in China in some years, such as
127
1998, 2003, 2006 and 2010 [21-22].
128
In order to better describe the changes in the WPSH, SI is used, which has been defined
129
by the Central Meteorological Observatory [23] to characterize the WPSH range and intensity,
130
i.e. the average grid points of 500 hPa geopotential height greater than 588 gpm in the range
131
[10 ° -90° N, 110 ° E -180 ° E]. The higher the value, the wider is the range represented, or
132
the greater is the intensity.
133
SI of summer half-year (from May to October) from 1982 to 2011 can be calculated, and
134
changes of SI in different years are shown in Fig. 1. In Fig. 1, the straight line represents the
135
average SI in the observed years. SI varied widely in different years, representing irregular
6
136
activities of the WPSH; particularly, in some years, SI showed dramatic changes. For example,
137
in 2010 SI is 32, while in 1984 SI is just 6, which indicates that WPSH activities behaved
138
abnormally in these particular years. If we use the average data of several years, these
139
exceptions will not be apparent. So we should choose an abnormal SI year to study.
140
From Fig. 1, 2010 is the most prominent year of WPSH anomalies as shown by the
141
arrow. From May to October in 2010, SI is above the mean and it is the peak of nearly 30
142
years. The anomaly of WPSH strength resulted in a very unusual climate in China in 2010.
143
Extreme heat and heavy precipitation events occurred frequently. The strength and wide range
144
of these events are rarely seen in history. Especially, the most powerful WPSH happened
145
from June to August and is to date the strongest one in the meteorological records (Fig. 2).
146
This most powerful WPSH directly resulted in rare flood disasters in the eastern part of south
147
China, the Yangtze River and northeast and northwest of China. Therefore, we selected the
148
summer of 2010 as a typical case to analyze the relevance between the enhanced WPSH and
149
the members of the monsoon system.
150
2.3. Selection of three factors
151
According to previous studies [21-22], there are many members of the summer monsoon
152
system, 21 of which are closely related to the WPSH. If all are used for modeling, the
153
equation would be too complex. Therefore, the correlation analysis method is used to analyze
154
the correlation between these factors and SI. (Specific definition of each factor can be seen in
155
Xue et al. [24] and Yu et al. [22]. Values of these factors can be calculated from CFSR data as
156
in section 2 and are not described in detail here.)
157
Based on the above correlation analysis, three factors with the best correlation with the
7
158
SI are filtered out for further study, which are:
159
(1) Mascarene cold high strength index (MH): the average grid points of sea level pressure
160
within [40–-60° E, 20–10° S] region;
161
(2) Tibetan high (eastern type) activity index (TH): the average grid points of 200 hPa
162
geopotential height within [95–105° E, 25–30° N] range;
163
(3) Monsoon circulation index at Bay of Bengal (J1V): the average grid point J1V =
164
V850-V200 within [80–100° E, 0–20° N] region.
165
166
The correlation coefficients between the above three factors and SI are 0.77, -0.80 and
0.86, all of which can reach above 0.75.
167
The Mascarene high in the southern hemisphere enhances the WPSH early in the year,
168
which is a positive correlation and a very close relationship, consistent with previous research
169
[24]. The close relationships among TH, J1V and SI are basically consistent with previous
170
research [22,25].
171
3. Reconstruction of the dynamical model based on GA
172
Takens [26] set forth and tested the basic idea of reconstructing the dynamical system
173
from the time series of observed data in his phase space reconstruction theory. Hence,
174
nonlinear dynamics research has entered a new stage, where the evolution information of the
175
dynamical system can be extracted from the time series, such as the calculation of fractal
176
dimension and Lyapunov exponent [27]. However, these studies help understand what a
177
chaotic system is and what the periodic system is. Dynamical models for practical problems
178
have not yet been developed fully. Huang and Yi [28] proposed a method of nonlinear
179
dynamic model reconstruction from the actual data and tested it with the Lorenz system, the
8
180
result of which was satisfactory. Therefore, we introduce this idea and improve it for
181
reconstructing a dynamical model of the SI and three factors.
182
The principle of dynamical model reconstruction has been introduced in the
183
previous studies [12,15-16] and is not described in detail here but can be found in Appendix
184
A.
185
The existing parameter estimation methods (such as neighborhood search method and
186
least-square estimation) are mostly one-way search which needs to travel the entire parameter
187
space, so the searching efficiency is low. Because of the limitation of the error gradient
188
convergence and its dependence on initial solution, parameter estimation is prone to fall into
189
local optimum, rather than global optimum. GA is a method extensively used as a global
190
optimization method which has developed in recent years. GA has been found to be excellent
191
in global search and parallel computing, and error convergence rate is greatly improved, so it
192
is helpful in parameter optimization.
193
We can use a simplified second-order nonlinear dynamic model to depict the basic
194
characteristics of the atmosphere and the ocean interactions [29]. In our paper,
195
x 1 ,x 2 ,x 3 ,x 4 is used to represent time coefficient series of SI, MH, TH and J1V, then GA
196
is introduced to reconstruct the dynamical model and for parameter optimization.
197
198
T
With S  ( D  GP) ( D  GP) minimum as a restriction, an optimization solution
searching is made in the model parameter space by GA.
199
Suppose that the following nonlinear second-order ordinary differentia l equations are
200
taken as the dynamical model of reconstruction and reconstruction, we choose the time
201
coefficient series of SI, MH, TH and J1V from May 1 to July 31 in
2010
as the expected
9
202
data to optimize and retrieve model parameters.
dx1
dt
dx2
203
dt
dx3
dt
dx4
dt
 a1 x1  a2 x2  a3 x3  a4 x4  a5 x1  a6 x2  a7 x3  a8 x4  a9 x1 x2  a10 x1 x3  a11 x1 x4  a12 x 2 x3  a13 x 2 x 4  a14 x3 x 4
2
2
2
2
 b1 x1  b2 x2  b3 x3  b4 x4  b5 x1  b6 x2  b7 x3  b8 x4  b9 x1 x2  b10 x1 x3  b11 x1 x4  b12 x2 x3  b13 x2 x4  b14 x3 x4
2
2
2
2
 c1 x1  c2 x2  c3 x3  c4 x4  c5 x1  c6 x2  c7 x3  c8 x4  c9 x1 x2  c10 x1 x3  c11 x1 x4  c12 x2 x3  c13 x2 x4  c14 x3 x4
2
2
2
2
 d1 x1  d 2 x2  d 3 x3  d 4 x4  d 5 x1  d 6 x2  d 7 x3  d 8 x 4  d 9 x1 x 2  d 10 x1 x3  d 11 x1 x 4  d 12 x 2 x3  d 13 x 2 x 4  d 14 x3 x 4
2
2
2
2
204
(1)
Suppose that the parameter matrix P  [a1 , a2 ,......a9 ; b1 , b2 ,......b9 ; c1 , c2 ,......c9 ] in
205
206
the
207
S  ( D  GP)T ( D  GP) is the objective function, the individual fitness value is li 
208
and the total fitness value is L 
above
equation
is
the
population, the
minimal residual sum of
squares
1
,
S
n
 l . The idiographic operating steps include: coding and
i 1
i
209
creating of initial population, calculation of fitness, male individual choice, crossover and
210
variation, and so on. For a complete theoretical explanation, one can refer to the literature
211
[30]. During calculation, the step length is one day. After about 35 times of genetic operations
212
and optimization search, it’s possible to converge to the target adaptive value rapidly and
213
retrieve each optimized parameter of the dynamical equations. After eliminating weak items
214
with little dimension coefficient, the nonlinear dynamic model of SI and three factors are
215
inverted as follows:
216
217
 dx1
 dt

 dx2
 dt

 dx3
 dt
 dx
 4
 dt
 F1  15.86 x  0.0332 x  838.0874 x  49.2840 x  3.259  10 x  1.5532  10 x x  0.0083 x x  4.9142  10 x x
7
1
2
3
4
4
2
2
4
1
2
2
3
2
4
 F2  25.4550 x1  2.245  10 x3  194.127 x4  2.4791  10 x1 x2  0.0222 x2 x3  0.0019 x2 x4
4
3
8
5
4
 F3  2.1993 x1  0.0032 x2  73.1122 x3  3.1485  10 x2  2.1743  10 x1 x2  7.2315  10 x2 x3
2
 F4  5.7855 x1  0.0225 x2  11.8816 x4  0.3044 x3  0.0054 x4  0.0092 x1 x3  0.0975 x3 x4
2
2
(2)
10
218
The dynamical reconstruction model should be tested. So we chose the data of August 1
219
in 2010 of SI, MH, TH and J1V which do not participate in the modeling as the forecast
220
initial data. Then the Longkuta method is used to carry out numerical integration of the above
221
equations and every step of integration can be regarded as a day forecasting result. The
222
forecast results of four time series within 25 days can be obtained. The correlation
223
coefficients between forecast results and actual results of SI, MH, TH and J1V within 25 days
224
are respectively, 0.5659,0.5746,0.6091,0.5023 and root mean square errors (RMSE) is
225
36.22%, 34.54% , 32.71% and 36.78%. This indicates that the results of long-term forecast
226
within 25 days are unsatisfactory for the dynamical reconstruction model. That is because the
227
integration results may diverge significantly with time, as well as our initial data for
228
integration is relatively simple. Therefore, we should improve the model to get better
229
long-term forecast results.
230
Following these analyses, correlation coefficients ( r ) were evaluated using the t-test
231
(   0.05 ,).
232
confidence level and were thus deemed to be statistically acceptable.
233
4. Introduction of self-memorization dynamics to improve the reconstructed model
All resulting coefficients were found to be statistically significant at the 95%
234
From the above discussion, we know that the accuracy of forecast results of the
235
dynamical reconstruction model within 25 days is unsatisfactory. Literature suggests that
236
introducing the principle of self- memorization into the mature model can improve long-term
237
forecasting results [18,20]. So the self- memorization principle is introduced to improve the
238
model.
239
Following the study of Cao [17] and Feng et al. [19], the mathematical principle of
11
240
241
242
243
244
245
246
247
248
self-memorization dynamics of systems is shown in Appendix B.
Using equations (2) as the dynamical kernel, the improved model based on the
self-memorization equation (B10) in Appendix B can be expressed as:
0
0

 x1t   a1i y1i   1i F1 ( x1i , x2i , x3i , x4i )
i  p
i  p

0
0

 x2t   a2i y2i    2i F2 ( x1i , x2i , x3i , x4i )
i  p
i  p


0
0
x 
a3i y3i   3i F3 ( x1i , x2i , x3i , x4i )

3t

i  p
i  p

0
0
x 
a
y

 4i F4 ( x1i , x2i , x3i , x4i )


4
t
4
i
4
i

i  p
i  p

(4)
Because we predict the value of SI,the first formula in equation (4) are used to get
results of our final prediction. That is:
x1t 
0
0
i  p
i  p
 a1i y1i   1i F1 ( x1i , x2i , x3i , x4i )
(5)
If we can get the value of a,  , Eq. (5) can be used for prediction. Values of a,  are
associated with the self-memorization function  , which is defined in the next section.
249
We make a prediction with the self- memorization equation (5), the model uses the p
250
values before t0 , therefore  (t ) in the self- memorization equation(5) in memorizing the
251
p  1 values of x . This explains the reason why we call  (t ) the self-memorization function.
252
This is the mathematical basis for introducing the self-memorization principle.
253
The physical basis for introducing the self-memorization principle is that the
254
thermodynamic equation is one of atmospheric motion equations, which implies that the
255
atmospheric motion is an irreversible process. An important contribution to the study of
256
irreversible process is to introduce the memory concept to physics. So the atmospheric
12
257
development in the future is not only related to the state at the initial time, but also to states in
258
the past, which means the atmosphere does not forget its past.
259
5. Improved self-memorization functions for chaotic systems
260
Cao [17] referred to the problem of self-memorization function assignment. Based on
261
experience and tests, he found the memory level gradually decreased with the distance of
262
initial prediction time, so the memory function can be expressed as:
263
 0

 (i)  e a (tN ti )
 1

ti  t N  P
t N  P  ti  t N
(6)
ti  t N
264
It shows that the self-memorization function (6) indicated by Cao only considers the
265
characteristics of memory level gradually decreasing with the distance of initial prediction
266
time, and ignores nonlinear characteristics of the self-memorization function. So we modify
267
the self-memorization function as:
268
0

  a (tN ti ) (1 r )
 (i )  e
ti

1

ti  t N  P
t N  P  ti  t N
(7)
ti  t N
269
Term e a (tN ti ) represents the characteristics of memory level gradually decreasing with the
270
distance of initial prediction time; while ti (1 r ) reflects the nonlinear characteristics of
271
self-memorization function. r and a are the parameters to be determined, and they are
272
important for the specification of the assigned self-memorization function  (i ) . In Cao’s
273
study, he found no good way to determine parameters r and a , only believing that they
274
had powerful connection with the past observational data.
275
The Lyapunov exponent(LE) has long been used to study atmospheric and oceanic
276
predictability [31]. In chaotic dynamical systems theory, the largest Lyapunov exponent can
13
277
be portrayed as a whole (long-term) average forecast error growth rate, which generally
278
describes the divergence of nonlinear chaotic systems. So it is often introduced in the forecast
279
study of chaotic systems such as atmospheric and oceanic systems [32-33]. The traditional
280
self-memorization function is useful in the linear periodic system forecast, but not appropriate
281
in the forecast of nonlinear chaotic systems. The largest Lyapunov exponent (LLE) is one of
282
the indexes which can represent the characteristics of chaotic systems. So here r can be
283
considered as LLE of the system, which can be calculated from the reconstructed dynamical
284
equation in section 3. And a can be obtained through GA from historical data.
285
5.1. Determination of Parameter r
286
Based on the above analysis, the solution of r requires the solution of the LLE in
287
dynamical equation (2).
288
5.1.1 Definition of LLE
289
The definition of the Lyapunov spectrum and the largest Lyapunov exponent (LLE) can
290
be referred to the literature [34] and are not described in detail here. The LLE determines the
291
notion of predictability for a dynamical system. A positive LLE is usually taken as an
292
indication that the system is chaotic (provided some other conditions are met, e.g., phase
293
space compactness).
294
5.1.2 The principle of Lyapunov exponent spectrum calculations
295
If the dynamic differential equations are known, through theoretical derivation or using
296
a numerical iterative algorithm to discretize differential equations, the exact Lyapunov
14
297
exponents of known dynamical systems can be obtained. First, we work out the solution of
298
the ordinary differential equations and get the Jacobi matrix. Then, we decompose the Jacobi
299
matrix by QR while necessary orthogonal reformings in a multiple of small time intervals are
300
performed. Finally, the Lyapunov spectrum of the system can be obtained by iterative
301
calculations. The specific calculation principle of the the Lyapunov exponents spectrum are
302
not shown here, which can be referred to Hubertus et al. [35].
303
5.1.3 Lyapunov spectrum of dynamical systems and determination of r
304
According to the above calculation method, the Lyapunov spectrum of the reconstruction
305
dynamical model of SI and three factors can be obtained as in Fig. 2 which shows that
306
convergence speed and volatility are not too large and they are relatively stable. Final
307
Lyapunov exponents are [0.3816,0.0016,-0.5715], containing both negative Lyapunov
308
exponent and two positive Lyapunov exponents, which demonstrate our dynamical system is
309
indeed a chaotic system.
310
311
Finally, r is taken as the LLE, and r =0.3816.
5.2. Determination of Parameters a
312
Not only the nonlinear characteristics of chaotic system should be considered, but also
313
the impact of the past actual data on self-memorization function should be took into account.
314
So parameter a should be optimized.
315
The equation (5) is written as equation (8):
15
x(t )  f (t , x,  )
316
317
318
319
The initial value is x(ti )  xi , i  N , N  1,..., N  p .
Our goal is to make the smallest error between the fitting values and the observed values
of p periods before, i.e., secondary quality index reaches a minimum.
J (u ) 
320
N
 ( xˆ(t )  x(t ))
i  p 1
321
322
325
i
i
2
 min
(9)
where xˆ is the estimated value of the prediction model. x is the actual value. When
further constraint is added, the self-memorization function should be taken as:
 (ti )  1
323
324
(8)
(10)
Parameter r can be obtained by solving for the optimal solution of equation (9) in the
constraint condition of equation (10). Setting J (u ) 
N
 ( xˆ (t )  x (t ))
i  p 1
i
i
2
as the objective
326
function of GA, putting  (ti )  1 as a constraint, the specific process of GA is discussed in
327
section 3.
328
We can see the value of parameter a is related with the p data before forecasting time t .
329
The forecasting value is preserved as the early data for the next prediction. With the change of
330
data, the value of parameter a is varied. The first calculation is a1  0.244 through 39
331
iterations of GA, then in next prediction step, parameter a is varied, which we do not list
332
one by one here. In fact, the continuous adjustment of parameter a is more accurate.
333
Self-memorization function  is determined, and then it is substituted into equation (5),
334
which can be used to do prediction.
335
6. Model Prediction experiments
16
336
6.1. Effective steps and maximum effective computation time
337
In order to further study the improved model, the algorithm to solve for effective steps
338
and maximum effective computation time should be given, based on the dynamic core
339
(equation(2)) of the improved model. Currently the optimal search method is used, which is
340
achieved based on the size of the differences among many numerical solutions. Its principles
341
can be seen in David [36].
342
In step interval [ hmin , hmax ], ( hmin is a very small number), n steps are selected,
343
hi  hmin  i * dtt  hmax (i  1, 2,..., n) , dtt is known as the time increment. The n steps are
344
integral to t at the same time, getting the n number of numerical solutions, denoted
345
by y t ( n ) . If their differences Vs (t ) are less than the given tolerance limit  , it means that
346
they are relatively close to the true solution at t time. At that time the effective time step
347
interval is [ hmin , hmax ] and the width is Wh (t )  lg hmax  lg hmin . Otherwise, if Vs (t )   ,
348
that means values between some step appear larger deviations, so removing these step from
349
y t ( n ) will make the difference of the remaining n1 (n1  n) solutions less than . The
350
optimum is to exclude the least number.
351
The effective step intervals which meet such conditions can be gotten. Given integral
352
step, we continue to do integral calculation to t  Nd t , here, N is an integer. Integral
353
operation continues until
354
interval [ h j (t1 ), h j 1 (t1 ) ] is divided into m parts for a another round of integral operation,
355
until the difference of the solution in m 1 step is smaller than  . In this case, the maximum
356
effective computation time of the dynamical kernel of the improved model (equation (2)) can
357
be obtained, as shown in Fig. 4.
there are only two adjacent steps h j (t1 ), h j 1 (t1 ) left. Then step
17
358
From Figure 4, when  is certain, for our system, the maximum effective computation
359
time increases with the increment of dtt , and finally remains at 28.7, which is the maximum
360
effective computation time(Figure 4a). When dtt remains constant, the maximum effective
361
computation time oscillates with  (Figure 4b).
362
6.2. Prediction experiment of 2010
363
The period from 2 August to 5 September in 2010 is selected to do predictions (35 days in
364
total). August contains abnormal activities of WPSH. Here, the forecasting result is obtained by
365
the sum of equation (5), which is called as step-by-step forecast. The specific procedure is shown
366
as follows:
367
Step-by-step forecasts are made after retrospective order p is fixed. That means when we
368
forecast the SI value of 2 August, we must get y i based on the previous p +1 time SI data
369
and F (x 1i ,x 2i ,x 3i,x 4i ) based on the previous p time SI, MH, TH and J1V data
370
(because x 2 , x 3 and x 4 are MH, TH and J1V). Then using them in equation (5) can obtain
371
the SI value of 2 August. Then, taking the SI value of 2 August as the initial value for the next
372
prediction step, the SI value of 3 August can be generated, and so on.
373
6.2.1 Determination of p
374
When the principle of self-memorization is introduced, the retrospective order p is
375
related with self-memorization of the system [17]. If the system “forgets” slowly, which
376
means parameters a and r are smaller, a high level of p should be used. WPSH abnormal
377
activities are on ten days scale[22,24], which is a slow process in contrast to the large-scale
378
atmospheric motion. So parameters a and r are smaller, and generally p is in the range
379
of 5 to 15 .
18
380
381
382
383
The correlation coefficients and root mean square errors(RMSE) between forecast value
and real value can be calculated with different retrospective order, as shown in Table 1.
From the table, we can see when p  8 , the correlation coefficient is the largest and
the root mean square error is the smallest. So we choose p  8 .
384
After p is determined, the improved self-memorization equation (5) can be used for
385
prediction experiments. Short, medium and long-term integration forecasts of 15 days, 25
386
days and 35days are carried out. Retrospective order p  8 , it means earlier observational
387
data ( p  1  9 ) are used for integration began. The integral result per day is preserved as
388
preliminary information, and will be used for integration in next period.
389
6.2.2 Prediction results of 2010
390
391
392
The forecast results within 35 days are shown in Fig. 5, which show that the forecast
result of SI is better.
As can be seen in Fig. 5, forecast performance of the first 15 days is better. The
393
correlation coefficient has reached 0.9542 and the root mean square errors(RMSE) is 2.45%.
394
Two peaks and one valley of SI are also forecast very accurate. The forecast time series from
395
15 days to 25 days gradually diverges, but the trend is accurate. The correlation coefficient
396
reaches 0.9254 and the RMSE is 6.37%. Forecast rate is still accurate. The peak of SI in 18
397
August is also forecasts accurately.
398
After nearly 25 days, the RMSE begins to increase: the forecast trend is still accurate
399
with correlation coefficient of 0.8136. Forecast curve from 25 days to 35 days is accurate
400
compared with the actual situation. But the peaks and valleys are not forecast accurately.
401
Especially after 28 days, the error has started to increase significantly. For example, the
19
402
forecast value of August 28 is nearly 100 more than the actual value, resulting a false peak.
403
The root mean square error increases to 19.18%, but it is still controlled within 20%. Fig.5
404
shows the forecast results after 28 days are obviously unsatisfactory with the occurrence of
405
greater oscillations. The effective forecast period (within 28 days) happens to be in accord
406
with the maximum integration time (28.7 units) calculated in section 6.1.
407
In brief, from Fig. 5, we can see that the short-term forecast within 28 days is accurate,
408
and the average RMSEs are less than 8%, which indicates that the reconstruction model can
409
perform accurate predictions of the changing trends of indices.
410
6.3 More forecasting experiments of WPSH abnormal years
411
To further test the forecasting performance of the improved model, cross testing of more
412
experiments is performed. From Fig. 2 discussed in section 2.2, we choose another four years
413
(1998, 2006, 1987 and 1983) in which WPSH intensity is abnormally strong (bigger SI) and
414
five years (1984, 2000, 1994, 1999 and 1985) in which WPSH intensity is abnormally weak
415
(smaller SI) to carry out integral forecast experiments of SI. In accordance with the previous
416
idea, the common 2010 model which is reconstructed in section 4 is used to carry out the
417
cross testing. Forecast results of different time periods (1–15 days as short term, 16–25 days
418
as medium term and 26–35 days as long term) are compared with the actual situation. Cross
419
test results are displayed in Table 2. From the table, we can see the forecast results of the short
420
term and the medium term are accurate, and those of the long term (>26 days) can also be
421
accepted. The results of MH, FLH and TH are similar to those of SI.
422
7. Summary and discussion
423
7.1 Summary
20
424
Although in recent years the WPSH has become one of the leading topics of
425
atmospheric sciences, it is still difficult to be forecast. Combining dynamical system
426
reconstruction idea with the improved self-memorization principle, a new dynamical
427
forecasting model of SI index is developed. The approach of this paper consists of the
428
following steps:
429
(1) We use correlation analysis method to discuss contributions and impact of the key
430
members of EASM to the abnormal WPSH in 2010 and identify the three most significant
431
factors: the Mascarene high, the Tibetan high (eastern type) and Monsoon circulation index at
432
Bay of Bengal.
433
(2) Take the SI, MH, XZ and J1V time series and consider them as trajectories of a set
434
of 4 coupled quadratic differential equations based on the dynamic system reconstruction idea.
435
Parameters of this dynamical model are estimated by a genetic algorithm (GA).
436
(3) Apply the "principle of self-memorization" in order to improve the forecasting
437
results of the above dynamical model. This involves a manipulation of the time series using
438
an exponentially decaying (in time) function  (r ) , which also depends on the largest
439
Lyapunov exponent of the above dynamical system. A second free parameter of  (a) is
440
determined by minimizing the distance of the modified reconstructed time series.
441
(4)The improved model is used to forecast the SI index. Through 2010 experiments and
442
other 9 experiments of WPSH abnormal years, we find forecast results within 25 days are
443
good, which proves that our improved model has better long-term forecasting results. Given
444
the complexity of the mechanism of the WPSH [3], the new dynamical forecasting model
445
has some scientific significance and practical value.
21
446
Based on the discussion in section 1, we know that the forecast results and credibility of
447
the previous methods are very low after two weeks [11]. So our improved
448
statistical-dynamical model represents an exploration of and supplement to the traditional
449
numerical forecasting and statistical forecasting methods, and also extends the prediction
450
time.
451
7.2 Discussion
452
Why the forecasting results of our improved model are especially good? (1) Previous
453
studies have showed that the long-term forecasting results of dynamical system reconstructed
454
model are unsatisfactory, thus, we introduce the self-memorization principle to improve the
455
long-term forecasting results. Previous studies have proved that this approach is feasible. For
456
example, Gu et al.[18] used the self-memorization principle to improve the traditional T42
457
model. And Wang et al. [25] also carried out dynamical prediction of building subsidence
458
deformation with self-memorization model. Long-term forecasting results of their models
459
were very good. Our study also shows that introducing the self-memorization principle to
460
improve a mature model can bring better long-term forecasting results. (2)The largest
461
Lyapunov exponent is firstly introduced to improve the traditional self-memorization
462
function, making it more appropriate to describe the chaotic systems, such as WPSH. (3) In
463
developing the improved model, the parameters are obtained from the historical data, which
464
contain sufficient information of WPSH abnormal process. Statistical data combining with
465
improved dynamical model makes forecasting results more reliable.
466
467
Although the forecast results of improved model are better, there are still some issues
which are needed for further research:
22
468
469
(1) The physical meaning of the factors in the model is not clear, so its dynamical
characteristics should be further analyzed.
470
(2) The forecast accuracy has a great relationship with self-memorization functions.
471
Maybe we can find a better self-memorization function to improve forecast accuracy for long
472
term in the future.
473
These two items are the focus of our next work.
474
Acknowledgments.This study is supported by the Chinese National Natural Science Fund for
475
young scholars (41005025/D0505), the Chinese National Natural Science Fund (41375002;
476
41075045) and the Chinese National Natural Science Fund (BK2011123) of Jiangsu Province.
477
478
Conflict of Interests: The authors declare that there is no conflict of interests regarding the
479
publication of this article.
480
481
482
483
484
485
486
487
488
489
APPENDIX A: Principle of dynamical model reconstruction
Suppose that the finite difference form of the physical law of any nonlinear system
evolving with time can be expressed as follows:
qi( j 1) t  qi( j 1) t
 f i (q1jt , q 2jt ,..., qijt ,..., q Njt )
2t
where fi is the generalized nonlinear function of
j  2,3,......M  1
q1, q2 ,...,qi ,...,qN
(A1)
, N is the number of state
variables and M is the length of time series of observed data. We assume that
fi (q1jt , q2jt ,..., qijt ,..., qNjt )
variable qi and
Pi k
contains two parts:
G jk
representing the expanding items containing
just representing the corresponding parameters which are real numbers
23
490
491
(
i  1, 2,...N , j  1,2,...M , k  1,2,..., K ).
It can be supposed that:
K
492
f i ( q1 , q2 ,..., qn ) 
G
jk Pik
(A2)
k 1
493
The matrix form of equation (A2) is D  GP , in which
494
 qi3t  qit 


 d1  
2t

 d   q 4 t  q 2 t 
i
i
  

D 2

2t
...
  

...
d M   qiMt  qi( M  2) t 


2t
 G11 , G12 ,......G1K 
 Pi 1 
 G , G ,......G 
P 
 i 
 21 22
2, K 
P 2
G

...
 ... 




 Pi K 

GM 1 , GM 2 ,.....GM , K 
,
,
(A3)
495
Coefficients of the above generalized unknown equation can be identified through
496
inverting the observed data. Given a vector D, vector P can be solved to satisfy the above
497
equation. It is a nonlinear system with respect to q , however it is a linear system with respect
498
to P (assume P is unknown). So the classical least square method can be introduced to
499
T
T
estimate the equation and the regular equation G GP  G D can be derived by making the
500
T
residual sum of squares S  ( D  GP) ( D  GP) minimum.
501
502
Based on the above approach, coefficients of the nonlinear dynamical systems can be
determined and the nonlinear dynamical equations of observed data can be established.
503
504
505
506
APPENDIX B: Mathematical Principle of self-memorization dynamics of systems
In general, dynamical equations of a system can be written as:
xi
 Fi ( x,  , t )
t
i  1, 2,..., J
(B1)
507
where J is an integer,xi the ith variable of the system state,  parameter. Eq. (B1)
508
expresses the relationship between a local change of x and a source function F . Obviously,
24
is scalar function at the space r0
509
x
510
time T  [t p ...t0 ...tq ]
511
space R  [ra ...ri ...r ] ,where ri is a spatial point considered. An inner product in space
512
L2 : T  R is defined by
where
t0
is
an
initial
t . Consider a set of
time,
b
( f , g )   f ( ) g ( )d , f , g  L2
513
514
,
and the time
a
and
a
set
of
(B2)
Accordingly, define a norm
b
f  [  ( f ( ) 2 d  ]
515
1
2
a
516
Making a completion for L2 , it becomes a Hilbert space H . A solution of the
517
multi-time model can be regarded as a generalized one in H . Dropping i in Eq. (B1) and
518
applying an operation of the inner product defined in Eq.(B2), by introducing a memorial
519
function  (r , t ) , we obtain

520
t
t0
 ( )
t
x
d    ( ) F ( x, )d
t0

(B3)
521
where r in  (r , t ) is dropped because of fixing on the spatial point r0 . Supposing
522
variable x and function  (r , t ) etc. are all continuous, differentiable and integrable,
523
following the calculus, making an integration by parts for the left of q. (B3) yields
524
525
526
527

t
t0
 ( )
t
x
d   (t ) x(t )   (t0 ) x(t0 )   x( )  ( )d
t
0

(B4)
where  (t )   (t ) / t . Applying the median theorem in calculus to the third term
in the right-hand side of Eq. (B4). i.e. is obtained.
t
 x( ) ( )d   xm (t0 )[ (t )   (t0 )]
t0
(B5)
25
528
529
530
where the median x m (t0 )  x(tm ), t0  tm  t . Substituting Eqs. (B4) and (B5) for (B3)
and performing algebraic operation, we get
x(t ) 
 (t0 )
 (t )   (t0 ) m
1 t
x(t0 ) 
x (t0 ) 
 ( ) F ( x, )d
 (t )
 (t )
 (t ) t
(B6)
0
531
As the first and second term in Eq. (B6) relate to the x value only on the fixed point
532
r0 itself at the initial time t0 and the middle time t m , therefore they are called a
533
self-memory term. Naturally, call the third term an exogenous effect, i.e., total effect
534
contributed by other spatial points to point r0 in an interval t0 , t  .
535
For multi-time ti , i   p,  p  1..., t0 , t , similar to Eq. (B4), we have
t p1
t p  2
t
t
x
x
x
d  
 ( ) d  ...    ( ) d    ( ) F ( x, )d
t p1
t0
t p



536

537
By eliminating the same term  (ti ) x(ti ), i   p  1,  p  2,..., 0 , it gives
538
 (t ) x(t )   (t p ) x(t p )   [  (ti 1 )   (ti )]x m (ti )    ( ) F ( x, ) d  0
539
540
t p
 ( )
543
t
i  p
t p
are used in the following context. Then, equation(B7) can be written as:
0
t
i  p
t p
 t xt    p x p   xim (  i 1   i )    ( ) F ( x, )d  0
(B8)
m
Setting x p  x p1 ,   p1  0 , we rewrite Eq. (B8) as:
xt 
1
t
0

i   p 1
xim (  i 1   i ) 
1
t
 t t
p
 ( ) F ( x, )d  S1  S 2
(B9)
544
We call S 1 a self-memory term, S 2 an exogenous effect term.
545
For the convenience of calculations, to discretize self-memorization equation, the
546
(B7)
For simplicity, setting  t   (t ),  0   (t 0 ), xt  x(t ), x0  x(t 0 ) , similar notations
541
542
0
integration in Eq. (B9) in Appendix B is replaced by summation and the differential by
26
547
difference, and the median xim is simply replaced by the mean of two values at adjoining
548
times, i.e., xim 
1
( xi 1  xi )  yi
2
By sampling an equal time interval t , i.e., ti  t0  it , i  1, 0, 1, 2,...,  p , taking an
549
550
equal time interval t i  t i 1  t i  1 , where t0 is initial time, t0  t is forecast time, p
551
is retrospective order, and incorporating  i and  t , a discretized self-memorization
552
equation is obtained:
xt 
553
1

i   p 1
 i yi 
0
  F ( x, i )
i p
554
where F is the dynamic core of the self-memorization equation;  i 
555
i 
556
557
(B10)
i
(  i 1   i )
t
;
i
.
t
We call a technique with which one make forecast and computation based on Eq.(B10) a
self-memorization principle.
558
559
560
561
References:
562
[1] Miyasaka, T., and H. Nakamura. Structure and formation mechanisms of the Northern
563
Hemisphere summertime subtropicalhighs. J. Climate, 18 (2005) 5046–5065.
564
[2] Kurihara, K.. A climatological study on the relationship between the Japanese summer
565
weather and the subtropical high in the western northern Pacific. Geophys. Mag.. 43
566
(1989) 45–104.
567
568
[3] Tao S. Y., Ni Y. Q. and Zhao S. X. et al., The study on formation mechanism and forecast
of the 1998 summer rainstorm in China, Meteorological Press, Beijing, 2001.
27
569
[4] Wang H. J. and Xue F., Interannual variability of somali jet and its influences on the
570
inter-hemispheric water vapor transport and on the east Asian summer rainfall. Chinese
571
Journal of Geophysics. 01 (2003) 230-239.
572
[5] Xu X. L., Xu H. M. and Si D.. Relationship between June sustained torrential rain in
573
South China and anomalous convection over the bay of Bengal. Journal of Nanjing
574
Institute of Meteorology. 30 (2007) 463-471.
575
[6] Cao J, Huang R H, Xie Y Q and Tao Y., The study of the physical mechanism of West
576
Pacific subtropical high evolution. SCIENCE IN CHINA (SERIES D). 08 (2002)
577
659-666.
578
[7] Zhang R., Wang J. G. and Yu Z. H.. Teleconnection of Rossby inertial gravity waves and
579
East - West Pacific subtropical. Applied Mathematics and Mechanics. 23 (2002)
580
707-714.
581
[8] Grinsted J C and Moore S. J.. Application of the cross wavelet transform and wavelet
582
coherence to geophysical time series. Nonlinear Processes in Geophysics. 11 ( 2004)
583
561-566.
584
[9] Wu M. L., Liang H. and Wang Y., et al.. Contrast tests of precipitation products between
585
T213 and Germany numerical prediction in 2008. Journal of Meteorology and
586
Environment. 25 (2009) 4-12.
587
[10] Lu, R. Y.,
H. Ding,
C.-S.
Ryu,
et
al.. Midlatitude
588
disturbances preceding intraseasonal oscillations of convection
589
western
590
doi:10.1029/2007GL031277.
North
Pacific
westward propagating
over
the subtropical
during summer. Geophys. Res. Lett. 34 (2007) L21702,
28
591
[11] Zou L. W., Zhou T. J., Wu B., et al.. The interannual variability of summertime western
592
Pacific subtropical high hindcasted by GAMIL CliPAS experiments. Chinese Journal of
593
Atmospheric Science. 33 (2009) 959~970.
594
[12] Zhang R and Hong M, et al.. Non-linear Dynamic Model Retrieval of Subtropical High
595
Based on Empirical Orthogonal Function and Genetic Algorithm. Applied Mathematics
596
and Mechanics. 27 (2006) 1645-1654.
597
[13] Mei H., Ren Z., J. X. Li et al., Inversion of the Western Pacific Subtropical High
598
Dynamic Model and Analysis of Dynamic Characteristics for its Abnormality. Nonlinear
599
Processes in Geophysics. 20 (2013) 131-142.
600
[14] Mei H., Ren Z., K. F. Liu. Retrieving Dymamic Forecast Model of the Western Pacific
601
Subtropical High in Abnormal Years Based on GA. Acta Phy. Sin. 62(7) (2013)
602
6430-6442.
603
[15] Bai, C. Z., Mei, H., et al.. Evolving an Information Diffusion Model Using a Genetic
604
Algorithm for Monthly River Discharge Time Series Interpolation and Forecasting. J.
605
Hydrometeo, in press, (2014) doi: http://dx.doi.org/10.1175/JHM-D-13-0184.1.
606
[16] Bai, C. Z., Zhang, R., et al.. A new information diffusion modeling technique based on
607
vibrating string equation and its application in natural disaster risk assessment. Int. J.
608
Gen Syst., in press, (2014)
609
610
611
612
doi: http://dx.doi.org/10.1080/03081079.2014.980242.
[17] Cao H X. Self-memorization Equation in Atmospheric Motion. Science in China (Series
B). 36 (1993) 845-855.
[18] Gu X Q. A spectral model based on atmospheric self_memorization principle. Chinese
Science Bulletin. 43 (1998) 1692-1702.
29
613
614
[19] Feng G L, Cao H X, Gao X Q, et al.. Prediction of precipitation during summer monsoon
with self-memorial model. Adv Atmos Sci. 18 (2001) 701–709.
615
[20] Chen X D, Xia J, Xu Q. Differential Hydrological Grey Model(DHGM) with
616
self-memory function and its application to flood forecasting. Sci China Tech Sci. 52
617
(2009) 1039–1049.
618
619
[21] Tao, S. Y., and J. Wei. The westward and northward advance of the subtropical high over
the west Pacific in summer. J. Appl.Meteor. Sci,17 (2006) 513−524.
620
[22] Yu D. D., Zhang R. and Hong M.. A charateristic correlation analysis between the Asia
621
summer monsoon memberships and west Pacific subtropical high. Journal of Tropical
622
Meteorology. 01 (2007) 58-67.
623
624
[23] The Central Meteorological Station forecast group, The long-range weather forecast
technology experience, The Central Meteorological Station Press, Beijing, 1976, pp.5-6.
625
[24] Xue F., Wang H. J. and He J. H.. The influence of Mask Lin high and Australia high
626
interannual variation to the East Asian summer monsoon precipitation. Chinese Science
627
Bulletin. 3 (2003) 19-27.
628
[25] Wang W, Su J Y, Hou B W et al.. Dynamic prediction of building subsidence deformation
629
with data-based mechanistic self-memory model. Chinese Science Bulletin. 57 (2012)
630
3430-3435.
631
632
[26] Takens, F,. Detecting strange attractors in fluid turbulence. Lecture Notes in Mathematics.
898 (1981) 361-381.
633
[27] Palmer T N, Alessandri A and Andersen U, et al.. Development of a European
634
multi-model ensemble system forseasonal to interannual prediction (DEMETER). Bull
30
635
636
637
638
639
640
641
642
643
644
645
646
647
648
Amer Met Soc. 85 (2004) 853-872.
[28] Huang J P and Yi Y H. A Non-linear Dynamic System Reconstructing of Actual data.
Science in China. 3 (1991) 331-336.
[29] Yeh S W, Kug J S and Dewitte B, et al.. El Nino in a changing climate. Nature. 461
(2009) 511-514.
[30] Wang L. Intelligent Optimization Algorithms and Its Application. Tsinghua University
Press, Chendu, 2001, pp. 23-24.
[31] Fraedrich K,. Estimating weather and climate predictability on attractors. J .A tmos.S ci.
44 (1987)722-728
[32] Yoden S, Nomura M,. Finite-time Lyapunov stability analysis and its application to
Atmospheric Predictability. J Atm Sci. 50 (1993) 1531—1543.
[33] Kazantsev E. Local Lyapunov exponents of the quasi–geostrophic ocean dynamics. Appl
Math Comput. 104 (1999) 217—257.
[34] Firdaus
E.
Udwadia,
Huertus
F.von
Bremen.
Computation
of
Lyapunov
649
characteristic exponents for continuous dynamical systems. Z. angew. Math. Phys.
650
(2001) 123-128.
651
652
653
654
53
[35] Hubertus F. von Bremen, Firdaus E. Udwadia, Wlodek Proskurowski. An efficient QR
based method for the computation of Lyapunov exponents. Physica D. 101 (1997) 1-16.
[36] DAVID KIRKPATRICK. OPTIMAL SEARCH IN PLANAR SUBDIVISIONS. SIAM J.
COMPUT. 12 (1983) 28-35.
655
656
657
31
658
List of Figures:
659
Fig.1 SI of summer half-year (from May to October) from 1982 to 2011
660
Fig.2. The average monthly geopotential height field at 500 hPa of July and August in 2010
661
Fig.3 the Lyapunov spectrum of the reconstructed dynamical model
662
Fig.4. the maximu m effect ive computation time changing with time increment
663
limit  (b)
664
Fig.5
dtt (a) and tolerance
The 35 day forecast results of the subtropical high area index
665
666
667
Table captions:
668
Table1. The correlation analysis between SI and three factors
669
Table2. The correlation coefficient and Root mean square errors when the retrospective order
670
different
671
Table3. Co rrelation coefficients and Root mean square errors between forecast value and real value of
672
different events of WPSH abnormal years
p is
673
674
675
676
677
678
679
680
681
682
32
683
Figure:
684
685
686
Fig.1 SI of summer half-year (from May to October) from 1982 to 2011
687
688
689
Fig.2. The average monthly geopotential height field at 500 hPa of July and August in 2010
33
690
691
Fig.3 the Lyapunov spectrum of the reconstructed dynamical model
692
693
34
694
695
(a)
696
697
(b)
35
698
Fig.4. the maximu m effect ive computation time changing with time increment
699
limit  (b)
dtt (a) and tolerance
700
701
Fig.5
The 35 day forecast results of the subtropical high area index
702
703
Table:
704
Table1. The correlat ion coefficient (C.C.) and Root mean square errors (RM SE) when the retrospective
705
order
p
p is different
1
5
6
7
8
9
10
11
12
C.C.
0.38
0.72
0.78
0.81
0.94
0.92
0.87
0.86
0.84
RMSE
27.53
%
14.33
%
13.76
%
13.06
%
3.72%
4.63%
5.01%
6.77%
7.01%
13
0.83
14
0.71
15
0.67
20
0.44
30
0.36
40
0.32
50
0.28
60
0.21
70
0.17
10.98
%
13.24
%
15.50
%
24.13
%
28.91
%
30.22
%
39.11
%
42.18
%
50.24
%
1
5
6
7
8
9
10
11
12
C.C.
0.38
0.72
0.78
0.81
0.94
0.92
0.87
0.86
0.84
RMSE
27.53
14.33
13.76
13.06
3.72%
4.63%
5.01%
6.77%
7.01%
%
%
%
%
13
14
15
20
30
40
50
60
70
0.83
10.98
0.71
13.24
0.67
15.50
0.44
24.13
0.36
28.91
0.32
30.22
0.28
39.11
0.21
42.18
0.17
50.24
p
C.C.
RMSE
p
p
C.C.
RMSE
36
%
%
%
%
%
%
%
%
%
706
707
708
709
710
Table2. The correlation coefficients (C.C.) and Root mean square errors (RMSE)
711
between forecast value and real value of different events of WPSH abnormal years
712
Sta tisti cal
tests
Short term
medium term
long term
(1~15days)
(16~25 days)
(26~35 days)
Forecast
events
C.C.
RM SE
C.C.
RM SE
C.C.
RM SE
0.957
2.92%
0.812
4.51%
0.723
11.91%
0.936
2.88%
0.877
4.72%
0.776
10.98%
0.942
3.16%
0.881
3.97%
0.718
11.48%
0.958
3.40%
0.820
4.06%
0.729
10.90%
SI bigger event1(1998.
06.21 as initial values to
forecast)
SI bigger event2(2006.
07.18 as initial values to
forecast)
SI bigger event3(1987.
07.08 as initial values to
forecast)
SI bigger event4(1983.
08.05 as initial values to
forecast)
37
SI smaller event1(1984.
07.28 as initial values to
0.951
3.12%
0.815
3.43%
0.789
11.21%
0.892
3.47%
0.825
4.97%
0.708
10.76%
0.914
4.52%
0.755
3.83%
0.698
11.87%
0.809
3.97%
0.873
3.89%
0.777
11.70%
0.926
2.07%
0.882
4.85%
0.737
10.17%
0.921
3.28%
0.838
4.25%
0.739
11.22%
forecast)
SI smaller event2(2000.
06.29 as initial values to
forecast)
SI smaller event3(1994.
08.17 as initial values to
forecast)
SI smaller event4(1999.
06.12 as initial values to
forecast)
SI smaller event5(1985.
07.11 as initial values to
forecast)
The average
713
714
715
38