HTML
--> --> --> -->2.1. Observational data
Changes in instruments, station relocations, observational practice, processing procedures, and other issues may result in spurious changes and discontinuities in raw near-surface RH records (Zhu et al., 2015). These discontinuities can greatly affect the estimation of the interannual variations and long-term trends in RH. Li et al. (2020b) homogenized these daily RH records at 746 National Reference and Basic Stations in the Chinese mainland using the Multiple Analysis of Series for Homogenization (MASH) method (Szentimrey, 1999). This method has been widely used to detect and correct inhomogeneities by not assuming reference series are homogeneous (Li et al., 2018). The advantage of this procedure is that possible break points in raw climate series can be detected and adjusted through mutual comparisons of series within the same climatic area. The new version, MASHv3.03, was developed for homogenization of daily series as well as for quality control of daily data and missing daily data completion. After this homogenization, the surface RH exhibits not only temporally more consistent long-term variations but also demonstrates spatially more coherent long-term trends than methods without homogenization. The homogenized data over China are now updated to 2018, and the observational stations are shown in Fig 1. The density of these observational stations is higher in eastern China than in western China, likely resulting in more accurate estimates of regional mean RH for the east than for the west. In addition, the monthly precipitation from the Global Precipitation Climatology Centre (GPCC, Rudolf et al., 2010) in China is used to investigate the correlation between the precipitation and RH.Figure1. Geographic distribution of the 746 meteorological stations (red dots) over China used in this study and the three subregions: region I as East China (with elevation lower than 1 km), region II as Northwest China (with elevation of 1?3 km), and region III as the Tibetan Plateau (with elevation over 3 km). The lower right-hand illustration shows the South China Sea.
2
2.2. Reanalysis products
The reanalysis RH data are obtained from six products: CRA-40, CFSR, ERA5, ERA-Interim, JRA-55, and MERRA-2, which have substantially improved spatiotemporal resolution compared with previous versions of the products. Table 1 summarizes the basic information about these reanalyses, including reference information. The ERA5 has the highest spatial resolution with a grid size of 0.25° × 0.25°, followed by the CFSR and ERA-Interim (0.5° × 0.5°). The CRA-40 has the lowest resolution with a grid size of 1.0° × 1.0°.Name | Agency | Horizontal resolution | Temporal coverage | Assimilation method | Reference |
CRA-40 | CMA | 0.312° × 0.312° | 1979?2018 | 3-D-Var | Wang et al. (2018) |
CFSR | NCEP | 0.5° × 0.5° | 1979?present | 3-D-Var | Saha et al. (2010) |
ERA5 | ECMWF | 0.25° × 0.25° | 1958?present | 4-D-Var | Hersbach et al. (2019) |
ERA-Interim | ECMWF | 0.5° × 0.5° | 1979?2019 | 4-D-Var | Dee et al. (2011) |
JRA-55 | JMA | 0.5625° × 0.5625° | 1958?present | 4-D-Var | Kobayashi et al. (2015) |
MERRA-2 | NASA | 0.6667° × 0.5° | 1980?present | 3-D-Var | Gelaro et al. (2017) |
Table1. The six reanalyses used in this study.
The CRA-40 reanalysis is based on the Global System Model (GSM) in the Global Forecast System (GFS) of NCEP and the Gridpoint Statistical Interpolation (GSI) 3-D-Var assimilation system. The original output has a spatial resolution of 0.312° × 0.312°, with 64 vertical levels. Besides more traditional observations and satellite datasets collected over East Asia, the CRA-40 has also assimilated more observation data of the Integrated Global Meteorological Observation Archive from Aircraft (IGMOAA), mainly adding more Chinese Aircraft Meteorological Data Relay (AMDAR) data. Complex quality control procedures used by NCEP for aircraft observations are applied to detect data errors (Liao et al., 2021).
Although each reanalysis was developed with distinct model physics and resolutions to meet specific goals, nearly all of the reanalyses assimilate satellite observations (mainly after 1979) using either a 3-D or a 4-D variational assimilation technique with a constant error covariance matrix for the first-guess fields throughout the reanalysis period. It should also be noted that the near surface RH is not archived directly in the ERA datasets (ERA5 and ERA-Interim), thus, the relative humidity is calculated from near-surface temperature (T) and dew point temperature (Td): RH=es(Td)/es(T) × 100, es is the saturation vapor pressure at a certain temperature.
2
2.3. Data processing
To facilitate the evaluations and comparisons, both the monthly mean and anomalies (relative to the 1979?2018 climatology) are remapped onto a common 1° × 1° grid using bilinear interpolation for reanalyses and using the Cressman analysis method (Cressman, 1959) for regridding the station observations, respectively. The Cressman interpolation technique with a maximum search radius of 1000 km was used in the gridding of the observational data. Such a radius often represents the typical correlation distance in monthly air temperature and humidity fields and thus is commonly used in gridding temperature and humidity monthly anomaly data (e.g., Dai et al., 2011). We emphasize that the observational stations are relatively sparse over some areas, such as the western Tibetan Plateau, compared with those in eastern China; thus, the interpolated RH values are less reliable over these regions. By averaging the gridded datasets using the grid-box area as the weighting, the regional mean values are obtained. To facilitate the comparison in subregions with different topography, mainland China was divided into three subregions as in our previous studies (e.g., Zhao et al., 2015; Zhang and Zhao, 2019), which are East China as region I (with elevations lower than 1000 m), Northwest China as region II (with elevations of 1000?3000 m), and Tibetan Plateau as region III (with elevations over 3000 m).To further examine the impact of the verifying grids on the comparison, both the ranalyses and the observations were also regridded into 0.5° × 0.5° grids. We compared the analysis results between the 1° × 1° and 0.5° × 0.5° grids. The result suggests that differences in the long-term mean, variations, and changes in surface RH between reanalysis products and the observations at two different grids are negligible for a large scale. Thus, the following comparisons are based on a 1° × 1° grid.
2
2.4. Basic statistics
Using the monthly mean and anomalies from the observations and the reanalyses on the common 1° × 1° grid, some basic statistics were calculated to assess the performance of each reanalysis product in describing the observed RH mean climatology, spatiotemporal variations, and long-term trends over China during the period of 1979?2018.The mean bias (B) is defined to assess the absolute difference between the observations and reanalyses averaged during the study period at each grid box:
where Y and X denote the monthly reanalyses and observed RH, respectively, and N is the number of data points during the study period.
The root-mean-square error (RMSE) is defined to measure the average magnitude of the error with a focus on the extreme values:
The Pearson correlation coefficient (r) is used to examine the consistency between the observations and reanalyses:
where
The statistical significance of the RH trend was tested using the Mann-Kendall Tau nonparametric technique (Mann, 1945; Kendall, 1975). The Mann-Kendall statistic (S) is calculated as:
where n is the number of data points and V(S) is the variance of S. m is the number of tied groups, and tk denotes the number of ties for group k. In cases where the sample size n>10, the standard normal test statistic ZS is calculated as:
If ZS is greater than zero, it means an upward trend; if ZS is less than zero, it means a downward trend. At the 5% and 1% significance levels, the null hypothesis of no trend is rejected if |ZS|>1.96 and |ZS|>2.576, respectively.
To explore the differences of variability, skewness, and shape of RH distributions from the observations and reanalysis products, we use smoothed histograms estimated from the monthly RH anomalies at all given grids for China as a whole and three subregions. The histograms are derived using cubic spline fitting smoothed during 1979?2018 (1980?2018 for the MERRA-2). The monthly anomalies of RH are normalized at each grid box using the standard deviation of the 1979?2018 period (1980?2018 for the MERRA-2) before regional averaging. In addition, an empirical orthogonal function (EOF) analysis is utilized to examine how well each current reanalysis can capture the spatiotemporal variations and the leading modes displayed by observed surface RH.
-->
3.1. Climatological bias
Figure 2 displays the spatial distribution of the differences between the reanalysis and observed surface RH for the annual, warm season (April to September), and cold season (October to March) means over China from 1979 to 2018 (1980?2018 for the MERRA-2). It is clear that most reanalyses overestimate the observations by 15%?30% over the Tibetan Plateau, while they underestimate the observations by 5%?10% over most of northern China. In particular, wet biases in the CRA-40, CFSR, and MERRA-2 can reach from –5% to –25% over northwestern China. Relative to the observed long-term means over eastern China, the biases from the ERA5, ERA-Interim, and JRA-55 are generally within ~5% (Figs. 2g-o), while wet biases from ~5% to 10% exist in the CFSR and MERRA-2 over southeastern China (Figs. 2d-f and p-r). Among these reanalysis products, the CRA-40 is the driest compared to the observations with a bias of ~5%?25% over most of China, except the Tibetan Plateau and the northern part of northeastern China (Figs. 2a-c). The large wet biases over southwestern China likely result from the fact that the observed stations are relatively sparse over the Tibetan Plateau and that the gridded data were interpolated from the surrounding stations with lower elevations, which leads to less reliable interpolated RH values over these regions. From the perspective of the national regional average (Table 2), the RH from the MERRA-2 has a lower bias than that from the other reanalyses, especially for the annual and warm season. In contrast, the CFSR has a larger bias, especially for the annual and cold season. Table 2 also suggests that most reanalysis products are successful in reproducing the spatial variations of the observations with pattern correlations of 0.67?0.92. Among the six reanalysis products, the MERRA-2 is the best to describe the observed spatial variations, not only in annual but also in warm and cold seasons (with a pattern correlation of 0.83?0.92). Although a similar assimilation system is utilized for the CFSR and CRA-40, the CFSR is the worst in reproducing the observed spatial pattern (with a pattern correlation of 0.67?0.84). Furthermore, the pattern correlations in the warm season are generally higher than those in the cold season.Figure2. Bias distributions of long-term mean surface RH between reanalyses and observations (%; reanalyses minus observations) during 1979?2018 (1980?2018 for the MERRA-2). Biases of (a?c) CRA-40, (d?f) CFSR, (g?i) ERA5, (j?l) ERA-Interim, (m?o) JRA-55, (p?r) MERRA-2 for annual (left), warm season (middle), and cold season (right) over China.
CRA-40 | CFSR | ERA5 | ERA-Interim | JRA-55 | MERRA-2 | ||
Pattern correlation | Annual | 0.80 | 0.78 | 0.82 | 0.83 | 0.84 | 0.89 |
Warm season | 0.86 | 0.84 | 0.83 | 0.84 | 0.89 | 0.92 | |
Cold season | 0.68 | 0.67 | 0.80 | 0.83 | 0.80 | 0.83 | |
Biases (%) | Annual | ?1.62 | 7.64 | 2.72 | 3.14 | 4.47 | 0.63 |
Warm season | ?2.78 | 2.43 | 2.82 | 3.12 | 4.08 | ?1.89 | |
Cold season | ?0.51 | 12.81 | 2.63 | 3.18 | 4.87 | 3.13 |
Table2. The pattern correlations of the reanalyses with their regional-averaged biases (reanalysis minus observation) for annual, warm season (April to September), and cold season (October to March) over China during 1979?2018 (1980?2018 for the MERRA-2). All pattern correlations are significant according to the two-sided Student's t test (p < 0.01).
Figure 3 shows that the RMSE of the reanalyses with respect to the observations is about 2%-4% for most of eastern China but over 10% in most of northwestern and southwestern China, largely following the terrain features of China. In particular, the RMSE reaches 20% or more over the Tibetan Plateau, where the observed RH is relatively less reliable due to sparse observations. The CRA-40, CFSR, and MERRA-2 show larger RMSE over most of China compared to the other products. This suggests that despite assimilating more observations over China, the newer-generation CRA-40 reanalysis dose not clearly show reduced RMSE in surface humidity over China.
Figure3. Spatial distributions of root-mean-square error (RMSE, %) of the monthly surface RH between the reanalyses and the observations during 1979?2018 (1980?2018 for the MERRA-2). RMSE for (a?c) CRA-40, (d?f) CFSR, (g?i) ERA5, (j?l) ERA-Interim, (m?o) JRA-55, (p?r) MERRA-2 for annual (left), warm season (middle), and cold season (right) over China.
The zonal and meridional means of monthly mean surface RH show a consistent variation along the west?east and south?north directions (Fig. 4). The driest air with RH of ~35%?55% is seen in the hinterland of the Taklimakan Desert (40°N, 85°E). In addition, most reanalyses are wetter than observed in the areas north of 50°N, east of 120°E (located in northeastern China), south of 35°N (including the Tibetan Plateau and south of the Yangtze River), and west of 80°E (Pamir Plateau) but are drier than the observations in the northern areas of the Yangtze River (35°?50°N, 105°?115°E). This is consistent with the biases and RMSE as shown in Figs. 2 and 3.
Figure4. (a) Zonal and (b) meridional average of monthly mean surface RH from the observations and the reanalyses for the 1979?2018 period (1980?2018 for the MERRA-2) over China.
Figure 5 presents the month?years evolution of the surface RH for the observations and reanalyses averaged over China during 1979?2018 (1980?2018 for the MERRA-2). It can be seen that the temporal evolution in the surface RH from most reanalyses is broadly consistent with the observed result (r ≥ 0.75; except for the CFSR). The RH is often greatest in August, declining to its lowest in April. In particular, the ERA5, ERA-Interim, and JRA-55 are more consistent (r ≥ 0.89) with the observations than the other products. By contrast, the CFSR still performs poorly in presenting the month?years evolution seen in the observed surface RH with a pattern correlation of 0.27 (Fig. 5c).
Figure5. Month-year evolution of the surface RH for the observations and the reanalyses averaged over China from 1979 to 2018 (1980?2018 for the MERRA-2). The pattern correlation (R) between the reanalyses and the observations is also shown in (b?g). The pattern correlations of each reanalysis are statistically significant at the 5% level.
Histograms are often used to demonstrate how the variability, the skewness, or the shape of the distribution of a climate variable may change in a particular region over a specific period (Cubasch et al. 2013). Figure 6 exhibits the histograms of regionally averaged surface RH anomalies smoothed using cubic spline fitting. To improve the spatial comparability, the local RH anomalies are first normalized by their climatology for 1979?2018 (1980?2018 for the MERRA-2). It is clear that the RH anomalies largely follows a Gaussian distribution, and the shapes of the smoothed histograms from reanalyses are quite consistent with the observations for the whole of China and the three subregions. Consistent with previous results, the shapes of the CFSR histograms are flatter, with an increased spread and reduced peak, than those derived from the other reanalyses and observations; this is especially clear in the Tibetan Plateau.
Figure6. Histograms of the monthly RH anomalies for all grid boxes over (a) the whole of China and (b?d) the three subregions outlined in Fig. 1. The histograms are derived using cubic spline fitting smoothed during 1979?2018 (1980?2018 for the MERRA-2). The monthly anomalies of RH are normalized at each grid box using the standard deviation of the 1979?2018 period (1980?2018 for the MERRA-2) before regional averaging. The rate of RH anomaly is binned at 1% intervals.
2
3.2. Correlation and trend analysis
Figure 7 shows the spatial distribution of the correlations between the reanalyses and the observed monthly RH anomalies over China for 1979?2018 (1980?2018 for the MERRA-2). It is clear that most reanalyses perform better in representing the temporal variations of the observed RH in eastern China (r > 0.60), especially over areas around the lower reaches of the Yangtze River and the Yellow River (r > 0.70). In particular, the ERA5 and JRA-55 both are more highly correlated with the observations (r > 0.90) than the rest of reanalysis products over most of eastern China. As shown by the above analysis, the CFSR is the least correlated to the observations over western China, especially for the Tibetan Plateau.Figure7. (a?f) Spatial distribution of the correlations between the observations and the reanalysis RH anomalies during 1979?2018 (1980?2018 for the MERRA-2) over China and (g) correlations of the regional mean RH anomalies. The stippled areas in (a?f) indicate statistical significance at the 5% level.
To further examine the performance of the current reanalysis products in capturing surface RH, we utilized the GPCC precipitation dataset to evaluate the correlation between precipitation and surface RH. The correlations between the observed monthly precipitation and surface RH anomalies are more than 0.5 for most of China. In particular, the correlations are more than 0.6 in the northwest part of Xinjiang province and some parts of northeastern and southeastern China (Fig. 8a). Among these reanalysis products, the surface RH from the MERRA-2 is more strongly correlated (r = 0.5?0.8) with the observed precipitation than the other products for most of China (Fig. 8g). Except the MERRA-2, the surface RH anomalies from other reanalysis products are weakly correlated with the observed precipitation over most of China (r ≤ 0.40), especially for many areas of northwestern China and the Tibetan Plateau.
Figure8. (a?g) Spatial distribution of the correlations between the GPCC precipitation and reanalysis RH during 1979?2018 (1980?2018 for the MERRA-2) over China, and (h) correlation coefficients of the regional-average precipitation and RH anomalies. The stippled areas in (a?g) indicate statistical significance at the 5% level
To examine the interannual variations and long-term changes in RH, Fig. 9 displays the time series of the surface RH annual anomalies from the observations and reanalyses averaged over the whole of China and the three subregions (cf. Fig. 1). It is clear that the surface RH interannual variations in most reanalysis products, except the CFSR, are highly consistent with the observations over China. The CRA-40, ERA5, and JRA-55 are better than other reanalyses in describing the observed RH variations over the whole of China and the three subregions (r = 0.42?0.83; Table 3); the CRA-40 performs especially well over eastern China (r = 0.83). Consistent with Fig. 8, the CSFR and ERA-Interim are less correlated with the observed series than the other reanalyses over northern China and the Tibetan Plateau (i.e., regions II and III; Table 3). In terms of the three subregions, relatively low correlations between the observations and the reanalyses are seen over the Tibetan Plateau (Table 3). In contrast with the GPCC precipitation, the surface RH changes from most of the reanalysis products (except the CFSR) are highly consistent with the observed precipitation changes, and the MERRA-2 performs best among these reanalyses in depicting the long-term changes in the observed precipitation with a correlation of 0.53?0.62.
Figure9. Time series of annual RH anomalies for the observations and the reanalyses averaged over (a) the whole of China and (b?d) the three subregions outlined in Fig. 1 during 1979?2018 (1980?2018 for the MERRA-2). The anomalies (%) of observed precipitation (Pre) are also shown in cyan lines.
OBS | CRA-40 | CFSR | ERA5 | ERA-Interim | JRA-55 | MERRA-2 | ||
Whole China | R | ? | 0.68* | ?0.10 | 0.42* | 0.15 | 0.55* | 0.38 |
trend | 0.25 | ?0.12 | ?0.82* | ?0.56* | ?1.02* | ?0.32 | 0.38 | |
Region I (East) | R | ? | 0.83* | 0.65* | 0.69* | 0.56* | 0.73* | 0.50* |
trend | 0.02 | ?0.28 | ?0.44 | ?0.80* | ?0.96* | ?0.64* | 0.17 | |
Region II (Northwest) | R | ? | 0.78* | 0.01 | 0.62* | 0.40 | 0.76* | 0.61* |
trend | 0.22 | 0.07 | ?1.06* | ?0.78* | ?1.23* | ?0.17 | 0.18 | |
Region III (Tibetan Plateau) | R | ? | 0.58* | ?0.24 | 0.56* | 0.20 | 0.67* | 0.33 |
trend | 0.83* | 0.17 | ?1.90* | ?0.18 | ?0.59* | ?0.01 | 1.28* |
Table3. The correlation coefficients (R) of the monthly surface RH anomalies between the observations and the reanalyses averaged over the whole of China and the three subregions outlined in Fig. 1, along with the regional mean trends [% (10 yr)–1)]estimated from the monthly surface RH anomalies. Numbers in bold are statistically significant at the 5% level, while the numbers denoted with * are statistically significant at the 1% level according to the two-sided Student's t test.
Figure 10 shows the spatial patterns of the long-term RH linear trend derived from the reanalyses and the observations over China during the study period. Largely following the Clausius-Clapeyron equation (Trenberth et al., 2003, 2005), the observed RH changes are often small for most of China (Fig. 10a). This is broadly consistent with the atmospheric RH long-term changes derived from the radiosonde data (Zhao et al., 2011, 2015). However, none of the reanalyses can reproduce the observed trend pattern well (pattern correlations are less than 0.38), and the reanalyses themselves differ substantially among themselves. In contrast to the observed weak trends, spurious drying trends over eastern China are seen in the CFSR, ERA5, ERA-Interim, and JRA-55, even though they perform well in depicting the observed RH variations in these areas. The MERRA-2 does relatively better to roughly capture the observed RH trend patterns with comparable magnitudes of upward trends over most of China (with a significant pattern correlation of 0.38) and for the three subregions (Table 3). Broadly consistent with the long-term changes in atmospheric precipitable water (Zhao et al., 2015), the newer-generation reanalyses are hardly to capture the long-term trends in the observed surface RH changes over China.
Figure10. Distributions of the linear trends [% (10 yr)?1] of monthly RH anomalies estimated from the observations and reanalyses over China during 1979?2018 (1980?2018 for the MERRA-2). The stippled areas indicate the trend is statistically significant at the 5% level. The pattern correlation (R) between the reanalyses and the observations is also shown in (b?g). The bold numbers are statistically significant at the 5% level.
In summary, most reanalysis products can describe the interannual variations in observed surface RH over eastern and northern China well but exhibit relatively poor performance over western China, especially over the Tibetan Plateau. Few reanalyses can capture the observed long-term RH trends, and the reanalyses themselves differ substantially from each other. In general, the CRA-40, a newer-generation reanalysis, performs well in depicting the interannual variations seen in the observed surface RH over China since 1979.
2
3.3. EOF analysis
To provide further insight into how well the reanalyses can capture the spatial and temporal variations of the observed surface RH, an EOF analysis was conducted based on the monthly RH anomalies (normalized by the local standard deviation and square root of the cosine of the latitude). Figures 11 and 12 display the spatial patterns and the principal components (PCs) for the first two leading EOFs of the observations and reanalyses, which together account for 33.3%?39.1% of the total variance.Figure11. (a?g) First leading EOF and (h) its corresponding nine-point moving average principal component (PC) time series derived from observed and reanalysis RH monthly anomalies for 1979?2018 (1980?2018 for the MERRA-2). The monthly RH anomalies are normalized by the local standard deviation and square root of the cosine of the latitude in each grid box, respectively, before the EOF analysis. The explained percentage variance (Var) is shown in (a?g). Both the (R) pattern correlations are shown in (b?g), and the PC correlations (R1) between the observations and the reanalyses are shown in (h). The bold numbers are statistically significant at the 5% level.
Figure12. As in Fig. 11, but for the second leading EOF and its PC time series.
As shown in Fig. 11, the first EOF modes derived from the observations and reanalyses all show patterns with the same sign over the whole of China and have the largest magnitudes contributed from central East China. The PC time series of this mode depicts the main feature of the nationwide-average RH anomalies, which is largely related to the long-term variations of RH shown in Fig. 9a. Overall, the first EOF modes derived from the reanalyses, except the CFSR, are consistent with EOF1 derived from the observations, with pattern correlations of 0.81?0.93 and temporal correlations of 0.38?0.86, despite slight regional discrepancy. For instance, compared with the observations, the CFSR and ERA-Interim show obvious negative surface RH variations in northeastern China, which is not seen in other reanalysis products.
The second EOF derived from the observations shows a robust dipole mode (i.e., anticorrelated) between the Tibetan Plateau and northeastern China (Figs. 12a-g), with the temporal coefficient (PC2) showing mostly multiyear variations and a small decreasing trend (Fig. 12h). The observation-derived PC2 is significantly correlated with the El Ni?o?Southern Oscillation index with the PC2 lagging the indices by six months. This suggests that during warm El Ni?o events, RH tends to be above normal over eastern China and below normal over the rest of China. The corresponding modes and PC time series derived from most of the reanalyses, except the CFSR and ERA-Interim, can roughly capture the spatial and temporal variations represented by EOF2 derived from the observations, with significant pattern correlations of 0.68?0.92 and temporal correlations of 0.55?0.76. Among the six reanalyses, the CRA-40 outperforms the other reanalysis products in depicting the spatiotemporal variations revealed by EOF2 from the observations, with a high pattern correlation of 0.92 and temporal correlation of 0.76.