1.Key Laboratory of Regional Climate-Environment in Temperate East Asia, Institute of Atmospheric Physics, Beijing 100029, China 2.University of the Chinese Academy of Sciences, Beijing 100049, China 3.National Meteorological Information Center, China Meteorological Administration, Beijing 100081, China 4.Climatic Research Unit, University of East Anglia, Norwich, Norfolk, NR4 7TJ, United Kingdom 5.Center of Excellence for Climate Change Research/Department of Meteorology, King Abdulaziz University, Jeddah 21589, Saudi Arabia Manuscript received: 2017-11-06 Manuscript revised: 2018-02-24 Manuscript accepted: 2018-03-04 Abstract:A set of homogenized monthly mean surface air temperature (SAT) series at 32 stations in China back to the 19th century had previously been developed based on the RHtest method by Cao et al., but some inhomogeneities remained in the dataset. The present study produces a further-adjusted and updated dataset based on the Multiple Analysis of Series for Homogenization (MASH) method. The MASH procedure detects 33 monthly temperature records as erroneous outliers and 152 meaningful break points in the monthly SAT series since 1924 at 28 stations. The inhomogeneous parts are then adjusted relative to the latest homogeneous part of the series. The new data show significant warming trends during 1924-2016 at all the stations, ranging from 0.48 to 3.57°C (100 yr)-1, with a regional mean trend of 1.65°C (100 yr)-1; whereas, the previous results ranged from a slight cooling at two stations to considerable warming, up to 4.5°C (100 yr)-1. It is suggested that the further-adjusted data are a better representation of the large-scale pattern of climate change in the region for the past century. The new data are available online at http://www.dx.doi.org/10.11922/sciencedb.516. Keywords: homogenization, Multiple Analysis of series for homogenization (MASH), monthly temperature series, long-term trend, China 摘要:长期的均一化气温观测序列对于气候变化的准确评估和归因至关重要. 然而, 我国多数气象台站不可避免地受到了台站迁址、仪器换型、环境变迁等非自然因素的影响, 造成多数观测序列中存在非均一性. 近几年, 曹丽娟等人利用RHtest方法建立了百年来中国32站均一化逐月气温序列集, 改善了气候变化研究的数据基础. 但这套数据集中仍然存在非均一性, 主要原因有:过于严格的数据处理先决条件, 如:检测到的间断点必须有元数据支持;1950年之前多数台站在订正时无参考序列;不完整的元数据信息, 特别是1950年之前, 这可能使得一些间断点被忽略的可能性进一步增大. 为此, 本研究基于MASH方法对这套数据集中中国中东部28个台站进行了进一步的非均一性订正. 结果表明:1924-2016年间28个台站逐月气温记录中, MASH方法检测到33个月值气温记录异常值和152个有意义的间断点. 根据MASH估计的逐月气温非均一性值, 对5673个月值气温记录做了进一步修正, 调整原则是将气温序列中非均一记录订正到最近时段序列水平上. 通过对比发现, 进一步订正后28个台站1924-2016年年平均气温记录均呈增温趋势且变化趋势范围减小(0.48℃/100年 - 3.57℃/100年), 而之前数据中长沙和南京站呈现出与周边站不一致的降温趋势(?0.23℃/100年- 4.02℃/100年). 进一步订正的数据能更好地代表中国过去百年大尺度气候变化空间格局. 关键词:均一化, 逐月气温序列, 序列均一化多元分析(MASH), 长期趋势, 中国
HTML
--> --> --> -->
2.1. Updated SAT series
The monthly SAT series at 32 stations from the start of observation to 2015, homogenized by (Cao et al., 2017), are available from the China Meteorological Data Service Center (CMDC, http://data.cma.cn/). We updated the time series with instrumental temperature records in 2016 collected from the CMDC, Hong Kong Observatory, Macao Meteorological and Geophysical Bureau, and Central Weather Bureau of Taiwan. The dataset that was updated is hereafter referred to as the "previous dataset". The number of stations increased from 1 in 1873 to 28 in 1924 and 32 in 1942. To avoid using the early period of scarce data to facilitate application of the MASH software (Szentimrey, 1999), we applied MASH to the 28 stations with continuous records since 1924. The basic information on the stations is listed in Table 1.
2 2.2. MASH -->
2.2. MASH
MASH is an iterative procedure designed to detect and adjust possible break points through mutual comparisons of a number of series with similar climate variability based on statistical tests of hypotheses at a given significance level. Any series is not necessarily homogeneous. Several difference series are constructed from the candidate and weighted reference series. The optimal weighting is determined by minimizing the variance of the difference series, in order to increase the efficiency of the statistical tests. The inhomogeneity of the difference series can be characterized by the test statistic, which should be smaller than the critical value via a Monte Carlo method and cases of homogeneity at the given significance level. MASH has been widely applied to homogenize climate data in many studies worldwide (Manton et al., 2001; Lakatos et al., 2008; Rasol et al., 2008; Birsan and Dumitrescu, 2014). It has also been applied to homogenize temperature series in China, and proved a suitable technique via a number of applications of the homogenized data (Li and Yan, 2009; Li et al., 2015b, 2016). In the present study, the latest version of MASH (v3.03) was used. Different from previous versions, MASH v3.03 starts with a preliminary examination of the annual series and uses the detected breaks as preliminary information (used as proxy metadata) for the standard procedure of MASH for monthly data. The new developments of automatic procedures make the homogenization easier for the end user. The fourth is some developments for daily data, including some new program procedures for missing data completion and data quality control. The mathematical and technical details are introduced in the online manual at http://www.met.hu/en/omsz/rendezvenyek/homogenization_and_interpolation /software/. The additive model is applied to temperature series underlying a normal distribution. The significance level for testing break points via the Monte Carlo method is α=0.01. The reference system of nine nearby stations for each candidate station is determined based on their distances to the candidate station. The inhomogeneous sections are adjusted to the latest homogeneous part of the SAT series. A linear trend is estimated via the least-squares linear fitting method, to assess the long-term change in the SAT series. The t-test is used to assess the significance of the trend at α=0.05.
-->
3.1. Outliers
Figure 1a shows the number of potential outliers in the previous data for each month at each station, estimated by the MASH procedure. To facilitate discussion, we define an erroneous outlier in the present study if the potential outlier exhibits an inhomogeneous shift from the neighboring year larger than 1.5°C. There are 33 monthly temperature records from 12 stations detected as erroneous outliers. The station HHHT in northern China contains the most (nine outliers in six monthly series). There are no outliers at the other 16 stations. Figure1. (a) Number of erroneous outliers in the SAT records for each month at each station. (b, c) Outlier case for January at XM: (b) SAT anomalies in the 1971-2000 climatology for January at XM and nine reference stations in the previous data; (c) the previous and new January SAT series at XM.
To highlight what an outlier is, we take an example of SAT for January 1951 at XM station. As Fig. 1b shows, the SAT record of January 1951 at XM is of an anomaly larger than 5°C, far beyond the average of those at the nine reference stations, which are all negative anomalies for the same month. Figure 1c shows the new SAT series compared with the previous for January at XM, highlighting the inhomogeneous record in 1951. Obviously, the further-adjusted SAT series for January at XM becomes consistent with those at the reference stations. The new series has a warming trend of 1.67°C (100 yr)-1, compared with 1.52°C (100 yr)-1 in the previous data. Figure2. (a) Number of break points in the SAT records for each month at each station. (b, c) Inhomogeneity case for the annual SAT series at NJ: (b) previous annual SAT anomalies at NJ and nine reference stations; (c) previous and new annual SAT series at NJ. (d) PDF of all the monthly adjustments based on MASH.
Figure3. Linear trends in the annual SAT series at 28 stations during 1924-2016 based on the (a) previous and (b) new data. (c) Regional mean series compared between the previous and new data.
2 3.2. Break points -->
3.2. Break points
Figure 2a shows the number of inhomogeneous break points in the SAT series for each month at each station during 1924-2016. To aid understanding, we set those with an inhomogeneous shift larger than 0.5°C as a meaningful break point. There are 152 meaningful break points in the monthly SAT series at 26 stations. The MC station has the most (24 break points in 10 monthly series). There is no break point detected for the HC and TN stations in Taiwan. To help understand the meaningful inhomogeneous biases, we draw attention to the SAT records around the 1940s at NJ. Figure 2b shows the annual SAT anomalies (from the 1971-2000 mean climatology) during 1924-2016 at NJ and nine reference stations from the previous dataset. Obviously, there are unusual warm peaks around the 1940s and the earlier years at NJ, compared with the SAT anomalies at the surrounding reference stations. Figure 2c compares the adjusted series with the previous one. The new series becomes more coherent with the surrounding series around the 1940s. The inhomogeneous biases in the previous data exist mainly before the 1950s. The new SAT series at NJ shows a warming trend of 0.82°C (100 yr)-1, compared with -0.23°C (100 yr)-1 based on the previous data. As discussed in Cao et al. (2013, 2017), many stations moved from a city to a rural location, causing a drop in temperature records around that time; and hence, most of the adjusted series showed an enhanced warming trend. However, there were no metadata for Nanjing around these early times, and hence no adjustment was made for this station by (Cao et al., 2013).
2 3.3. Probability distribution of inhomogeneous biases -->
3.3. Probability distribution of inhomogeneous biases
There are 5673 monthly SAT records adjusted based on MASH, of which 3358 are of an absolute value larger than 0.5°C, about 10% of the total monthly records. Figure 2d shows the probability density function (PDF) of the monthly adjustments. A majority (4986 or 88%) of the adjustments are between -1°C and 1°C, with two probability peaks around -0.5°C and 0.5°C, respectively. As most of the adjustments are for the early period, they should influence the estimation of the long-term trend in the climate series.