删除或更新信息,请邮件至freekaoyan#163.com(#换成@)

基于机器学习方法重建的过去1000年北半球环状模(NAM)指数

本站小编 Free考研考试/2022-01-03

杨佼1,,
效存德2,1,,,
丁明虎3,1
1. 中国科学院西北生态环境资源研究院, 冰冻圈科学国家重点实验室, 甘肃 兰州 730000
2. 北京师范大学, 地表过程与资源生态国家重点实验室, 北京 100875
3. 中国气象科学研究院, 青藏高原与极地气象科学研究所, 北京 100081

基金项目: 中国科学院(A类)战略性先导科技专项项目(批准号:XDA19070103)、国家重点研发计划项目(批准号:2018YFC1406104)和国家自然科学基金项目(批准号:42071086)共同资助


详细信息
作者简介: 杨佼, 女, 30岁, 助理研究员, 极地冰冻圈与气候变化研究, E-mail: yangjiao@lzb.ac.cn
通讯作者: 效存德, E-mail: cdxiao@bnu.edu.cn
中图分类号: P467

收稿日期:2021-01-29
修回日期:2021-03-21
刊出日期:2021-05-30



Reconstruction of the Northern Annular Mode(NAM) index for the past 1000 years based on machine learning methods

YANG Jiao1,,
XIAO Cunde2,1,,,
DING Minghu3,1
1. State Key Laboratory of Cryospheric Science, Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences, Lanzhou 730000, Gansu
2. State Key Laboratory of Earth Surface Processes and Resource Ecology, Beijing Normal University, Beijing 100875
3. Institute of Tibetan Plateau and Polar Meteorology, Chinese Academy of Meteorological Sciences, Beijing 100081


More Information
Corresponding author: XIAO Cunde,E-mail:cdxiao@bnu.edu.cn
MSC: P467

--> Received Date: 29 January 2021
Revised Date: 21 March 2021
Publish Date: 30 May 2021


摘要
基于机器学习方法重建的过去1000年北半球环状模受限于观测资料的短缺,关于北半球主要大气环流模态(Northern Annular Mode,简称NAM)的演变规律和机理还很不明确。运用树轮、冰芯、沉积物等代用指标重建时间序列更长的NAM指数有助于加深对其演变规律和驱动机制的认识。本文通过评估多种机器学习模型在古气候重建中的适用性,基于PAGES 2k的气候代用指标重建了过去1000年高分辨率(1年)的NAM指数。研究结果表明相比普通线性回归模型和随机森林等模型,CatBoost、极端随机树和主成分回归模型可以有效地避免过拟合,模型具有更高的稳定性和可靠性,其中CatBoost模型的重建结果与器测时段内NAM指数的相关系数最高(R=0.93,p < 0.01),能够更好地拟合NAM指数的量级和峰谷变化。分析过去1000年NAM指数的变化特征,发现NAM具有显著的百年际周期(167.5年)和多年代际周期(32.3年)波动,1950~2000年NAM由负位相转向正位相的速率在过去1000年中前所未有。进一步探究NAM与温度和海冰的关系发现,1850年之前,暖期对应NAM增强,冷期对应NAM减弱;而在1850年之后NAM的多年代际变化与巴伦支-喀拉海海冰范围的变化趋于一致,1950年之后NAM向正位相快速转变可能是温度和北极海冰异常共同影响的结果。
北半球环状模(NAM)/
重建/
代用指标/
机器学习

As the dominant mode of the atmospheric circulation in the Northern Hemisphere(NH), the Northern Annular Mode(NAM) makes important influences on both NH and global climate change. The knowledge of the NAM is limited by the lack of the observed data. The reconstruction of long-term changes of the NAM using proxies such as tree rings, ice cores and sediments can help us understanding the long-term characters and mechanisms of NAM.
In this study, we compare eight reconstruction models, including Original Linear Regression(OR), Principle Composite Regression(PCR), Ridge, Support Vector Regression(SVR), Random Forest(RF), Extremely Randomized Trees(ET), Light Gradient Boosting Machine(LightGBM) and CatBoost, to estimate their applicability in NAM reconstruction. And further reconstruct the annual NAM index for the past 1000 years based on the climate proxies obtained from PAGES2k. By removing missing and anomalous data, 54 proxies(obtained from 22 tree ring and lake sediment records) can be used for NAM reconstruction were selected after filtering by feature engineering and correlation analysis. Compared with the OR and RF methods, the results show that the CatBoost, ET and PCR model can effectively avoid overfitting problems and better reconstruct the variability of the NAM over the instrumental period (1948~2000). The CatBoost reconstruction has the highest correlation coefficient(R=0.93, p < 0.01) with the NAM index over the instrumental period and is able to better fit the magnitude and peak-to-valley variability of the NAM index. Even the reconstruction series of the OR and RF are significantly correlated with the instrumental NAM index from 1948 A.D. to 2000 A.D., the results of the cross validation show the high correlation in the calibration period (1967~2000) and very low correlation in the validation period(1949~2000), and the Nash-Sutcliffe coefficient of efficiency(NSCE) are both negative in validation period. This suggests there is a high risk of overfitting when use the two models. The reconstructions of OR and Ridge are highly variable in amplitude because of the high sensitivity to noises in proxies, which may produce spurious estimates. The reconstructions of SVR and LightGBM, which their NSCE are lower than 0.1 in validation period, are also less reliable because of the high sensitivity to the noise of proxies or the insufficient amount of data. We recommend the CatBoost and ET as the preferred machine learning models for paleoclimate reconstruction.
The ensemble empirical mode decomposition(EEMD) is used to decompose of the NAM series during the past 1000 years to obtain its multi-scales characters and nonlinear trends. The results suggest that the NAM has a significant centennial(167.5 a) and multidecadal(32.3 a) oscillations. The positive shift of NAM from 1950 A.D. to 2000 A.D. is unprecedented in the past 1000 years. We further found that prior to 1850 A.D., the multi-decadal variability of NAM was mainly influenced by the global mean temperature, with warm periods corresponding to an enhanced NAM and cold periods corresponding to a weakened NAM. While after 1850A.D., the multi-decadal changes of NAM tend to coincide with sea ice extent in the Barents-Kara Sea. We infer that rapid shift of the NAM towards a positive phase occurred after 1950s is probably influenced by the combined effect of the anomalies in both temperature and Arctic sea ice.
Northern Annular Mode(NAM)/
reconstruction/
proxies/
machine learning



PDF全文下载地址:

http://www.dsjyj.com.cn/data/article/export-pdf?id=95df1aa8-53d7-487e-b13c-78a6a036c5fb
相关话题/指标 北京 实验室 中国科学院 生态