HTML
--> --> --> -->2.1. Meteorological observation data
Meteorological observations began in March 1989, and so far, 32 years of conventional ground meteorological observation data have been accumulated at Zhongshan Station, Antarctica. To ensure the accuracy and quality of observation data, the observation instruments and methods and the accuracy of the ground observation system have been operated in accordance with the ground meteorological observation standards of the China Meteorological Administration (Ground meteorological Observation Standards of China Meteorological Administration). Before February 2002, instrumental observations were recorded manually four times per day (except sunshine duration, which was recorded hourly) at Zhongshan Station; after February 2002, the observation mode changed to an hourly wired telemetry automatic observation system. For more information on the meteorological variables and current sensor types, please see Table 1. All the sensor sampling intervals were changed to 1-min intervals, and the observation data were recorded 24 hours per day. During the instrument replacement period, parallel observations were performed for at least three months. After filtering the abnormal values, homogenization and quality control were also performed by the China Meteorological Administration. Then, the quality controlled and homogeneous observation data were used for data processing and model construction.Variable | Unit | Definition | Sensor type |
SP | hPa | Atmospheric pressure | Vaisala PTB220 |
RH | % | Relative humidity | Vaisala HMP35D |
Tem | °C | Surface air temperature | Vaisala HMP35D |
WS | M s?1 | Wind speed | XFY3-1 |
SSD | h | Sunshine duration | Australia DSU12 |
GSR | MJ m?2 | Global solar radiation | TBQ-2-B |
CF | % | Cloud fraction | ? |
LCF | % | Low cloud fraction | ? |
Vis | km | Visibility | ? |
e | hPa | Ground vapor pressure | ? |
DGSR | MJ m?2 | Sum of global solar radiation over a day | ? |
DOY | ? | Day of year | ? |
Month | ? | Month of year | ? |
Table1. The short name, definition, and sensor type (if applicable) for each variable.
Previous work aims to establish a virtual network of DGSR datasets using the random forest (RF) model and high-density ground meteorological observations at ~2400 sites across China (Zeng et al., 2020). Due to the spatiotemporal heterogeneity of DGSR, dummy variables including day of year (DOY), latitude, longitude, and altitude at all paired stations were also used as input variables in the prediction model. We used the RF model (as the optimal model) and calculated the relative importance of the variables in this study. This study aimed to construct a high-precision, long-term DGSR dataset for Zhongshan Station, Antarctica. See Fig. 2 for detailed procedures of the estimation and long-term trend analysis of DGSR. Some dummy variables (latitude, longitude, and altitude) and some meteorological variables (e.g., land surface temperature, due to the lack of observations at this station) were not used in this study. As suggested by previous studies (Wang et al., 2016; Zeng et al., 2020), meteorological elements that are highly associated with DGSR were selected as the input variables for the machine learning model. These variables include surface pressure (SP), relative humidity (RH), temperature (Tem), wind speed (WS), and sunshine duration (SSD) (corresponding short and full names are shown in Table 1). The dummy variables (i.e., DOY and month) are also used as input variables in the prediction model [similar to Zeng et al. (2020)]. DGSR is affected by cloud cover, water vapor, and aerosols before reaching the surface (Che et al., 2005). However, aerosols over the Antarctic are relatively low, so they will not be discussed in this study. In addition, bad weather events (fog, snowfall, blowing snow, and snowstorms) with low visibility are frequent at Zhongshan Station and also affect the DGSR. To further analyze the potential causal factors of the DGSR variations, ground vapor pressure (e, which is calculated by temperature and air pressure and represents water vapor content), cloud cover (CF), low cloud cover (LCF), and visibility (Vis) data are also used in this study (CF, LCF, and Vis data were collected by manual visual observation).
Solar radiation observations started relatively late (March 2008) at Zhongshan Station, although short-term observations and research projects of solar radiation were performed during the periods of January–February 1990 (Wang and Xiong, 1991) and February 1993 to December 1994 (Bian et al., 1998). The solar radiation dataset for 2008 to 2020 was first used in this study. The observation site is located in the meteorological field north of the station (as shown in Fig. 1b), where the surface is exposed rock from November to February, and there is snow for a short time in other periods, although there is usually almost no snow. A TBQ-2-B pyranometer made by Beijing Huatron company (Fig. 1c) was used to measure global solar radiation with a wavelength range of 0.3 to 3 μm and a resolution of hours at this station. The measured signal range of the TBQ-2-B pyranometer is 0–2000 W m?2, the output signal is 0–20 mV, and the annual stability is ±2%. To ensure the accuracy of observation data, the TBQ-2-B pyranometer passed the verification and calibration of the China Meteorological Administration before installation.
2
2.2. Model development
32.2.1. Data processing and time matching
The meteorological observation data with a data quality code of 0 (passing all quality control checks) were extracted and time matched. The meteorological data observed at 0000, 0600, 1200, and 1800 UTC each day were then averaged to obtain the daily mean values. The DGSR and SSD were obtained as the sums of 24 hours per day. The final available data include conventional meteorological data from March 1989 to March 2020 and radiation observations from March 2008 to March 2020. Figure 3 shows the statistical properties (minimum, maximum, mean, and standard deviation) of the variables used for model training, testing, and hindcast estimation at Zhongshan Station during 2010–20. The DGSR (SSD) ranges from 0 (0) to 36.27 MJ m?2 (21.70 hours), and the annual average value is 10.04 MJ m?2 (4.83 hours). The annual average SP, Tem, RH, and WS are 985.08 hPa, ?10.16°C, 56.90%, and 6.49 m s?1, respectively.Figure2. Steps of historical estimation and long-term trend analysis of DGSR at Zhongshan Station, Antarctica.
Figure3. The frequency distribution of (a) observed DGSR, (b) SP, (c) Tem, (d) RH, (e) WS, and (f) SSD at Zhongshan Station, Antarctica during 2010–20 for model training and cross-validation.
3
2.2.2. Model building
We evaluated the performances of multiple machine learning models for estimating DGSR at Zhongshan Station, including the RF (Chen et al., 2018), Light Gradient Boosting Machine (LightGBM) (Ke et al., 2017; Chen et al., 2019), decision tree (DT) (Quinlan, 1986), back propagation neural network (BPNN) (Wen et al., 2002), eXtreme Gradient Boosting (XGBoost) (Zelterman, 2015), support vector machine (SVM) (Cortes and Vapnik, 1995), multiple linear regression (MLR) (Zelterman, 2015), and Adaptive Boosting (AdaBoost) (Wang, 2012) models. The RF model is a widely used machine learning model that has a highly flexible algorithm and the capacity to analyze complex interactions of data classifications with noise or missing values (Chen et al., 2018). The RF model has also been used as a variable selection tool to select the input variables for a final model (Zeng et al., 2020). The RF model uses a bagging method to produce the training dataset. The out-of-the-bag (out-of-bag, OOB) data were used to evaluate the veracity of the regression predicted by the RF model. OOB estimation was an unbiased estimation when the number of trees was sufficient. In common statistical methods, overfitting occurs frequently when the methods have high degrees of freedom. In contrast to other models (e.g., the BPNN and SVM models), the RF model is an ensemble of random trees and basically has no issue of overfitting. LightGBM is a gradient promotion framework based on decision trees and can model complex nonlinear functions. LightGBM has the advantages of distributed and high performance in sorting, classification, and regression (Chen et al., 2019). The DT is a common and extensively researched solution to classification and prediction (Quinlan, 1986). The BPNN is a multilayer feedforward neural network based on a mathematical technique named Bayesian regularization to convert nonlinear regression into “well-posed” problems (Wen et al., 2002). The BPNN is composed of three layers: an input layer (first layer), hidden layer (middle layer), and output layer (last layer). XGBoost is a boosting algorithm with high performance for various regression and classification issues. The XGBoost method requires less training and time for prediction and can improve computing speed and accuracy (Gui et al., 2020). The SVM was developed by Vapnik-Chervonenkis dimension theory and structural analysis of the minimum risk principle. The SVM exhibits a unique advantage in dealing with small-sample problems, nonlinear cases, and high-dimensional pattern recognition problems by its kernel functions (Cortes and Vapnik, 1995). MLR is the regression analysis involving two or more independent variables. The MLR model can intuitively and quickly analyze correlations between multiple variables and dependent variables (Zelterman, 2015). Hence, MLR has been widely used in social science, economics, and technology. AdaBoost is an excellent boosting algorithm that combines multiple weak classifiers into a strong classifier. The main purpose of AdaBoost is to train different learning devices on the same training set and then combine these devices to construct a stronger final learning device (Wang, 2012).The model with the best estimated performance will be the final model in this study. As in previous work (Zeng et al., 2020), to obtain the optimal machine learning model, the results of 10-fold cross-validation (10-fold CV) were used to evaluate the model performances with different parameters (the final parameters of the models are shown in Table 2). In 10-fold CV, the matched pairs were partitioned into ten parts in equal proportion, with the first part as the testing subset and the other nine parts as the training subsets. This step was repeated ten times until every subset was tested, and the estimation results (mainly consisting of the coefficient of determination, R2, root-mean-square error, RMSE, and mean absolute error, MAE) of the 10 parts were averaged and used as the accuracy of the final model. The accuracy indicators, including the R2, RMSE, MAE, and difference (estimated minus observed), were used to assess the capabilities of the machine learning models and then obtain an optimal model (Gui et al., 2020; Zeng et al., 2020). In this study, we used the data from April 2010 to March 2020 for training, the 10-fold CV method for testing, and the period from April 2008 to March 2010 for evaluating the historic estimates.
Model name | Parameter | Final value |
RF | n_estimators | 500 |
oob_score | True | |
n_jobs | 2 | |
LightGBM | n_estimators | 3500 |
num_leaves | 600 | |
learning_rate | 0.05 | |
max_depth | 18 | |
XGBoost | max_depth | 16 |
learning_rate | 0.1 | |
n_estimators | 700 | |
GBRT | n_estimators | 750 |
Learning_rate | 0.5 | |
Max_depth | 3 | |
BP | Solver | adam |
Alpha | 1e-5 | |
hidden-layer-sizes | 1000, 500, 100 | |
SVM | tol | 0.000001 |
AdBoost | n_estimators | 500 |
MLR | ? | ? |
DT | ? | ? |
?: Indicates the model parameter value was set as the default. |
Table2. The final selected values of the main parameters for each model.
3
2.2.3. Model application
The optimal model obtained from the above models was applied to estimate the DGSR using meteorological measurements recorded at Zhongshan Station from March 1989 to February 2020. The time variations in DGSR estimated by the optimal model were then compared with the observed DGSR. Finally, the long-term historical estimated DGSR at Zhongshan Station was analyzed, and then changes in the trend and the possible factors influencing these changes were further investigated.2
2.3. Methods for DGSR trend analysis
Least squares regression has been applied to detect the linear trend in DGSR annual anomalies (Guo et al., 2017). Five-year running means of DGSR anomalies have been used to visually display the DGSR trend (Xue et al., 2019). In addition, the sliding trend analysis method has been used to help examine the time nodes of changes because trends often change with the span of the variable calculation period (Che et al., 2019; Gui et al., 2019). According to the method from Gui et al. (2019), we used Student’s t tests to detect the robustness of each trend, and the criterion for statistical significance was set at the 95% confidence level. Since the estimated DGSR dataset and meteorological observation dataset have complete records of the whole year for each year from 1990 to 2019, the study period is set as 1990–2019 for the analyses of the monthly variations in meteorological variables and DGSR and the long-term changes in the DGSR trend.-->
3.1. Validation and comparison of models
Figures 4a and 4b show the scatterplots of the fitted model and 10-fold CV model results of the RF model from April 2010 to March 2020 at Zhongshan Station, Antarctica. Compared with those of other models, we found that the fitted and 10-fold CV results of the RF model have higher R2 values of 0.997 and 0.988 and lower RMSE (MAE) values of 0.547 (1.189) MJ m?2 and 0.278 (0.648) MJ m?2, respectively. To assess the performance of the hindcast estimated by the RF model, the hindcast estimated results from April 2008 to March 2010 are shown in Fig. 4c. We found that the hindcast estimated DGSR presented good consistency with the observed DGSR (R2 = 0.984, RMSE = 1.377 MJ m?2, and MAE = 0.828 MJ m?2). To further examine the hindcast performance of the DGSR estimated by the RF model, as an example, we selected historical estimated DGSR in 2009 (obtained from hindcast estimated results during April 2008 to March 2010) for comparison with the ground observed DGSR. Figure 5a shows the daily time series (Fig. 5b depicts the difference) of observed DGSR and estimated DGSR at Zhongshan Station. These two time series are highly consistent with each other, and the higher daily (monthly) mean difference values mainly occur in the summer, especially in the polar day period, but do not exceed ±5 (±0.85) MJ m?2. Figure 5b also shows that approximately 96.7% (343 days in 2009) of the difference values (observed DGSR minus estimated DGSR) fell within the range of ±2 MJ m?2. The results indicate that the DGSR estimated by the RF model closely fits the observed DGSR. Considering the CV results and the accuracy of the historic estimates, the RF model is highly recommended for DGSR estimation at Zhongshan Station in this study. The relative importance of the variables in the RF model is illustrated in Fig. S1 in the electronic supplementary material (ESM). As shown in Fig. S1, SSD plays a dominant role in terms of the relative importance in the RF model and accounts for 60.3% of the overall importance. This result is consistent with previous studies showing that SSD is significantly correlated with DGSR (Wang et al., 2016; Zeng et al., 2020). The results also indicate two other dominant variables: DOY and Tem (accounting for 16.8% and 16.1% of the overall importance, respectively). These results suggest that DOY (seasonal effects) and Tem are also critical for DGSR estimation.Figure4. Scatterplots of the (a) fitted model, (b) CV model, and (c) hindcast estimation results of the RF model at Zhongshan Station, Antarctica.
Figure5. The (a) time series of observed (blue) versus estimated (red) DGSR and the (b) corresponding difference (observed DGSR minus estimated DGSR) in 2009 at Zhongshan Station, Antarctica. [?2, 2] indicates a difference within ±2 MJ m?2.
In addition to evaluating the performance of the RF model, we also evaluated the performances of other commonly used machine learning models for estimating DGSR at Zhongshan Station. From the results of the performance comparison in Table 3, the RF model performs better than the other models, and the LightGBM, XGBoost, BPNN, and gradient boosted regression tree (GBRT) models show similar historic estimation abilities, followed by those of the DT and AdBoost models. The SVM and MLR models have the worst performances. In common statistical methods, overfitting frequently occurs when the methods have high degrees of freedom. In contrast to other models, the RF model is an ensemble of random trees and has no issue of overfitting in this study.
Model name | Fitted model | 10-fold CV model | Historical estimation power | ||||||||
R2 | RMSE | MAE | R2 | RMSE | MAE | R2 | RMSE | MAE | |||
RF | 0.998 | 0.434 | 0.238 | 0.988 | 1.183 | 0.648 | 0.984 | 1.494 | 0.846 | ||
LightGBM | 0.999 | 0.039 | 0.027 | 0.987 | 1.22 | 0.78 | 0.982 | 1.481 | 0.884 | ||
XGBoost | 0.999 | 0.001 | 0.013 | 0.987 | 1.236 | 0.678 | 0.982 | 1.469 | 0.845 | ||
BP | 0.982 | 1.581 | 1.132 | 0.982 | 1.604 | 1.151 | 0.978 | 1.861 | 1.370 | ||
GBRT | 0.999 | 0.398 | 0.185 | 0.98 | 1.541 | 0.98 | 0.974 | 1.541 | 0.980 | ||
DT | 0.998 | 0.533 | 0.314 | 0.976 | 1.692 | 0.919 | 0.967 | 1.999 | 1.146 | ||
AdBoost | 0.962 | 3.487 | 3.063 | 0.959 | 3.535 | 3.097 | 0.961 | 3.417 | 3.097 | ||
SVM | 0.892 | 5.561 | 4.755 | 0.892 | 5.498 | 4.687 | 0.897 | 3.865 | 2.980 | ||
MLR | 0.895 | 3.498 | 2.773 | 0.895 | 3.500 | 2.777 | 0.882 | 3.778 | 2.988 |
Table3. The results of the fitted model, 10-fold CV model, and historical estimation power by different machine learning models.
Overall, these comprehensive results further confirm that the RF model has reliable performance in estimating historical DGSR. We can expect that it will be feasible to reconstruct the historical DGSR based on meteorological observation data and the RF model. Thus, the estimated historical dataset is used to accurately describe the comprehensive characteristics and changes in the long-term trend of DGSR. Therefore, we mainly use the DGSR estimated by the RF model for 1989 to 2020 in the following trend analysis.
2
3.2. Monthly and annual variations in DGSR
Before analyzing the changes in the DGSR trend, we calculated the monthly and annual characteristics of DGSR estimated by the RF model and the corresponding meteorological variables over the period of 1990–2019 (as shown in Fig. 6). The estimated DGSR shows significant monthly and seasonal changes, in which the DGSR during the half-year of summer (October to March) is significantly higher than that during the half-year of winter (April to September), and the higher DGSR values, up to 30 MJ m?2, are mainly during the polar day period, while the DGSR values are 0 MJ m?2 during the polar night period. The monthly average humidity varies from 56% to 61% from 1990 to 2019 and is higher in summer than in other seasons. The monthly mean temperature varies from ?2.90°C to 0.24°C in summer and from ?14.74°C to ?16.12°C in winter. The monthly average temperature is the highest in January (0.24°C) and the lowest in July (?16.18°C). Variations in surface pressure at Zhongshan Station are characterized by half-year waves. From January to June, there are periods of high pressure, and other months have periods of low pressure. The highest (lowest) monthly average surface pressure occurs in June (October) and is greater than 988 hPa (lower than 981 hPa). The wind speed is the highest in winter, followed by that in autumn, spring, and summer. The highest (lowest) monthly average wind speed is 7.94 m s?1 (5.01 m s?1) in August (January).Figure6. The monthly variation in (a) DGSR (estimated), (b) SP, (c) Tem, (d) RH, (e) WS, and (f) SSD at Zhongshan Station, Antarctica during 1990–2019.
Zhongshan Station is located on the edge of the Antarctic continent and is near the Antarctic ice sheet. The winds at this station are mainly affected by a combination of the easterly airflow in the northern part of the Antarctic continent, polar cyclones, and katabatic winds. Easterly winds prevail over this station year-round. In summer (December, January, and February), both the Antarctic continental cold high-pressure system and the circumpolar low-pressure zone are weaker, and the smaller pressure gradient between these two synoptic systems induces a lower wind speed. In winter (April, May, and June), the prevailing Antarctic continental high-pressure system strengthens, and the circumpolar low-pressure zone moves southward, which causes a larger pressure gradient and thus results in a higher wind speed. Additionally, the monthly variations in air pressure at Zhongshan Station are closely related to these two synoptic systems. Zhongshan Station is covered by snow and ice and the air has lower relative humidity in winter, while the snow and sea ice around this station melt in summer and thus cause increased saturation of water vapor in the air and higher relative humidity. It is noted that the long sunshine duration and strong solar radiation in summer play key roles in the temperature increase at this station.
The monthly variation in SSD is basically consistent with that in DGSR, and the R2 between SSD and DGSR is 0.88, indicating that SSD is the main input variable for DGSR in the machine learning models. Figures S2 and S3 in the ESM show that the yearly average value of the estimated DGSR coincides with that of the measured DGSR from 2009 to 2019 (and also in 1994), and the differences (estimated DGSR minus observed DGSR) were mainly distributed between ?0.1 and 0.1 MJ m?2. Furthermore, the annual changes in the estimated DGSR and observed SSD trends are highly consistent, suggesting that historical DGSR estimated by the RF model has high accuracy for further analysis (such as annual anomaly trends and sliding trends).
2
3.3. DGSR trend
Note there is continuous presence and absence of solar radiation at Zhongshan Station during summer and winter, respectively, because Antarctica experiences the peculiar conditions of polar day (beginning on 23 November and ending on 21 January of the following year at this station) and polar night (beginning on 27 May and ending on 18 July each year at this station). In addition, the DGSR values are 0 MJ m?2 during the polar night period. Therefore, the DGSR trend analysis was divided into the following three scenarios: all days of the year (annual, scenario 1), the polar day period (scenario 2), and the period of all days of the year except for polar day and polar night (scenario 3). Figures 7a–7c show the DGSR anomalies and their trends for the three scenarios. It can be seen that the trend of the DGSR annual anomalies increases during 1990–2004 and then begins to decrease after 2004. However, obvious differences in anomaly values exist among the three scenarios. The variation in DGSR anomalies is the greatest during the polar day period (increasing linear trend of 0.175 (MJ m?2) yr?1 and decreasing linear trend of ?0.101 (MJ m?2) yr?1, which are significant at the 95% confidence level), followed by that during all days of the year (increasing linear trend of 0.039 (MJ m?2) yr–1 and decreasing linear trend of ?0.025 (MJ m?2) yr–1, which are significant at the 95% confidence level), and it is lowest during the period of all days of the year except for polar day and polar night (increasing linear trend of 0.011 (MJ m?2) yr–1 and decreasing linear trend of ?0.001 (MJ m?2) yr–1, which are not statistically significant). In general, the trend of DGSR is similar to Europe (except for China, with a decreasing trend between 1990 and 2000) during the period of brightening, increasing slightly to early 2005, after which it shows a decrease to the present (Che et al., 2005; Ohmura, 2009).Figure7. Time series of the annual mean anomalies of estimated DGSR: (a) annual mean but no polar night, (b) polar day, and (c) no polar day and no polar night. The red lines indicate the 5-year running means of the DGSR anomalies. Sliding-window trend analyses of annual mean estimated DGSR at Zhongshan Station, Antarctica, from 1990 to 2019 for (d) all years but no polar night, (e) polar day, and (f) no polar day and no polar night.
At the Zhongshan Station, the Earth’s surface receives more solar radiation during the polar day period because the sun always stays above the horizon. From 1990 to 2019, the average DGSR during the polar day period varies between 26 MJ m?2 and 31 MJ m?2. However, the annual average DGSR varies from 9.6 MJ m?2 to 10.8 MJ m?2. Therefore, the anomaly values and the range of variation during the polar day period are higher than those during all days of the year. The anomaly values during the period of all days of the year except for polar day and polar night alternate between positive and negative, indicating that the change in total solar radiation during this period basically has no obvious trend. The maximum value of the annual anomalies occurs during approximately 2004/05 and is mainly related to the days with precipitation (such as snowfall, which can result in low visibility and then decreased sunshine duration and solar radiation) at Zhongshan Station in Antarctica. In contrast to scenario 3 (all days of the year except for polar day and polar night), scenario 2 (the polar day period) and scenario 1 (annual) have similar trends in the anomalies (and the sunshine duration during the polar day period accounts for approximately 45% of the total sunshine duration of each year), indicating that the changes in sunshine duration and DGSR during the polar day period play a leading role in the changes in the trend of the DGSR annual anomalies.
Based on Student’s t tests, the sliding trends of DGSR for all situations are shown in Figs. 7d–7f to present a more comprehensive analysis of the annual trends. Sliding trends are calculated for the three scenarios, starting in each year from 1990 to 2015 and ending in 2019 with increments of at least five years. As shown in scenario 1 (all days of the year), the trend of DGSR increases from 1990 to 2003 (although an opposite trend is found during approximately 1993 and 1995), then declines sharply after 2004 (especially in scenario 2: the polar day period), and slightly increases in 2012. The sliding trends for scenario 3 (all days of the year except for polar day and polar night) are smaller and relatively stable compared with those for the other scenarios. When the running mean window is longer than 15 years, the DGSR trends first increase and then decrease in scenario 3 (all days of the year except for polar day and polar night), and most of the trends are statistically significant.
2
3.4. The potential impact factors of DGSR
DGSR is affected by cloud cover, water vapor, and aerosols before reaching the surface (Che et al., 2005). However, aerosols are relatively low over the Antarctic, so they will not be discussed in this study. In contrast, bad weather events (fog, snowfall, blowing snow, and snowstorms) with low visibility are frequent at Antarctic Zhongshan Station and also affect DGSR. Therefore, the effects of cloud fraction, low cloud fraction, ground vapor pressure (e, which represents atmospheric water vapor content), and visibility (which represents bad weather events) on DGSR at this station are further detailed and explicitly analyzed. Note that solar radiation is greatly affected by solar altitude angle in the polar region (which experiences the phenomena of polar day and polar night). To avoid the effects of these phenomena and analyze the influence of potential factors on solar radiation, we selected only the matched samples of April and September each year for discussion in this study. Here, all the matched samples are divided into five subsets according to cloud cover (0%–20%, 20%–40%, 40%–60%, 60%–80%, and 80%–100%), low cloud cover (0%–20%, 20%–40%, 40%–60%, 60%–80%, and 80%–100%), and ground vapor pressure (0–1 hPa, 1–2 hPa, 2–3 hPa, 3–4 hPa, and >4 hPa), and then the average DGSR is calculated for each subset and the results are shown in Figs. 8a, 8b, and 8c, respectively. Similarly, the matched samples are divided into six subsets according to visibility (0–5 km, 5–10 km, 10–15 km, 15–20 km, 20–25 km, and >25 km), then the average DGSR is calculated for each subset (results shown in Fig. 8d).Figure8. The effect on the estimated DGSR by the different (a) CF, (b) LCF, (c) e, and (d) Vis conditions at Zhongshan Station, Antarctica.
We found that DGSR significantly decreases (significantly increases) with increasing cloud fraction and ground vapor pressure (visibility). This is because under cloudy conditions, solar radiation reflects back to the top of the atmosphere, reducing the amount of solar radiation reaching the Earth’s surface. However, DGSR does not change much when the low cloud fraction is greater than 40%. We also found that DGSR is generally low during severe weather with low visibility but is significantly higher under high visibility conditions. Overall, DGSR decreases with increasing cloud cover, low cloud cover, and ground vapor pressure but increases with increasing visibility. The change in DGSR with low cloud cover is not significant with the change in cloud cover, ground vapor pressure, and visibility. Meanwhile, the time series of DGSR, LCF, CF, e, Vis, and SSD and their long-term trends are examined in Figs. S4a–S4f, respectively. DGSR shows a small upward trend. Accordingly, SSD and Vis exhibit upward trends (CF, LCF, and e exhibit downward trends), implying that the cloud cover, water vapor, and abnormal weather (except as solar altitude angle in the polar region) are the common factors that influence the trend of DGSR at this station. To sum up, clouds and water vapor in the atmosphere are the main factors affecting solar radiation. Bad weather conditions, such as fog, blowing snow, and snowstorms, are also a major factor affecting solar radiation at Zhongshan Station, Antarctica.