HTML
--> --> -->Database profile | |
Database title | Tropical cyclone database in the western North Pacific |
Time range | TC best track: 1949–2019; TC wind and precipitation: 1949–2018; TC field experiment observations: 2014–2016. |
Geographical scope | TC best track: western North Pacific and South China Sea; TC wind and precipitation: mainland China; TC field experiments observations: coastal regions of China. |
Data format | TC best track: 2D database table and plain text file (.txt); TC wind and precipitation: 2D database table and comma-separated text file (.csv); TC field experiment observations:.csv and.txt text files. |
Data volume | 3 MB for TC best track; 50 MB for TC wind and precipitation; 90 MB for TC field experiments observations. |
Data service system | http://tcdata.typhoon.org.cn/en/index.html |
Sources of funding | The Key Projects of the National Key R&D Program (Grant No. 2018YFC1506300); Key Program for International S&T Cooperation Projects of China (Grant No. 2017YFE0107700). |
Database composition | The database contains three types of data: 1. TC best track dataset comprises 71 files, with 1 file per year between 1949 and 2019; 2. TC precipitation and wind dataset comprises six files containing the wind and precipitation generated by TCs between 1949 and 2018; and 3. TC field experiments observations dataset contains field observations from seven TCs between 2014 and 2016. |
-->
2.1. Overview
TC best-track, wind and precipitation, landfall, impact, and other relevant data from across the WNP are compiled every year by the CMA. To date, eight datasets have been compiled: the TC best-track dataset (1949–present); TC wind and precipitation dataset (1949–present); TC forecast dataset (1999–present); TC forecast evaluation dataset (1999–present); TC field experiments observational dataset (2007–present, only containing some TC cases, discontinuous); TC remote sensing dataset (discontinuous); global atmospheric conventional observations dataset (discontinuous); and global atmospheric reanalysis dataset (1949–present). These datasets cover the weather and climate, real-time observations, numerical model predictions, and prediction evaluations, and constitute a unique multi-source TC database for the WNP.2
2.2. Data processing methods
The remote sensing dataset, global conventional atmospheric observational dataset, TC forecast dataset, and global atmospheric reanalysis dataset are collected and collated by the operational data transmission network of the CMA or are shared internet resources. The remaining four datasets are collected in real time (TC field experiment observational dataset) or compiled after analysis and synthesis (TC best track, TC wind and precipitation, and TC forecast evaluation dataset) by the STI in collaboration with the Typhoon and Marine Meteorological Expert Working Group of the CMA. All of these datasets are unique and authoritative. In the following, the TC best track, TC wind and precipitation, and TC field experiment observational dataset are introduced in detail. The TC forecast evaluation dataset is covered in another paper.3
2.2.1. TC best track
The TC best-track dataset incorporates six-hourly estimates of a TC’s location (TC center longitude and latitude), intensity (maximum sustained wind and minimum sea level pressure), and track trend, as well as the time of the extratropical transition, and the position and intensity of landfall in China. In addition, since 2017, the temporal resolution within 24 hours of making landfall in China has been increased to three hours. The best-track analysis flow chart is shown in Fig. 1.Figure1. CMA best-track analysis flow chart.
When a TC is far out at sea, estimation of its intensity depends mainly on satellite observations and is done so using the method of Dvorak (1984) and an objective estimation approach based on convective core extraction (Lu et al., 2013a, 2013b, 2014). Determining its position depends mainly on weather maps and satellite cloud pattern recognition (Dvorak, 1975, 1984; Lu et al., 2005; Velden et al., 2012). When a TC is offshore or making landfall, high-density ground observations and ground-based radar can be used to locate and estimate its intensity. The TC surface vortex center can be clearly distinguished, and its maximum wind speed can be measured. Finally, all observations and analysis from the various methods are integrated subjectively to obtain the final best-track position and intensity.
During the evolution of the CMA TC best-track analysis, there have been many changes in the data used. Geostationary satellite and radar data have been used for TC center location and intensity estimation analysis since the 1980s (Ying et al., 2014). The satellites used are shown in Fig. 2, and the Fengyun geostationary satellite series (i.e., the Chinese FY series, of which FY-2 and FY-4 are the main focus here) have been the major source of TC data since 2005. These satellites give complete coverage of the WNP. The Geostationary Meteorological Satellite series from Japan and the Geostationary Operational Environmental Satellite series from the United States were the main satellites used before 2005. The Meteorological Satellite series from Europe were used for supplementary information for the period between 2000 and 2010. The HIMAWARI-8 (from Japan) geostationary satellite has also been used frequently since 2015, in combination with the FY series, on account of its high spatial and spectral resolutions. Also, the coastal radar density has been increased from zero radars over the whole mainland to one single-point radar station approximately every 150 to 200 km across central and eastern China [following the special planning applied to meteorological radar development (CMA, 2017)], which makes it possible to capture every TC that makes landfall or approaches the shore in real time. Further, a series of microwave observations have been applied to TC best-track analysis, including the Advanced Microwave Sounding Units (2006–2018), Quik Scatterometer data (2003–2009), Advanced Scatterometer Data (2007–present), and the Special Sensor Microwave Imager and the Special Sensor Microwave Imager Sounder (2009–present). In summary, with the application of more detailed satellite and radar observations, the accuracy of TC best-track data will certainly be improved.
Figure2. Time series of geostationary satellites and their coverage at the equator (limited to a view zenith angle of about 60° for illustrative purposes)
3
2.2.2. Wind and precipitation generated by TCs
The CMA TC wind and precipitation dataset is the only dataset in the world that preserves a long time series of ground observations, identified and integrated artificially by professionals. This work is accomplished using a combination of satellite observations, weather maps, and ground observations. Two rules are applied to distinguish between TC-affected areas and areas not affected by TCs. Firstly, the wind–rain area occurring within the TC spiral cloud belt is clearly identified as the TC-affected region. Secondly, the wind–rain area caused by TCs and other systems (e.g., cold fronts, southwest eddies) identified from the synoptic charts is also defined as the TC-affected area. Finally, the wind and rain observations observed in the TC-affected area are collected and compiled as the complete TC wind and precipitation dataset. The dataset contains total precipitation (mm), daily precipitation (mm), maximum hourly rainfall (mm), maximum sustained wind speed (m s?1), maximum gust (m s?1), and beginning and ending times affected by gales (Ying et al., 2014). It should be noted that the spatial scope of this dataset only covers the Chinese mainland.Since the founding of China in 1949, the construction of a meteorological observation network has been vigorously pursued. The increase in the number of conventional ground-based observation stations on the Chinese mainland between 1949 and 2011 is shown in Fig. 3. The number of surface meteorological stations increased from only 108 (in 1949) to 2405 (in 2011), and the amount of data recorded has increased and the quality of observations has also been improved. In view of the development of this observation network, the question of how to best use the observational data scientifically and effectively, whilst avoiding the possible impact on the climatic statistics of the spatial sampling change caused by the increase in station density, is a prime concern. This is one of the fundamental elements that scientists must consider when carrying out research. Until 2019, there were 2429 ground-based observation stations contributing to the wind and precipitation dataset (CMA, 2019), including national basic weather stations, national reference climatological stations, and national general weather stations, covering all provinces, autonomous regions, and municipalities except Xizang, Xinjiang, and Qinghai. Therefore, a fixed observation station network, composed of stations included in the Global Transmission System, was selected for this shared dataset. This shared dataset has temporal consistency and spatial homogeneity (Fig. 4).
Figure3. Evolution of the quantity of conventional ground-based observations in mainland China between 1949 and 2011.
Figure4. Spatial distribution of stations for the shared dataset.
3
2.2.3. TC field experiment observational dataset
Since 2007, the STI has conducted experimental field observations on TCs making landfall in, or affecting, China using mobile observation vehicles, gradient wind towers, and fixed observation bases. By the end of December 2019, 30 TCs had been captured by cooperation with other organizations under the National Basic Research Program of China as well as other support from other projects. They were detected using instruments such as wind-profiling radar, laser raindrop spectrometers, microwave radiometers, ultrasonic wind thermometers, automatic weather stations, wind towers, and multi-band radar. However, as some data are still being processed, and some data remain confidential at present, only those TCs targeted between 2014 and 2016 (Table 1) were listed in this dataset. The fields captured included the year, TC name, observation instruments, and observational positions. The remainder of the data will be shared step by step in the future.TC name | Instrument (s) | Observational position |
Fung-wong (2014) | Laser raindrop spectrometers | 28.4292°N, 121.5631°E |
Wind-profile radar | ||
Chan-hom (2015) | Laser raindrop spectrometers | 28.4278°N, 121.5936°E |
Wind-profile radar | ||
Mujigae (2015) | Microwave radiometer | 18.61298°N, 110.20888°E |
Nepartak (2016) | Wind-profile radar | 26.9186°N, 120.2228°E |
Meranti (2016) | Wind-profile radar | 24.6722°N, 118.6978°E |
Microwave radiometer | ||
Laser raindrop spectrometers | ||
Megi (2016) | Wind-profile radar | 26.9178°N, 120.2239°E |
Microwave radiometer | ||
Laser raindrop spectrometers | ||
Sarika (2016) | Microwave radiometer | 18.63210°N, 110.21548°E |
Laser raindrop spectrometers |
Table1. TCs targeted by field experiments between 2014 and 2016.
2
2.3. Sample description of the dataset
32.3.1. Sample description of the TC best track
The TC best-track dataset is composed of 71 files. The format of the file names is “CHyyyyBST.txt”, where CH represents Chinese, yyyy represents the year, and BST represents best track. There is a header line for every TC in each best track file. This header line contains nine items; i.e., the header line indicator (“66666” indicates best-track data), international ID, number of data lines for the current TC, TC serial number, Chinese ID, flag of the last data line, temporal resolution of the TC track, TC name, and the dataset creation time (UTC, yyyymmddHH). The TC serial number indicates the sequence number (expressed as YYYYNN, where YYYY indicates the year and NN indicates the sequence number), given by the STI, and includes all TCs above tropical depression level. The Chinese ID refers to the sequence number (expressed as YYNN, where YY indicates the last two digits of the year and NN indicates the sequence number) given by the National Meteorological Centre, and includes only TCs above tropical storm level.In Table 2, TC Pabuk (2019) is used as an example to show how the data are organized. The first row shows that Pabuk was generated at 0600 UTC 31 December 2018, centered at (8.1°N, 112.4°E), with a minimum sea level pressure of 1004 hPa and a maximum sustained wind speed of 13 m s?1. The TC intensity category is classified into six levels [i.e., tropical depression (TD), tropical storm (TS), severe tropical storm (STS), typhoon (TY), severe typhoon (STY), and super typhoon (SuperTY)] according to the Chinese National Standard for Grade of Tropical Cyclones (Ying et al., 2014).
66666 | 1901 | 20 | 0001 | 1901 | 0 | 6 | PABUK | 20200417 | |
2018123106 | 1 | 81 | 1124 | 1004 | 13 | ||||
2018123112 | 1 | 76 | 1117 | 1004 | 13 | ||||
2018123118 | 1 | 70 | 1112 | 1002 | 15 | ||||
2019010100 | 1 | 65 | 1107 | 1002 | 15 | ||||
2019010106 | 2 | 62 | 1101 | 1000 | 18 | ||||
2019010112 | 2 | 60 | 1096 | 1000 | 18 | ||||
2019010118 | 2 | 60 | 1091 | 1000 | 18 | ||||
2019010200 | 2 | 60 | 1086 | 1000 | 18 | ||||
2019010206 | 2 | 62 | 1078 | 1000 | 18 | ||||
2019010212 | 2 | 62 | 1069 | 998 | 20 | ||||
2019010218 | 2 | 62 | 1057 | 998 | 20 | ||||
2019010300 | 2 | 62 | 1050 | 998 | 20 | ||||
2019010306 | 2 | 63 | 1043 | 998 | 20 | ||||
2019010312 | 2 | 65 | 1034 | 995 | 23 | ||||
2019010318 | 2 | 72 | 1025 | 995 | 23 | ||||
2019010400 | 3 | 78 | 1016 | 990 | 25 | ||||
2019010406 | 2 | 81 | 1006 | 995 | 23 | ||||
2019010412 | 2 | 82 | 998 | 998 | 20 | ||||
2019010418 | 1 | 83 | 992 | 1006 | 15 | ||||
2019010500 | 1 | 86 | 986 | 1006 | 15 | ||||
Notes: In the header line, “66666” indicates best-track data, “1901” is the international ID of TC Pabuk, “20” indicates there are 20 rows for TC Pabuk, “0001” is the TC serial number for Pabuk, “1901” is the Chinese ID for TC Pabuk, “0” indicates that this TC died at the last point, “6” indicates that there is a record every 6 hours for TC track, PABUK is the name of current TC, “20200417 ” indicates that this dataset was created on 17 April 2020. And the following rows are the track records, including record time (yyyymmddHH, UTC), intensity category, and central latitude (0.1 degree), central longitude (0.1 degree), central minimum pressure (hPa), and central maximum wind speed (m s?1) from left to right in turn. |
Table2. Best track data for TC Pabuk (2019). TC Pabuk (2019) is used as an example to show how the data are organized.
3
2.3.2. Sample description of the TC wind and precipitation
The TC wind and precipitation dataset comprises six files named “1949?2018_W8Date.csv”, “1949?2018_Wind.csv”, “1949?2018_Gust.csv”, “1949?2018_DailyPrecipitation.csv”, “1949?2018_TotalPrecipitation.csv”, and “1951?2018_MaxHourlyPrecipitation.csv”. All row information corresponds to the attributes listed in the description of the first row. Tables 3 and 4 contain part of the TC daily precipitation and maximum hourly precipitation data for TC Son-Tinh (2018), to show the organization of the TC daily and maximum hourly precipitation datasets.TC serial number | Chinese ID | Station number | Date | Precipitation (mm) |
201810 | 1809 | 56969 | 2018-7-18 | 0.1 |
201810 | 1809 | 56964 | 2018-7-19 | 12 |
201810 | 1809 | 56969 | 2018-7-19 | 1.8 |
201810 | 1809 | 56951 | 2018-7-20 | 0.4 |
201810 | 1809 | 56964 | 2018-7-20 | 12.9 |
201810 | 1809 | 56969 | 2018-7-20 | 11.4 |
201810 | 1809 | 56964 | 2018-7-21 | 20.8 |
201810 | 1809 | 56969 | 2018-7-21 | 7.2 |
201810 | 1809 | 56778 | 2018-7-22 | 8.8 |
201810 | 1809 | 57687 | 2018-7-22 | 0.4 |
201810 | 1809 | 57799 | 2018-7-22 | 12.1 |
201810 | 1809 | 57816 | 2018-7-22 | 0.9 |
201810 | 1809 | 56778 | 2018-7-24 | 0.4 |
201810 | 1809 | 57745 | 2018-7-24 | 0.8 |
201810 | 1809 | 57816 | 2018-7-24 | 0.1 |
201810 | 1809 | 56492 | 2018-7-25 | 44.8 |
201810 | 1809 | 56691 | 2018-7-25 | 3.5 |
201810 | 1809 | 56739 | 2018-7-25 | 0.4 |
201810 | 1809 | 56778 | 2018-7-25 | 14.5 |
201810 | 1809 | 56951 | 2018-7-25 | 1.9 |
201810 | 1809 | 56964 | 2018-7-25 | 22.8 |
201810 | 1809 | 56969 | 2018-7-25 | 51.2 |
201810 | 1809 | 57411 | 2018-7-25 | 14.5 |
201810 | 1809 | 57516 | 2018-7-25 | 4.2 |
201810 | 1809 | 57816 | 2018-7-25 | 26.4 |
Table3. Partial daily precipitation data for TC Son-Tinh (2018).
TC serial number | Chinese ID | Station number | Count | Precipitation (mm) | Starting and ending time (yyyymmddHHMM, LST) |
201810 | 1809 | 56492 | 1 | 23.9 | 201807250400-201807250500 |
201810 | 1809 | 56964 | 1 | 10.6 | 201807251500-201807251600 |
201810 | 1809 | 56969 | 1 | 50.4 | 201807251900-201807252000 |
201810 | 1809 | 57411 | 1 | 12.5 | 201807250700-201807250800 |
201810 | 1809 | 57902 | 1 | 18.5 | 201807251300-201807251400 |
201810 | 1809 | 57957 | 1 | 15.5 | 201807231500-201807231600 |
201810 | 1809 | 59007 | 1 | 26.7 | 201807250500-201807250600 |
201810 | 1809 | 59023 | 1 | 11.7 | 201807241500-201807241600 |
201810 | 1809 | 59117 | 1 | 29.6 | 201807251500-201807251600 |
201810 | 1809 | 59211 | 1 | 19.2 | 201807250400-201807250500 |
201810 | 1809 | 59287 | 1 | 24.6 | 201807231200-201807231300 |
201810 | 1809 | 59316 | 1 | 11 | 201807240500-201807240600 |
201810 | 1809 | 59431 | 1 | 32.4 | 201807241500-201807241600 |
201810 | 1809 | 59501 | 1 | 7.7 | 201807190600-201807190700 |
201810 | 1809 | 59644 | 1 | 20.2 | 201807212000-201807212100 |
201810 | 1809 | 59663 | 1 | 33.1 | 201807172200-201807172300 |
201810 | 1809 | 59758 | 1 | 6.5 | 201807232000-201807232100 |
201810 | 1809 | 59838 | 1 | 16.4 | 201807221800-201807221900 |
Notes: LST, Beijing Standard Time, LST = UTC+8. |
Table4. Partial maximum hourly precipitation for TC Son-Tinh (2018).
3
2.3.3. Sample description of the TC field experiment observations
The TC field experiment observations dataset comprises three types of data, which are described as follows:(1) The first data type is the wind profile radar observations. There are three levels of data stored in a folder named by date. Half-hourly observations are stored in the sub-folder “HOBS”, hourly observations are stored in the sub-folder “OOBS”, and ten-minute observations are stored in the sub-folder “ROBS”. The format of the file names in each sub-folder is “Z_RADR_I _IIiii_yyyyMMddhhmmss_O_WPRD_RadarType_DataLevel.TXT”. Here, “RadarType” is “LC”, representing the boundary wind profile radar, and “DataLevel” contains the HOBS, OOBS, or ROBS information. Detailed information can be found in the specific description file from
Wind profile radar observations from 0100:00 UTC 15 September 2016 for TC Meranti are used as an example of the organization of this dataset in Table 5.
WNDROBS | 1.2 | ||||
58368 | 118.6978 | 24.6722 | 6 | LC | |
20160915010000 | |||||
ROBS | |||||
0 | ///// | ///// | ////// | 64 | 64 |
50 | 77.8 | 30.5 | ////// | 64 | 64 |
100 | 81.4 | 31.5 | ////// | 64 | 64 |
150 | 76.3 | 28.6 | ////// | 64 | 73 |
250 | 84.9 | 29.4 | 4.1 | 55 | 73 |
300 | 99.3 | 29.7 | 4.1 | 55 | 73 |
350 | 91.9 | 29.3 | ////// | 55 | 82 |
400 | 96 | 30 | ////// | 64 | 64 |
450 | 97.9 | 30.2 | ////// | 64 | 64 |
500 | 99 | 29.6 | ////// | 64 | 64 |
600 | 100.1 | 29.7 | ////// | 64 | 64 |
650 | 101.9 | 29.8 | ////// | 64 | 64 |
700 | 102.8 | 30.7 | 4 | 73 | 73 |
750 | 104.5 | 36.1 | 3.9 | 64 | 73 |
800 | 104.1 | 36.2 | 3.8 | 64 | 73 |
900 | 106 | 33.9 | ////// | 64 | 64 |
1000 | 104.3 | 31.4 | ////// | 64 | 64 |
1050 | 105.7 | 31.2 | ////// | 64 | 64 |
1150 | 106.6 | 31.7 | ////// | 73 | 64 |
1250 | 107.7 | 31.8 | ////// | 64 | 64 |
1350 | 108.4 | 30.8 | ////// | 64 | 64 |
1400 | 109 | 31 | ////// | 64 | 64 |
1500 | 112 | 33.2 | ////// | 64 | 64 |
1600 | ///// | ///// | ////// | 55 | 64 |
1700 | ///// | ///// | ////// | 55 | 64 |
1750 | ///// | ///// | ////// | 55 | 64 |
1850 | ///// | ///// | ////// | 45 | 64 |
1950 | ///// | ///// | ////// | 45 | 64 |
2050 | ///// | ///// | ////// | 45 | 55 |
2100 | ///// | ///// | ////// | 45 | 55 |
2200 | ///// | ///// | ////// | 45 | 45 |
2300 | ///// | ///// | ////// | 45 | 45 |
2400 | ///// | ///// | ////// | 45 | 45 |
2450 | ///// | ///// | ////// | 45 | 45 |
2550 | ///// | ///// | ////// | 45 | 45 |
2650 | ///// | ///// | ////// | 45 | 45 |
2700 | ///// | ///// | ////// | 45 | 45 |
2800 | ///// | ///// | ////// | 45 | 45 |
2900 | ///// | ///// | ////// | 45 | 45 |
3000 | ///// | ///// | ////// | 45 | 45 |
3050 | ///// | ///// | ////// | 45 | 55 |
3150 | ///// | ///// | ////// | 45 | 55 |
3250 | ///// | ///// | ////// | 45 | 55 |
3350 | ///// | ///// | ////// | 36 | 64 |
3400 | ///// | ///// | ////// | 36 | 64 |
3500 | ///// | ///// | ////// | 36 | 64 |
3600 | ///// | ///// | ////// | 45 | 64 |
3700 | ///// | ///// | ////// | 45 | 55 |
3750 | ///// | ///// | ////// | 36 | 55 |
3850 | ///// | ///// | ////// | 36 | 55 |
3950 | ///// | ///// | ////// | 36 | 55 |
4050 | ///// | ///// | ////// | 45 | 55 |
4100 | ///// | ///// | ////// | 36 | 55 |
4200 | ///// | ///// | ////// | 45 | 55 |
4300 | ///// | ///// | ////// | 45 | 55 |
4400 | ///// | ///// | ////// | 45 | 55 |
4450 | ///// | ///// | ////// | 45 | 55 |
4550 | ///// | ///// | ////// | 55 | 64 |
4650 | ///// | ///// | ////// | 64 | 64 |
4700 | 121.7 | 30.8 | 3.4 | 64 | 73 |
NNNN | |||||
Notes: “NNNN” is the end flag of this file. The missing data is represented by ‘//////’. |
Table5. Wind profile radar observations at 0100:00 LST 15 September 2016 for TC Meranti.
There are three parts to Table 5. The first part, at the top, contains some comments and covers four rows. Row one shows the key word of the wind profile radar and the file version. Row two shows the station number, station position (central longitude and latitude; units: degrees), station altitude (units: m), and radar type. Row three shows the observation time (yyyymmddHHMMSS, LST, Beijing Time, LST = UTC + 8). Row four shows the data level. The second part contains the observational data and comprises six columns—namely, the sampling height (units: m), horizontal wind direction (units: degrees), horizontal wind speed (units: m s?1), vertical wind speed (units: m s?1), horizontal credibility (%), and vertical credibility (%), from left to right. The symbol “////” indicates missing data. The third part is the end flag, represented by “NNNN”.
A preliminary example of the wind profile radar observations for TC Meranti between 14 and 15 September 2016 is shown in Fig. 5. During the observational period, TC Meranti was about 50–100 km south of the wind profile radar. With TC Meranti approaching, the wind direction shifted from northeast and east to south, demonstrating an apparent wind direction change when the TC passed. In addition, when TC Meranti was closest to the wind profile radar, the wind speed increased to the maximum value (at about 0055 15 September) from the lower to upper level. This type of data can provide detailed vertical wind structure information, enabling research on the evolution of the dynamic structure of TCs.
Figure5. Wind profile radar observations for TC Meranti between 14 and 15 September 2016.
(2) The second data type is microwave radiometer observations. The format of the files name is “yyyy-MM-dd_hh-mm-ss_DataLevel.csv”. There are three data levels for the microwave radiometer observations: lv0 (level 0 file), lv1 (level 1 file), and lv2 (level 2 file). Level0 files contain raw, unprocessed data in engineering units. Level1 files contain real-time brightness temperatures for each channel specified in the configuration file. Level2 files contain records of real-time retrievals of temperature (K), water vapor (g m?3), relative humidity (%), and liquid water (g m?3) profiles. Each file contains the record headers and the real-time observation data profile. The observation data from the microwave radiometer at 1322:27 14 September 2016 are given below as an example:
“6, 09/14/16 13:23:37, 401, Zenith26, 298.126, 299.063, 298.770, …
7, 09/14/16 13:23:38, 402, Zenith26, 22.665, 21.926, 21.706, 21.538, …
8, 09/14/16 13:23:38, 403, Zenith26, 0.152, 0.298, 0.471, 0.560, …
9, 09/14/16 13:23:39, 404, Zenith26, 96.830, 88.808, 89.927, …”
Here, field 1 is the record number, field 2 contains the date and time (LST), field 3 contains the record type, record 4 is the sort index, and the remainder are 58 variable values at 58 heights, all separated by commas. More detailed information can be found in the specific description file from
(3) The third data type is the laser raindrop spectrometer observations. The format for these files name is “NNNN_parsivel2.txt”, and there is one file for each TC. Here, NNNN represents the Chinese ID of the TC and “parsivel2” represents the model number of the laser raindrop spectrometer.
The observations are stored line by line using the following format:
yyyy;mm;dd;hh;mm;ss;<SPECTRUM>;...;</SPECTRUM>,
where yyyy is the year, mm is the month, dd is the day, hh is the hour, mm is the minutes, and ss is the seconds (LST). The sampling interval is 10 seconds. The precipitation particle spectrum data run between <SPECTRUM> and </SPECTRUM> delimited by a semicolon, and include 32 × 32 = 1024 channels of data altogether. These data record the number of particles in the first diameter channel of the first velocity channel, the number of particles in the second diameter channel of the first velocity channel, …, the number of particles in the 32nd diameter channel of the first velocity channel, the number of particles in the first diameter channel of the second velocity channel, …, the number of particles in the 32nd diameter channel of the second velocity channel, …, and the number of particles in the 32nd diameter channel of the 32nd velocity channel. Detailed information can be found in the specific description file from
2
2.4. Quality of the datasets
For the TC best-track, wind, and precipitation datasets, data quality control was conducted throughout the analysis flow chart (see Fig. 1). Firstly, the analyzed dataset drafts were submitted to the Working Group of Typhoons and Marine Meteorology Experts for checking in near-real-time (TC best track) or after the season (TC wind and precipitation). This working group comprises experienced forecasters and researchers from meteorological departments all over the country, and is organized by the CMA. Moreover, a temporary working group of experts is formed at the end of each year to carry out detailed analysis of any difficult cases. Finally, all drafts are provided to the Working Group of Typhoons and Marine Meteorology Experts for final examination at the beginning of the following year. After that, the datasets can be published.For the TC field experiment observational datasets, we used the raw data after basic quality control had been applied by the instrumental software. TC field data quality control methods vary with the research aims, and many remain under study.
2
2.5. Storage and access
The multi-source TC data described above are stored at the STI as two-dimensional tables, text files or binary files, and managed using an SQL Server 2012 Standard version. Although some data remain confidential at present, the whole database can be queried and shared in various ways. The data can be accessed through publications (e.g., the Annual Tropical Cyclone Yearbook [the latest being the Yearbook of Tropical Cyclones for 2017 (STI/CMA, 2019)] and the Tropical Cyclone Climatic Atlas [the latest being the Climatological Atlas of Tropical Cyclones over the Western North Pacific (1981?2010) (CMA, 2017)], via the operational intranet platforms of the CMA, such as the Shanghai Typhoon Warning Centre, the Typhoon Science Data Sharing Platform, and the Tropical Cyclone Retrieval System over the WNP, as well as on the internet, such as the TC data Centre of the CMA (2
2.6. Applications of the TC multi-source database
The multi-source TC database covers a long period (1949–present), has wide coverage (from mainland China, to offshore in the WNP, and globally), and multiple observation elements (from ground to high altitude, from conventional atmospheric elements such as wind, temperature, and humidity to raindrop spectra, supersonic wind temperature, and other particular observational elements), and includes TC-related historical or real-time location, intensity, dynamic and thermal structures, wind speed, precipitation, frequency, atmospheric environment, and underlying surface conditions. Consequently, it has important practical value for TC operational forecasting and scientific research. Using the above multi-source databases, TC forecasters and scientific researchers have also carried out in-depth data mining projects, such as the yearly “Tropical Cyclone Yearbook in the WNP”, summarizing and compiling the “Tropical Cyclone Climatic Atlas”, and building a “Tropical Cyclone Retrieval System over the Western North Pacific (Tropical Cyclone Retrieval System)”. In particular, this Tropical Cyclone Retrieval System is an integrated platform for TC querying and analysis, supported by computer and data analysis technologies. It can superpose and display satellite images, grid data, ground observations, and other information on a unified platform. The most important feature of this system is that users can define the retrieval criteria that they wish to use to query the real-time TC predictions, best-track data, and historical TC data from the multi-source TC database. The Tropical Cyclone Retrieval System has been one of the important reference tools for researchers and forecasters. The multi-source TC database thus plays an important role in revealing TC climatic rules in the WNP, determining those influencing factors of most physical significance, and finally establishing operational forecasting schemes.In addition, the database can provide not only basic scientific support for TC researchers, operational personnel, and governments to conduct TC research, forecasts, and disaster prevention and mitigation, but also important scientific information for other relevant professionals, including industry, agriculture, fisheries, transportation, aviation, navigation, and national defense.