删除或更新信息,请邮件至freekaoyan#163.com(#换成@)

A dataset of Ya'an Earthquake based on social media

本站小编 Free考研考试/2022-01-02

<script type="text/javascript" src="https://cdn.bootcss.com/mathjax/2.7.2-beta.0/MathJax.js?config=TeX-AMS-MML_HTMLorMML"></script> <script type='text/x-mathjax-config'> MathJax.Hub.Config( { extensions : ["tex2jax.js"], jax : ["input/TeX", "output/HTML-CSS"], tex2jax : {inlineMath: [["\\(", "\\)"]]} }); </script>
Abstract & Keywords
Abstract:?The Ya'an Earthquake occurred on April 20, 2013 (Beijing time). Its epicenter was located in Lushan County, Ya'an City, Sichuan Province, and the magnitude of this earthquake was 7.0. As of 14:30, April 24, the earthquake caused a total of 196 people dead, 21 missing and 11470 injured. With the development of information and communication technologies, microblog shows great potential in promoting emergency response as it provides an easily accessible platform on which disaster information could be assembled and rapidly disseminated to a large number of audiences. In view of this, we built the dataset of Ya'an Earthquake based on Sina-Weibo microblogs posted within Sichuan Province during 7 days after its occurrence. Sina-Weibo, a platform for information sharing and exchange, entertainment, leisure and life services, was launched in August 2009. It provides a platform where the public can communicate, express their feelings, offer suggestions, and so on – a platform that is essential for earthquake data search, query and publishing.
Keywords:?Ya'an Earthquake;?Sina-Weibo;?Sichuan Province;?data mining

Dataset Profile
Chinese Title雅安地震灾情的社交媒体数据集
English TitleA dataset of Ya'an Earthquake based on social media
Data authorsTian Chuanzhao, Li Guoqing, Yang Tengfei, Li Zhenyu
Corresponding data authorLi Guoqing
Time rangeApril 20 – 26, 2013
Geographical scopeSichuan Province
Data volume51418 records (about 5MB)
Data format.xls
Data service system<http://www.sciencedb.cn/dataSet/handle/560>
Source of fundingNational Key Research and Development Program of China (2016YFE0122600) ; International Partnership Program of Chinese Academy of Sciences(131C11KYSB20160061)
Dataset compositionThe data set consists of two parts of data:
(1) "Data.rar" contains 21 tables of Sina-Weibo text data, and each table corresponds to a region;
(2) "Classification sample.rar" is a sample subset illustrating the classification of the text data in "Data.rar".



1. ? Introduction
Ya'an Earthquake: 1 according to the China Earthquake Networks Center, the Ya'an Earthquake occurred at 8:02, April 20, 2013 (Beijing time). The epicenter was located in Lushan County, Ya'an City (30.3N, 103.0E), at a depth of 13 km, and the earthquake had a magnitude of 7.0. As of 10:00, April 24, 2013, 4045 aftershocks occurred, among which 103 were above magnitude 3, with the biggest being 5.7. An area of 12500 km2 around the epicenter was affected, involving 1.52 million people. According to the China Earthquake Administration, the earthquake caused 196 people dead, 21 missing and 11470 injured as of 14:30, April 24. Figure 1 shows the location of earthquake occurrence.




Figure 1 ? Location of Ya'an Earthquake
Sina-Weibo, 2 an information sharing and exchange platform that provides entertainment, leisure, and other life services for the public, was launched in August 2009. By the end of March 2013, Sina-Weibo had a number of 536 million registered users, with an annual increase rate of 6.6%, and the number of its daily active users increased to 49.8 million, by 7.8% as of the end of 2012. Sina-Weibo provides timely updates about earthquake disasters. It is a platform where users are free to make searches and queries, where government bodies can post dynamic information about security and rescue, where the public can communicate to express their feelings, such as blessing, sadness, anger, anxiety, etc., and where users can propose to the government actions to be taken. Figure 2 shows some earthquake information at Sina-Weibo.




Figure 2 ? Earthquake information obtained from Sina-Weibo
There is growing evidence8–11 that the public would look for disaster information most intensively during a certain period of time after its occurrence, irrespective of the sources.3,6 As citizens can both access and post disaster information at open social platforms, such information constitutes a key part of effective responses to a major disaster.
On this aspect, research abroad goes earlier than the domestic. Glaser et al.4 analyzed Twitter data during the 2007 California Wildfires. Vieweg et al.5 researched on Twitter data for the 2009 Red River Floods and the 2009 Oklahoma Grassfires. It can be seen that Twitter has already been an effective channel for real-time updates. In China, scholars also studied the application of microblogs in formulating disaster response. Qu et al.6 analyzed people's responses to the 2008 Sichuan Earthquake based on Tianya Forum data, and Qu et at.7 analyzed people's responses to the 2010 Yushu Earthquake based on Sina-Weibo microblogs.

2. ? Data collection and processing
2.1 ? Overview
Using "Ya'an Earthquake" as the keywords, we searched Sina-Weibo text data posted within the geographical location of Sichuan Province during April 20 – 26, 2013. Each data record included: microblog content, time created, number of forwards, number of likes, number of comments and other information.
We first determined a city for data crawling and collected data from 21 cities of Sichuan Province. Due to Sina-Weibo's search limitations (i.e., up to 1000 records per search), we then determined a time interval for data crawling. Because of Sina-Weibo's search limitations, the amount of data would reach a peak during a certain period, or within 72 hours, after the earthquake occurrence, which is called the golden relief time. We collected Sina-Weibo data posted from all the cities of Sichuan Province during this period at a time interval of each hour. At other special time periods when Sina-Weibo data was released in particularly large quantities, we crawled data every few hours. However, at periods when the volume was small, we crawled data every few days. The data collected at respective time intervals was then stored into an appropriate data table.
We analyzed various counts of the 51418 earthquake-related messages collected within a week period after the earthquake occurrence. We counted the number of messages posted each day (indicated by the blue line in Figure 3), the number of messages forwarded (indicated by the red line in Figure 3), the number of messages commented (indicated by the green line in Figure 3) and the number of messages liked (indicated by the orange line in Figure 3).




Figure 3 ? Counts of Ya'an Earthquake-related messages

2.2 ? Data classification
We asked what types of messages people posted at Sina-Weibo in response to the earthquake. To answer the question, we randomly sampled 200 microblog messages for analysis. We identified six categories of content: emotion-related, opinion-related, action-related, situation updates, general information and others. Table 1 shows a summary of the categories.
Table 1 ? Classification of Sina-Weibo messages
CategoryDescription
Emotion-relatedExpressing personal feelings such as blessing, sadness, anger, anxiety, etc.
Opinion-relatedCriticizing or providing suggestions to the public, the government or rescue agencies.
Action-relatedRequesting help, looking for missing people, or proposing relief actions or relief coordination.
Situation UpdatesUpdating factual information about the earthquake.
General InformationAny other earthquake relief-related information.
OthersOther earthquake-related information.

We applied the categories to sampled Sina-Weibo messages (Figure 4), and concluded 42% for emotion-related messages, 21% for action-related messages, 14% for situation updates, 8% for general information, 4% for opinion-related messages, and 11% for other messages on the earthquake.




Figure 4 ? Sample data classification


3. ? Sample description
The data retrieved from Sina-Weibo was stored into 21 tables. Each table corresponds to a city. Each data entry records information on the ID, content, location, time, forwardCount, commentCount, likeCount, keyword, province and city of the microblog posted.
Table 2 ? Sample data entry
Field NameDescription
ID2231
Content#Earthquake Live # 7.0 Ya'an Earthquake of Lushan: As of 18:00 April 21, there were 1642 aftershocks, including 78 aftershocks of magnitudes 3 and above, 4 of magnitude above 5.0 and above, and 18 of magnitudes between 4.0 – 4.9 , and 56 of magnitudes between 3.0 – 3.9. The largest aftershock occurred at 5.45 pm, April 21 at Lushan. The 5.4-magnitude aftershock occurred at the junction of the two peaks.
Location
Time2013-04-21 18:38
ForwardCount10
CommentCount5
LikeCount1
KeywordYa'an Earthquake
ProvinceSichuan
CityChengdu


4. ? Quality control and assessment
When the body of the message retrieved was removed from Sino-Weibo, we then removed this data entry from our dataset accordingly. Data without time information was also removed in the process of quality control. In addition, information with hyperlinks only or without valuable information was removed from our dataset. An example is shown below:
“# Ya’an earthquake in Sichuan # # microblogging topic details: web links, # Ya’an earthquake in Sichuan # Details: web links, # Ya’an 7 earthquake # # microblogging topic details: web link, # Ya’an 7 earthquake # Details: Web links, # Ya’an 7 earthquake # #, Ya’an earthquake microblogging reported safe # #.”

5. ? Value and significance
As time goes by, some messages retrieved have now been deleted by their bloggers, which makes it impossible to access some valuable messages posted at that time. As the only dataset that collects information about the 2013 Ya'an Earthquake, this dataset provides essential resources from Sina-Weibo for studying social media responses to the earthquake of the time.
Sina-Weibo provides a platform through which the public can communicate with others. With the development of the Internet in recent years, there has been in particular a large surge in the number of phone application users, and people are more and more concerned about hot news and events. Sina-Weibo, as a major Chinese microblogging platform, plays a crucial role in the search and dissemination of hot information, especially in the event of an earthquake. This dataset can be used by academics to study the types of information most easily forwarded, commented, liked, the ways of information dissemination and data content categorization, and so on.

Acknowledgments
This work is supported by the National Key Research and Development Program of China (2016YFE0122600). We thank Dr. Pang Lushen from the Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences for his suggestions on the collection of this dataset. Thank Li Zhenyu from Shandong University of Science and Technology for his support on data processing.


1.
Ya’an Earthquake, available at: <https://en.wikipedia.org/wiki/2013_Lushan_earth

+?CSCD?·?Baidu Scholar


quake>.

+?CSCD?·?Baidu Scholar

2.
Sina-Weibo, available at: <https://en.wikipedia.org/wiki/Sina_Weibo>.

+?CSCD?·?Baidu Scholar

3.
Sutton J, Palen L & Irina S. Backchannels on the front lines: Emergent use of social media in the 2007 Southern California Fires, Proceedings of the Information Systems for Crisis Response and Management Conference (ISCRAM 2008), Washington, DC, 2008.

+?CSCD?·?Baidu Scholar

4.
Glaser M. California wildfire coverage by local media, blogs, Twitter, maps and more. PBS MediaShift. Available at: <http://mediashift.org/2007/10/california-wildfire-coverage-by-local-media-blogs-twitter-maps-and-more298/>.

+?CSCD?·?Baidu Scholar

5.
Vieweg S, Hughes AL, Starbird K et al. Microblogging during two natural hazards events: What twitter may contribute to situational awareness, Proc. CHI (2010): 1079 – 1088.

+?CSCD?·?Baidu Scholar

6.
Qu Y, Wu PF & Wang X. Online community response to major disaster: A study of Tianya Forum in the 2008 Sichuan Earthquake, Proc. HICCS, 2009.

+?CSCD?·?Baidu Scholar

7.
Qu Y, Huang C, Zhang P et al. Microblogging after a major disaster in China: A case study of the 2010 Yushu Earthquake, Proc. CSCW, 2011.

+?CSCD?·?Baidu Scholar

8.
Li J, He Z, Plaza J et al. Social media: New perspectives to improve remote sensing for emergency response, Proceedings of the IEEE 105 (2017): 1900 – 1912.

+?CSCD?·?Baidu Scholar

9.
Reuter C, Hughes AL & Kaufhold MA. Social media in crisis management: An evaluation and analysis of crisis informatics research, International Journal of Human–Computer Interaction 34 (2018): 280 – 294.

+?CSCD?·?Baidu Scholar

10.
Williams BD, Valero JN & Kim K. Social media, trust, and disaster: Does trust in public and nonprofit organizations explain social media use during a disaster? Quality & Quantity 52 (2018): 537 – 550.

+?CSCD?·?Baidu Scholar

11.
Park HW. YouTubers’ networking activities during the 2016 South Korea earthquake, Quality & Quantity 52 (2018): 1057 – 1068.

+?CSCD?·?Baidu Scholar


Data citation
1. Tian C, Li G, Yang T et al. A dataset of Ya'an Earthquake based on social media. Science Data Bank. DOI: 10.11922/sciencedb.560

稿件与作者信息

How to cite this article
Tian C, Li G, Yang T et al. A dataset of Ya'an Earthquake based on social media. China Scientific Data 3 (2018), DOI: 10.11922/csdata.2018.0004.en
Tian Chuanzhao
social media data collection and analysis, writing.
PhD; research area: disaster data mining.

Li Guoqing
advice on dataset design and data check, writing.
ligq@radi.ac.cn
PhD, Professor, research area: geospatial data infrastructure, remote sensing, big data.

Yang Tengfei
motivation of the research, writing.
PhD; research area: natural language processing, disaster information mining.

Li Zhenyu
data processing.
MSc; research area: data mining.

National Key Research and Development Program of China (2016YFE0122600)


相关话题/信息 数据 媒体 地震 雅安

  • 领限时大额优惠券,享本站正版考研考试资料!
    大额优惠券
    优惠券领取后72小时内有效,10万种最新考研考试考证类电子打印资料任你选。涵盖全国500余所院校考研专业课、200多种职业资格考试、1100多种经典教材,产品类型包含电子书、题库、全套资料以及视频,无论您是考研复习、考证刷题,还是考前冲刺等,不同类型的产品可满足您学习上的不同需求。 ...
    本站小编 Free壹佰分学习网 2022-09-19
  • 基于化合物分子结构的量化计算结果数据库
    摘要&关键词摘要:目前,大量已知结构的化合物缺乏基本物性数据和热动力学数据。为了进一步提高化学数据库中数据的完备性和拓展使用性,本数据库利用Gaussian03软件程序基于化合物结构数据库以及化合物基本信息资源对约20万个化合物的结构进行了数据分析和量化几何结构优化、光谱和频率以及热动力学计算模拟, ...
    本站小编 Free考研考试 2022-01-02
  • 《丝绸之路历史地理信息专题》卷首语
    丝绸之路是中西方贸易路线,也是民族迁徙、交流的大通道,其形成不晚于公元前5世纪,横亘亚欧大陆,荟萃罗马西欧文化、中国文化、印度文化、闪族伊斯兰文化等人类主要文化系统,在推动人类文明与经济文化交流中发挥了重要作用。国际国内对丝绸之路的研究从上世纪初开始,已历百年。随着考古遗址的发掘,出土文献的利用,丝 ...
    本站小编 Free考研考试 2022-01-02
  • 2007–2009年黄海底层水CTD观测及沉积环境因子数据集
    摘要&关键词摘要:2007–2009年通过搭载黄海冷水团航次及中国近海开放共享航次共4个航次,在黄海利用CTD获得了154个站位的经纬度、水深、底层水温度和盐度数据;通过154个站位的沉积物样品的采集和分析,获得了调查站位沉积物的粒度、含水量、有机质含量、叶绿素a及脱镁叶绿素a含量以及各参数分层分布 ...
    本站小编 Free考研考试 2022-01-02
  • 基于土地利用的长江经济带1970s末至2015年人类活动强度数据集
    摘要&关键词摘要:人类活动强度数据集可以用于评估人类活动对生物多样性的影响等。本数据集以中国国家尺度土地利用数据库(China’sLand-Use/coverDatasets,CLUDs)为数据源,采用生态系统综合人类扰动指数赋值方案,研制了长江经济带1970年代末、1980年代末、1995年、20 ...
    本站小编 Free考研考试 2022-01-02
  • 明清时期丝绸之路沿线城市建成区范围GIS数据集
    摘要&关键词摘要:城市建设是人类利用土地的主要形式之一。城市建成区的变化记录着城市系统演变的历史,反映了城市位置、规模和形态的变迁。丝绸之路沿线城市建成区的历史数据为研究这些城市的演化过程提供了数据支撑,为更长时段及其他城市要素的复原工作提供了数据基础。本文以城墙围合范围指代城市建成区范围,以明清时 ...
    本站小编 Free考研考试 2022-01-02
  • 清至民国石羊河流域聚落数据集
    摘要&关键词摘要:石羊河流域地处河西干旱区,是丝绸之路的必经之地,流域内聚落的变化对干旱地区社会与生态环境变迁有重要的指示作用。因此石羊河流域聚落数据集,不仅是研究干旱区生态环境变迁的重要数据,也是丝绸之路研究的基础数据。本数据集合方志、地理调查表、地图资料提取了清至民国流域内的聚落信息。通过详细地 ...
    本站小编 Free考研考试 2022-01-02
  • 唐代丝绸之路东中段交通线路数据集(618–907年)
    摘要&关键词摘要:丝绸之路交通线路是研究丝绸之路的重要基础,唐代丝绸之路交通路线奠定了历史丝绸之路交通的基本框架。本文以唐代(618–907年)丝绸之路东中段交通为研究对象,综合利用历史文献、考古成果,以及历史地理学和地理信息系统方法建立交通线路数据集,尽可能客观地反映唐代丝绸之路东中段交通面貌。本 ...
    本站小编 Free考研考试 2022-01-02
  • 晚清民国新疆地区湖泊、湿地数据集
    摘要&关键词摘要:干旱区湖泊和湿地是区域环境变化的敏感因子及指示器。历史时期新疆地区湖泊与湿地的重建数据不仅是全球变化所需要的基础水文数据,而且是历史时期丝绸之路研究必备的环境数据。通过对宣统元年(1909年)的《新疆全省舆图》,民国二十四年(1935年)新疆地区一套大比例尺军用地形图数字化处理,结 ...
    本站小编 Free考研考试 2022-01-02
  • 两汉丝绸之路交通数据集
    摘要&关键词摘要:本文以谷歌地球(GoogleEarth)提供的高清晰度卫星图片为基础,通过对历史文献、考古成果、今人研究等资料的梳理尽可能地实现对两汉时期沙漠绿洲丝绸之路主要交通点的精确地理定位,进而根据地形地貌特征复原这一时期丝绸之路的主要线路走向,最终形成包括交通点、交通线在内的两汉丝绸之路交 ...
    本站小编 Free考研考试 2022-01-02
  • 蒙元时期丝绸之路旅行家行程GIS数据集
    摘要&关键词摘要:蒙元时期丝绸之路上的旅行家为数甚多,其中有约15位的行程可供复原,复原工作对研究该时期丝绸之路的走向和不同时期路线的选择意义较大。本文收集整理了文献记载的旅行家途经地点,再依据现代研究成果、古今地图、GoogleEarth卫星影像等绘制往来路线。15位旅行家、使节从最早的耶律楚材( ...
    本站小编 Free考研考试 2022-01-02