five

Sentiment and topic analysis of LastQuake app user’s pictures with comments – Zagreb 2020 earthquake

收藏
data.ncl.ac.uk2023-06-02 更新2025-01-15 收录
下载链接:
https://data.ncl.ac.uk/articles/dataset/Supervised_polarity_and_topic_classification_of_LastQuake_app_user_s_pictures_with_comments_Zagreb_2020_earthquake/14687163/3
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains the sentiment analysis (SA) and topic classification (supervised) of the comments posted with pictures by LastQuake app users related to the 22nd March 2020 Zagreb earthquake. LastQuake app is a crowdsource-based earthquake information app that allows eyewitnesses to share information about the earthquakes they felt, combined with seismic data. This app was developed by the European-Mediterranean Seismological Centre (EMSC). Attributes and data contained in the database are: - eq_evid : Number of the earthquake the comment is associated with - eq_mag : magnitude of the event - eq_t0 : Origin time (UTC) - intensity: felt report intensity (as before leaving a comment users must leave a felt report) - epidist : distance from the event of the comment, in km - dt : response time from the origin time of the associated event, in seconds - rate_pos : number of positive rates * - rate_neg : number of negative rates* - device : device from which the comment was left, i.e.desktop, mobile or app - comm_valid : 0 or 1 depending on if we validated the comment or not. We invalidate comments when we consider them inappropriate (violence, insults,...) - language: Original language on which the comment was written - polarity: Polarity on which the comment is classified, i.e. positive, negative or neutral - topic: Building damages or intensity - comment: comment posted by LastQuake app user translated to English LastQuake app obtained 31,911intensity reports from its users with comments, considered as text data, from which it has been possible to translate 31,403 (98%). The citizens included in their comments 361 pictures. After data processing, 314 (87%) pictures were selected for damage assessment. However, this database contains the classification of only those intensity reports that include pictures and comments: 45. This clarification is because some intensity reports from LasQuake app users include only comments or only pictures and some of them include both, and these are the intensity reports contained in this database. The supervised or unsupervised classification of the total number of comments posted by the LastQuake app users' with respect to the 22nd March 2020 Zagreb earthquake will be displayed in another database in the future.

本数据集包含了由 LastQuake 应用用户在 2020 年 3 月 22 日萨格勒布地震期间发表的图片评论的情感分析(SA)和主题分类(监督学习)。LastQuake 应用是一款基于众包的地震信息应用,允许目击者分享他们感受到的地震信息,并结合地震数据。该应用由欧洲地中海地震中心(EMSC)开发。数据库中包含的属性和数据如下: - eq_evid:与评论关联的地震编号 - eq_mag:事件震级 - eq_t0:发震时间(UTC时间) - intensity:感受报告强度(用户在留言前必须提交感受报告) - epidist:评论事件距离,单位为千米 - dt:从关联事件发震时间起至响应时间,单位为秒 - rate_pos:正面评价数量 - rate_neg:负面评价数量 - device:留言所使用的设备,即桌面、移动设备或应用 - comm_valid:0 或 1,表示是否验证了评论。我们认为不适当的评论(如暴力、侮辱等)将被视为无效 - language:评论所写原始语言 - polarity:评论被分类的极性,即正面、负面或中性 - topic:建筑损坏或强度 - comment:LastQuake 应用用户发表的评论,已翻译成英语 LastQuake 应用从其用户处收集了 31,911 份包含评论的强度报告,作为文本数据,从中成功翻译了 31,403 份(98%)。评论中包含了 361 张图片。经过数据处理后,选出了 314 张(87%)图片用于损害评估。然而,此数据库仅包含包含图片和评论的强度报告的分类:45 条。此说明的原因是,一些来自 LastQuake 应用用户的强度报告仅包含评论或仅包含图片,其中一些同时包含两者,而这些强度报告包含在本数据库中。关于 LastQuake 应用用户在 2020 年 3 月 22 日萨格勒布地震期间发表的评论的监督或无监督分类将在未来的另一个数据库中展示。
提供机构:
Newcastle University
二维码
社区交流群
二维码
科研交流群
商业服务