Coronavirus (COVID-19) Geo-tagged Tweets Dataset
收藏Mendeley Data2024-01-31 更新2024-06-29 收录
下载链接:
https://ieee-dataport.org/open-access/coronavirus-covid-19-geo-tagged-tweets-dataset
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains IDs and sentiment scores of the geo-tagged tweets related to the COVID-19 pandemic. The tweets are captured by an on-going project deployed at https://live.rlamsal.com.np. The model monitors the real-time Twitter feed for coronavirus-related tweets using 90+ different keywords and hashtags that are commonly used while referencing the pandemic. Complying with Twitter's content redistribution policy, only the tweet IDs are shared. You can re-construct the dataset by hydrating these IDs. The tweet IDs in this dataset belong to the tweets tweeted providing an exact location.The paper associated with this dataset is available here: Design and analysis of a large-scale COVID-19 tweets dataset-------------------------------------Related datasets: (a) Coronavirus (COVID-19) Tweets Sentiment Trend (Global)(b) Tweets Originating from India During COVID-19 Lockdowns-------------------------------------Below is the quick overview of this dataset.— Dataset name: GeoCOV19Tweets Dataset— Number of tweets : 276,539 tweets— Coverage : Global— Language : English (EN)— Dataset usage terms : By using this dataset, you agree to (i) use the content of this dataset and the data generated from the content of this dataset for non-commercial research only, (ii) remain in compliance with Twitter's Developer Policy and (iii) cite the following paper:Lamsal, R. Design and analysis of a large-scale COVID-19 tweets dataset. Applied Intelligence (2020). https://doi.org/10.1007/s10489-020-02029-z— Primary dataset : Coronavirus (COVID-19) Tweets Dataset (COV19Tweets Dataset)— Dataset updates : Everyday— Active keywords and hashtags: keywords.tsvPlease visit this page (primary dataset) for details regarding the collection date and time (and other notes) of each CSV file present in this dataset.
本数据集收录了与新型冠状病毒肺炎(COVID-19)疫情相关的带地理标记推文的ID及其情感评分。相关推文由部署于https://live.rlamsal.com.np的一项在研项目采集。该模型依托90余种疫情讨论中常用的关键词与话题标签,实时监测与冠状病毒相关的推特(Twitter)信息流。根据推特内容重分发政策,本数据集仅共享推文ID,用户可通过水化(hydrating)这些ID来重构完整的原始推文数据集。本数据集内的所有推文ID均对应带有精准地理位置信息的推文。
本数据集对应的学术论文可参见:Design and analysis of a large-scale COVID-19 tweets dataset(《大规模新冠推文数据集的设计与分析》)。
关联数据集:
(a) 冠状病毒(COVID-19)推文情感趋势(全球版)(Coronavirus (COVID-19) Tweets Sentiment Trend (Global))
(b) 新冠疫情封锁期间源自印度的推文(Tweets Originating from India During COVID-19 Lockdowns)
本数据集快速概览如下:
— 数据集名称:GeoCOV19Tweets 数据集
— 推文总量:276,539 条
— 覆盖范围:全球
— 语言:英语(EN)
— 数据集使用条款:使用本数据集即代表您同意以下三点:(i) 仅将本数据集内容及基于该内容生成的数据用于非商业性学术研究;(ii) 严格遵守推特开发者政策(Twitter's Developer Policy);(iii) 引用如下学术文献:
> Lamsal, R. Design and analysis of a large-scale COVID-19 tweets dataset. *Applied Intelligence* (2020). https://doi.org/10.1007/s10489-020-02029-z
— 基础关联数据集:冠状病毒(COVID-19)推文数据集(COV19Tweets 数据集)
— 更新频率:每日
— 活跃关键词与话题标签文件:keywords.tsv
如需了解本数据集中各CSV文件的采集时间及其他相关说明,请访问该基础关联数据集页面获取详细信息。
创建时间:
2024-01-31



