Coronavirus (COVID-19) Geo-tagged Tweets Dataset
收藏Mendeley Data2024-01-31 更新2024-06-29 收录
下载链接:
https://ieee-dataport.org/open-access/coronavirus-covid-19-geo-tagged-tweets-dataset
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains IDs and sentiment scores of the geo-tagged tweets related to the COVID-19 pandemic. The tweets are captured by an on-going project deployed at https://live.rlamsal.com.np. The model monitors the real-time Twitter feed for coronavirus-related tweets using 90+ different keywords and hashtags that are commonly used while referencing the pandemic. Complying with Twitter's content redistribution policy, only the tweet IDs are shared. You can re-construct the dataset by hydrating these IDs. The tweet IDs in this dataset belong to the tweets tweeted providing an exact location.The paper associated with this dataset is available here: Design and analysis of a large-scale COVID-19 tweets dataset-------------------------------------Related datasets: (a) Coronavirus (COVID-19) Tweets Sentiment Trend (Global)(b) Tweets Originating from India During COVID-19 Lockdowns-------------------------------------Below is the quick overview of this dataset.— Dataset name: GeoCOV19Tweets Dataset— Number of tweets : 283,960 tweets— Coverage : Global— Language : English (EN)— Dataset usage terms : By using this dataset, you agree to (i) use the content of this dataset and the data generated from the content of this dataset for non-commercial research only, (ii) remain in compliance with Twitter's Developer Policy and (iii) cite the following paper:Lamsal, R. Design and analysis of a large-scale COVID-19 tweets dataset. Applied Intelligence (2020). https://doi.org/10.1007/s10489-020-02029-z— Primary dataset : Coronavirus (COVID-19) Tweets Dataset (COV19Tweets Dataset)— Dataset updates : Everyday— Active keywords and hashtags: keywords.tsvPlease visit this page (primary dataset) for details regarding the collection date and time (and other notes) of each CSV file present in this dataset.
本数据集包含与新型冠状病毒肺炎(COVID-19)疫情相关的带地理标签推文(geo-tagged tweets)的ID与情感评分。这些推文由部署于https://live.rlamsal.com.np的一项持续运行项目采集。该模型通过90余个疫情讨论中常用的关键词与话题标签(hashtags),实时监测与冠状病毒相关的推特信息流。遵循推特内容再分发政策,本数据集仅共享推文ID,可通过推文水化(hydrating)操作重构该数据集——即通过这些ID还原完整的推文内容。本数据集内的推文ID均对应带有精确地理位置的推文。
本数据集对应的关联研究论文为:《大规模新冠推文数据集的设计与分析》(Design and analysis of a large-scale COVID-19 tweets dataset)。
相关数据集:
(a) 冠状病毒(COVID-19)推文情感趋势(全球版)
(b) 新冠疫情封锁期间源自印度的推文
以下为本数据集的快速概览:
— 数据集名称:GeoCOV19Tweets 数据集
— 推文总量:283,960条
— 覆盖范围:全球
— 语言:英语(EN)
— 数据集使用条款:使用本数据集即表示您同意以下三项要求:(i) 仅将本数据集内容及由其生成的数据用于非商业性研究;(ii) 始终遵守推特开发者政策;(iii) 引用如下学术论文:
Lamsal, R. Design and analysis of a large-scale COVID-19 tweets dataset. Applied Intelligence (2020). https://doi.org/10.1007/s10489-020-02029-z
— 基础数据集:冠状病毒(COVID-19)推文数据集(COV19Tweets 数据集)
— 数据集更新频率:每日更新
— 活跃关键词与话题标签文件:keywords.tsv
请访问该基础数据集页面,以了解本数据集中各CSV文件的采集时间及其他相关说明。
创建时间:
2024-01-31



