Coronavirus (COVID-19) Geo-tagged Tweets Dataset
收藏DataCite Commons2020-12-08 更新2025-04-16 收录
下载链接:
https://ieee-dataport.org/open-access/coronavirus-covid-19-geo-tagged-tweets-dataset
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains IDs and sentiment scores of the geo-tagged tweets related to the COVID-19 pandemic. The tweets are captured by an on-going project deployed at https://live.rlamsal.com.np. The model monitors the real-time Twitter feed for coronavirus-related tweets using 90+ different keywords and hashtags that are commonly used while referencing the pandemic. Complying with Twitter's content redistribution policy, only the tweet IDs are shared. You can re-construct the dataset by hydrating these IDs. The tweet IDs in this dataset belong to the tweets tweeted providing an exact location.The paper associated with this dataset is available here: Design and analysis of a large-scale COVID-19 tweets dataset-------------------------------------Related datasets: (a) Coronavirus (COVID-19) Tweets Sentiment Trend (Global)(b) Tweets Originating from India During COVID-19 Lockdowns-------------------------------------Below is the quick overview of this dataset.— Dataset name: GeoCOV19Tweets Dataset— Number of tweets : 276,539 tweets— Coverage : Global— Language : English (EN)— Dataset usage terms : By using this dataset, you agree to (i) use the content of this dataset and the data generated from the content of this dataset for non-commercial research only, (ii) remain in compliance with Twitter's Developer Policy and (iii) cite the following paper:Lamsal, R. Design and analysis of a large-scale COVID-19 tweets dataset. Applied Intelligence (2020). https://doi.org/10.1007/s10489-020-02029-z— Primary dataset : Coronavirus (COVID-19) Tweets Dataset (COV19Tweets Dataset)— Dataset updates : Everyday— Active keywords and hashtags: keywords.tsvPlease visit this page (primary dataset) for details regarding the collection date and time (and other notes) of each CSV file present in this dataset.
这个数据集包含与新冠疫情(COVID-19 pandemic)相关的地理标记推文(geo-tagged tweets)的ID和情感分数。这些推文由部署于https://live.rlamsal.com.np的一个持续项目捕获。该模型使用90余种在提及疫情时常用的关键词和标签,监控实时Twitter流中的新冠病毒相关推文。遵循Twitter的内容再分发政策,仅共享推文ID。您可通过激活(hydrating)这些ID来重建数据集。本数据集的推文ID对应于提供精确位置的推文。与本数据集相关的论文可在此获取:《大规模新冠疫情推文数据集的设计与分析》
-------------------------------------
相关数据集:(a) 新冠病毒(COVID-19)推文情感趋势(全球)(b) 新冠疫情封锁期间源自印度的推文
-------------------------------------
以下为本数据集的快速概览:
— 数据集名称:GeoCOV19Tweets数据集
— 推文数量:276,539条
— 覆盖范围:全球
— 语言:英语(EN)
— 数据集使用条款:使用本数据集即表示您同意(i)仅将本数据集内容及由此生成的数据用于非商业研究;(ii)遵守Twitter开发者政策;(iii)引用以下论文:Lamsal, R. 《大规模新冠疫情推文数据集的设计与分析》,《应用智能》(2020),https://doi.org/10.1007/s10489-020-02029-z
— 主数据集:新冠病毒(COVID-19)推文数据集(COV19Tweets Dataset)
— 数据集更新:每日
— 活跃关键词与标签:keywords.tsv
请访问此页面(主数据集)以获取本数据集中各CSV文件的收集日期、时间及其他说明的详细信息。
提供机构:
IEEE DataPort
创建时间:
2020-12-08



