Corona Virus (COVID-19) Tweets Dataset (en)

IEEE | Updated 2020-04-05 | Indexed 2026-04-17
Download link:
https://ieee-dataport.org/open-access/corona-virus-covid-19-tweets-dataset-en
Description:
Tweets Counter: 17,713,153

This dataset includes CSV files which contain the tweet IDs. The tweets have been collected by the LSTM model deployed at sentiment.live. The model monitors the real-time Twitter feed for coronavirus-related tweets, using the filters language “en” and keyword “corona”. As per the Twitter Developer Policy, it is not possible for me to provide information other than the Tweet IDs (this dataset was completely redesigned on March 20, 2020, to comply with the data sharing policies set by Twitter).

Note: This dataset should be used solely for non-commercial research purposes (ignore every other LICENSE category given on this page).

Schema of the CSV files:
First column: tweet ID
Second column: sentiment score for the particular tweet

File details (tweets collected in GMT+0; local times mentioned below are GMT+5:45):
corona_tweets_01.csv: 831,327 tweets (March 20, 2020 01:37 AM - March 20, 2020 10:28 AM)
corona_tweets_02.csv: 870,924 tweets (March 20, 2020 10:31 AM - March 20, 2020 09:43 PM)
corona_tweets_03.csv: 773,729 tweets (March 20, 2020 09:49 PM - March 21, 2020 09:25 AM)
corona_tweets_04.csv: 1,233,340 tweets (March 21, 2020 09:27 AM - March 22, 2020 07:46 AM)
corona_tweets_05.csv: 1,782,157 tweets (March 22, 2020 07:50 AM - March 23, 2020 09:08 AM)
corona_tweets_06.csv: 1,771,295 tweets (March 23, 2020 09:11 AM - March 24, 2020 11:35 AM)
corona_tweets_07.csv: 1,479,651 tweets (March 24, 2020 11:42 AM - March 25, 2020 11:43 AM)
corona_tweets_08.csv: 1,272,592 tweets (March 25, 2020 11:47 AM - March 26, 2020 12:46 PM)
corona_tweets_09.csv: 1,091,429 tweets (March 26, 2020 12:51 PM - March 27, 2020 11:53 AM)
corona_tweets_10.csv: 1,172,013 tweets (March 27, 2020 11:56 AM - March 28, 2020 01:59 PM)
corona_tweets_11.csv: 1,141,210 tweets (March 28, 2020 02:03 PM - March 29, 2020 04:01 PM)

Gap, March 29, 2020 04:05 PM - March 30, 2020 12:30 PM: Some folk(s) messed around with the server. Tweets for this period won't be available. However, I will continue adding new Tweet IDs. Some preventive measures have been taken. Sorry for the inconvenience.

corona_tweets_12.csv: 793,417 tweets (March 30, 2020 02:01 PM - March 31, 2020 10:16 AM)
corona_tweets_13.csv: 1,029,294 tweets (March 31, 2020 10:20 AM - April 01, 2020 10:59 AM)
corona_tweets_14.csv: 920,076 tweets (April 01, 2020 11:02 AM - April 02, 2020 12:19 PM)
corona_tweets_15.csv: 826,271 tweets (April 02, 2020 12:21 PM - April 03, 2020 02:38 PM)
corona_tweets_16.csv: 612,512 tweets (April 03, 2020 02:40 PM - April 04, 2020 11:54 AM)

To make it easy for NLP researchers to access the sentiment analysis of each collected tweet, the sentiment score from TextBlob [1] has been appended as the second column. New databases will be added to this dataset every day. Bookmark this page for further updates.

[1] https://textblob.readthedocs.io/en/dev/
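As a rough illustration of the schema and time-zone notes above, the following Python sketch reads a two-column file (tweet ID, sentiment score) and converts one of the listed local timestamps to UTC. Several things here are assumptions rather than facts from the dataset page: that the files have no header row, that the score is a signed value where 0 is neutral (as with TextBlob polarity), and that "GMT+5:45" is a fixed offset. The function names and the sample rows are made up for the example.

```python
import csv
import io
from datetime import datetime, timedelta, timezone

# Assumption: the listing's "GMT+5:45" local time is a fixed UTC+05:45 offset.
LOCAL_TZ = timezone(timedelta(hours=5, minutes=45))


def read_tweet_rows(handle):
    """Yield (tweet_id, score) pairs from a two-column CSV stream.

    Assumption: no header row; rows that do not parse are skipped.
    """
    for row in csv.reader(handle):
        if len(row) < 2:
            continue
        try:
            # Keep IDs as Python ints: they exceed float64's exact integer range.
            yield int(row[0]), float(row[1])
        except ValueError:
            continue  # e.g. a header line, if the real files have one


def summarize(rows):
    """Count rows by the sign of the score (the 0 threshold is illustrative)."""
    counts = {"negative": 0, "neutral": 0, "positive": 0}
    for _, score in rows:
        if score > 0:
            counts["positive"] += 1
        elif score < 0:
            counts["negative"] += 1
        else:
            counts["neutral"] += 1
    return counts


def local_to_utc(text):
    """Convert a listing timestamp such as 'March 20, 2020 01:37 AM' to UTC."""
    local = datetime.strptime(text, "%B %d, %Y %I:%M %p").replace(tzinfo=LOCAL_TZ)
    return local.astimezone(timezone.utc)


if __name__ == "__main__":
    # Made-up sample rows standing in for a file like corona_tweets_01.csv.
    sample = io.StringIO(
        "1240727808080064512,0.25\n"
        "1240727808080064513,-0.5\n"
        "1240727808080064514,0.0\n"
    )
    print(summarize(read_tweet_rows(sample)))
    print(local_to_utc("March 20, 2020 01:37 AM").isoformat())
```

To use it on a real file, pass an open file handle, for example open("corona_tweets_01.csv", newline=""), to read_tweet_rows in place of the in-memory sample. Because only tweet IDs are distributed, the tweet text itself would have to be retrieved separately through Twitter's own API.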
Provided by:
JNU, New Delhi
Created:
2020-04-05