five

TM-Senti

收藏
DataCite Commons2025-06-01 更新2024-07-28 收录
下载链接:
https://figshare.com/articles/dataset/TM-Senti/16438281/1
下载链接
链接失效反馈
官方服务:
资源简介:
This is a large-scale, multilingual and longitudinal Twitter sentiment dataset sampled through distant supervision from the Twitter Stream Grab archive (https://archive.org/details/twitterstream). It covers the time period between January 2013 and June 2020 for 7 languages:- Arabic (ar)<br>- German (de)<br>- English (en)- Spanish (es)- French (fr)- Italian (it)- Chinese (zh)<br>With the files in this repository, we provide tweet IDs that can be used to rehydrate the datasets by using the files available from the Twitter Stream Grab.<br><br>Files are formatted as TSV files, with the following columns:<br>date \t tweetid \t sentiment \t evidence<br>where:- date is the day in which the tweet was posted.- tweetid is the ID of the tweet- sentiment is either pos or neg- evidence is the set of emojis or emoticons used to determine if the tweet was positive or negative.<br><br>More details about the dataset can be found in the following paper (please cite the paper if you use the dataset):<br>TBA<br>
提供机构:
figshare
创建时间:
2021-08-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作