TM-Senti
收藏Figshare2021-08-25 更新2026-04-08 收录
下载链接:
https://figshare.com/articles/dataset/TM-Senti/16438281/1
下载链接
链接失效反馈官方服务:
资源简介:
This is a large-scale, multilingual and longitudinal Twitter sentiment dataset sampled through distant supervision from the Twitter Stream Grab archive (https://archive.org/details/twitterstream). It covers the time period between January 2013 and June 2020 for 7 languages:- Arabic (ar)<br>- German (de)<br>- English (en)- Spanish (es)- French (fr)- Italian (it)- Chinese (zh)<br>With the files in this repository, we provide tweet IDs that can be used to rehydrate the datasets by using the files available from the Twitter Stream Grab.<br><br>Files are formatted as TSV files, with the following columns:<br>date \t tweetid \t sentiment \t evidence<br>where:- date is the day in which the tweet was posted.- tweetid is the ID of the tweet- sentiment is either pos or neg- evidence is the set of emojis or emoticons used to determine if the tweet was positive or negative.<br><br>More details about the dataset can be found in the following paper (please cite the paper if you use the dataset):<br>TBA<br>
提供机构:
Zubiaga, Arkaitz; Yin, Wenjie; Alkhalifa, Rabab
创建时间:
2021-08-25



