Twitter_data

Source: IEEE (indexed 2026-04-17)
Download link:
https://ieee-dataport.org/documents/twitterdata
Description:
Textual Content for Each Tweet
Under the folder "twitter2015"
Under the folder "twitter2017"

Visual Content for Each Tweet
Under the folder "twitter2015_images"
Under the folder "twitter2017_images"

Description

Data Split
We randomly split our annotated data into training (60%), development (20%), and test (20%) sets.

Format
We provide two formats: ".txt" for LSTM-based models and ".tsv" for BERT models.

For example, each row of "train.tsv" for twitter2015 is one sample:
(1). the first column is the index;
(2). the second column is the sentiment label (0 refers to negative, 1 refers to neutral, and 2 refers to positive);
(3). the third column is the id of the corresponding image for this tweet, which can be found in the folder "twitter2015_images";
(4). the fourth and fifth columns are, respectively, the original tweet with the current opinion target masked, and the opinion target (i.e., entity).

Note that because each tweet may contain multiple opinion targets (i.e., entities), it may correspond to several consecutive samples. E.g., the first and second samples in "train.tsv" for twitter2015 are about the same tweet but different entities.

The ".txt" file is similar to "train.tsv", but every four lines in the file form one sample:
(1). the first line is the original tweet with the current opinion target masked;
(2). the second line is the opinion target (i.e., entity);
(3). the third line is the sentiment label (note that here -1 refers to negative, 0 refers to neutral, and 1 refers to positive);
(4). the fourth line is the id of the corresponding image for this tweet, which can be found in the folder "twitter2015_images".
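The two layouts described above can be read with a few lines of Python. The following is a minimal sketch, not code shipped with the dataset: the names Sample, read_tsv, and read_txt are hypothetical, the ".tsv" columns are assumed to be tab-separated and unquoted, and whether the file starts with a header row is not stated here, so that is left as an optional flag. The ".txt" labels (-1/0/1) are shifted by one so that both readers return the 0/1/2 convention of the ".tsv" files.

```python
import csv
from typing import Iterator, NamedTuple, TextIO


class Sample(NamedTuple):
    """One (tweet, opinion target) pair; a hypothetical container for illustration."""
    masked_tweet: str  # tweet text with the current opinion target masked
    target: str        # the opinion target (entity)
    label: int         # 0 = negative, 1 = neutral, 2 = positive
    image_id: str      # id of the image in the matching *_images folder


def read_tsv(handle: TextIO, skip_header: bool = False) -> Iterator[Sample]:
    """Yield samples from a .tsv file: index, label, image id, masked tweet, target.

    Assumes tab-separated, unquoted columns; set skip_header=True if the
    file begins with a header row (an assumption, not stated in the description).
    """
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    if skip_header:
        next(reader, None)
    for row in reader:
        if len(row) < 5:
            continue  # skip blank or malformed rows
        _index, label, image_id, masked_tweet, target = row[:5]
        yield Sample(masked_tweet, target, int(label), image_id)


def read_txt(handle: TextIO) -> Iterator[Sample]:
    """Yield samples from a .txt file, where every four lines form one sample."""
    lines = [line.rstrip("\r\n") for line in handle]
    # Step through complete groups of four lines; a trailing partial group is ignored.
    for i in range(0, len(lines) - 3, 4):
        masked_tweet, target, label, image_id = lines[i:i + 4]
        # Shift -1/0/1 to 0/1/2 so both formats share one label convention.
        yield Sample(masked_tweet, target.strip(), int(label) + 1, image_id.strip())
```

Either reader can be passed an open file, e.g. open(path, encoding="utf-8", newline=""), where path points to a file such as "train.tsv" under the "twitter2015" folder. Because a tweet with several opinion targets appears as consecutive samples, grouping the results by image_id and tweet text is one way to recover the per-tweet view.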
Provider:
Aoqiang Zhu