Sentiment140 dataset with 1.6 million tweets
收藏www.kaggle.com2017-09-13 更新2025-03-23 收录
下载链接:
https://www.kaggle.com/kazanova/sentiment140
下载链接
链接失效反馈官方服务:
资源简介:
### Context
This is the sentiment140 dataset. It contains 1,600,000 tweets extracted using the twitter api . The tweets have been annotated (0 = negative, 4 = positive) and they can be used to detect sentiment .
### Content
It contains the following 6 fields:
1. **target**: the polarity of the tweet (*0* = negative, *2* = neutral, *4* = positive)
2. **ids**: The id of the tweet ( *2087*)
3. **date**: the date of the tweet (*Sat May 16 23:58:44 UTC 2009*)
4. **flag**: The query (*lyx*). If there is no query, then this value is NO_QUERY.
5. **user**: the user that tweeted (*robotickilldozr*)
6. **text**: the text of the tweet (*Lyx is cool*)
### Acknowledgements
The official link regarding the dataset with resources about how it was generated is [here][1]
The official paper detailing the approach is [here][2]
Citation: Go, A., Bhayani, R. and Huang, L., 2009. Twitter sentiment classification using distant supervision. *CS224N Project Report, Stanford, 1(2009), p.12*.
### Inspiration
To detect severity from tweets. You [may have a look at this][3].
[1]: http://%20http://help.sentiment140.com/for-students/
[2]: http://bhttp://cs.stanford.edu/people/alecmgo/papers/TwitterDistantSupervision09.pdf
[3]: https://www.linkedin.com/pulse/social-machine-learning-h2o-twitter-python-marios-michailidis
此为 sentiment140 数据集。该数据集包含通过 Twitter API 提取的 1,600,000 条推文。推文已进行标注(0 表示负面,4 表示正面),可用于检测情感倾向。
数据集包含以下 6 个字段:
1. **目标**:推文的极性(*0* 表示负面,*2* 表示中性,*4* 表示正面)
2. **ID**:推文的 ID(例如:*2087*)
3. **日期**:推文的发布日期(例如:*Sat May 16 23:58:44 UTC 2009*)
4. **查询标记**:查询条件(例如:*lyx*)。若无查询条件,则此值显示为 NO_QUERY。
5. **用户**:发布推文的用户(例如:*robotickilldozr*)
6. **文本**:推文的内容(例如:*Lyx is cool*)
致谢:有关数据集及其生成方法的官方链接请见[此处][1]。详细描述方法的官方论文请见[此处][2]。引用:Go, A.,Bhayani, R. 和 Huang, L.,2009. 基于 distant supervision 的 Twitter 情感分类。*CS224N 项目报告,斯坦福,1(2009),p.12*。
灵感来源:用于从推文中检测严重程度。[您可以参考此处][3]。
[1]: http://help.sentiment140.com/for-students/
[2]: http://cs.stanford.edu/people/alecmgo/papers/TwitterDistantSupervision09.pdf
[3]: https://www.linkedin.com/pulse/social-machine-learning-h2o-twitter-python-marios-michailidis
提供机构:
www.kaggle.com
搜集汇总
数据集介绍

以上内容由遇见数据集搜集并总结生成



