five

Dataset

收藏
DataCite Commons2021-10-17 更新2024-07-28 收录
下载链接:
https://figshare.com/articles/dataset/Dataset/16823500
下载链接
链接失效反馈
官方服务:
资源简介:
text -> Original tweet text, as downloaded trough the Twitter APItext_final -> Text after cleanup, as used for lexicon matchinghour -> Date and time (only hour) at which the tweet was publishedtime_cat -> Dummy. Whether the tweet was published during the event [“cuenta”] or before/after the event [“no_cuenta”]rt_count -> Number of retweets at the moment of the download of the dataurl -> Dummy. The tweet text includes a urlmedia -> Dummy. The tweet had photo or videoEmoWord -> Total number of matches for emotional words or stems in the tweet textMoralWord -> Total number of matches for moral words or stems in the tweet textEmoMoralWord -> Total number of matches for both moral and emotional words or stems in the tweet textOnlyEmoWord -> Number of matches for emotional words or stems in tweet text (excluding those which also matched moral words or stems)OnlyMoralWord -> Number of matches for moral words or stems in tweet text (excluding those which also matched emotional words or stems)foll_div10 -> Number of followers of the account that published the tweet at the moment of the data download, divided by 10,000

text:原始推文文本,即通过推特API(Twitter API)下载得到的推文原文 text_final:经过清洗处理后的文本,用于词典匹配任务 hour:推文发布的日期与时间(仅保留小时维度) time_cat:虚拟变量(Dummy),用于标记推文发布时段:属于事件时段["cuenta"]或事件前后时段["no_cuenta"] rt_count:数据下载时刻该推文的转发次数 url:虚拟变量(Dummy),用于标记推文文本是否包含链接 media:虚拟变量(Dummy),用于标记推文是否包含图片或视频内容 EmoWord:推文文本中情感词汇或词干的匹配总次数 MoralWord:推文文本中道德词汇或词干的匹配总次数 EmoMoralWord:推文文本中同时匹配情感与道德词汇或词干的总次数 OnlyEmoWord:推文文本中仅匹配情感词汇或词干(排除同时匹配道德词汇的情况)的次数 OnlyMoralWord:推文文本中仅匹配道德词汇或词干(排除同时匹配情感词汇的情况)的次数 foll_div10:数据下载时刻发布该推文的账号的粉丝数,除以10000后的结果
提供机构:
figshare
创建时间:
2021-10-17
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作