five

User Perceived Disinformation on Reddit: Manual Classification

收藏
Figshare2020-12-21 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/User_Perceived_Disinformation_on_Reddit_Manual_Classification/13315259
下载链接
链接失效反馈
官方服务:
资源简介:
Manual annotation of Reddit comments as "flags" or "non-flags".Annotations used in training set for ML model and validation set for POS matcher:(1.200 comments in total)classified_regex_labeled_train.csvAnnotations used in test set for ML model and POS matcher:(300 comments in total)classified_regex_labeled_test.csvCodebook (for both files)idID number starting from 1indxIndex from larger file containing all matcheswordWhich keyword does it match (disinformation, fake news, misleading, unreliable, propaganda, bullshit)subm_titleTitle of Reddit post / submission.domainWeb domain of link shared in Reddit post.comm_bodyFull text of comment.disinformationMatched by POS matcher as "disinformation".fakenewsMatched by POS matcher as "fake news"bsMatched by POS matcher as "bullshit"misleadingMatched by POS matcher as "misleading/clickbait"unreliableMatched by POS matcher as "unreliable"propagandaMatched by POS matcher as "propaganda"sampleSample from keyword matching ("all") or sample from POS matches ("pos")matches_POSMatches at least one POS pattern with the POS matcher.consensusManual annotation (consensus of both coders):- f = comment was coded as "informal flag for false information"- n = comment was coded as "NOT informal flag for false information"- u = uncertain- na/r = removed for being automated message
创建时间:
2020-12-21
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作