User Perceived Disinformation on Reddit: Manual Classification

Figshare. Updated 2020-12-21; indexed 2026-04-28.

Download link:

https://figshare.com/articles/dataset/User_Perceived_Disinformation_on_Reddit_Manual_Classification/13315259

Description:

Manual annotation of Reddit comments as "flags" or "non-flags".

Annotations used in the training set for the ML model and the validation set for the POS matcher (1,200 comments in total):
classified_regex_labeled_train.csv

Annotations used in the test set for the ML model and the POS matcher (300 comments in total):
classified_regex_labeled_test.csv

Codebook (for both files):

id: ID number starting from 1.
indx: Index from the larger file containing all matches.
word: The keyword the comment matches (disinformation, fake news, misleading, unreliable, propaganda, bullshit).
subm_title: Title of the Reddit post / submission.
domain: Web domain of the link shared in the Reddit post.
comm_body: Full text of the comment.
disinformation: Matched by the POS matcher as "disinformation".
fakenews: Matched by the POS matcher as "fake news".
bs: Matched by the POS matcher as "bullshit".
misleading: Matched by the POS matcher as "misleading/clickbait".
unreliable: Matched by the POS matcher as "unreliable".
propaganda: Matched by the POS matcher as "propaganda".
sample: Sample from keyword matching ("all") or sample from POS matches ("pos").
matches_POS: Matches at least one POS pattern with the POS matcher.
consensus: Manual annotation (consensus of both coders):
- f = comment was coded as "informal flag for false information"
- n = comment was coded as "NOT informal flag for false information"
- u = uncertain
- na/r = removed for being automated message
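
The codebook above suggests a simple way to turn either file into binary training labels. The following is a minimal sketch, not the authors' pipeline. It assumes the CSV files are comma-delimited with a header row using the codebook's column names, and that the consensus column holds the literal codes f, n, u, and na/r; none of this is confirmed by the dataset description. The helper names load_labeled_comments and label_counts are hypothetical.

```python
import csv
from collections import Counter


def load_labeled_comments(lines):
    """Keep only rows whose consensus code is 'f' (flag) or 'n' (non-flag).

    `lines` is any iterable of text lines, such as an open text file.
    Assumption: the first line is a header with the codebook column names.
    Rows coded 'u' (uncertain) or 'na/r' (removed) are dropped.
    """
    kept = []
    for row in csv.DictReader(lines):
        label = (row.get("consensus") or "").strip().lower()
        if label in ("f", "n"):
            kept.append({
                "id": row.get("id"),
                "text": row.get("comm_body", ""),
                "is_flag": label == "f",
            })
    return kept


def label_counts(rows):
    """Count flags (True) and non-flags (False) among the kept rows."""
    return Counter(r["is_flag"] for r in rows)


# Hypothetical usage, assuming the file sits in the working directory:
# with open("classified_regex_labeled_train.csv", newline="", encoding="utf-8") as f:
#     rows = load_labeled_comments(f)
#     print(label_counts(rows))
```

Dropping the u and na/r rows is a design choice made here for illustration; whether the original study excluded them from model training is not stated in the description.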

Created:

2020-12-21
