User Perceived Disinformation on Reddit: Manual Classification

Name: User Perceived Disinformation on Reddit: Manual Classification
Creator: figshare
Published: 2020-12-21 12:43:22
License: 暂无描述

DataCite Commons2020-12-21 更新2024-07-28 收录

下载链接：

https://figshare.com/articles/dataset/User_Perceived_Disinformation_on_Reddit_Manual_Classification/13315259

下载链接

链接失效反馈

官方服务：

资源简介：

Manual annotation of Reddit comments as "flags" or "non-flags". Annotations used in training set for ML model and validation set for POS matcher:(1.200 comments in total) classified_regex_labeled_train.csv Annotations used in test set for ML model and POS matcher: (300 comments in total) classified_regex_labeled_test.csv Codebook (for both files) idID number starting from 1indxIndex from larger file containing all matcheswordWhich keyword does it match (disinformation, fake news, misleading, unreliable, propaganda, bullshit)subm_titleTitle of Reddit post / submission.domainWeb domain of link shared in Reddit post.comm_bodyFull text of comment.disinformationMatched by POS matcher as "disinformation".fakenewsMatched by POS matcher as "fake news" bsMatched by POS matcher as "bullshit" misleadingMatched by POS matcher as "misleading/clickbait" unreliableMatched by POS matcher as "unreliable" propagandaMatched by POS matcher as "propaganda" sampleSample from keyword matching ("all") or sample from POS matches ("pos")matches_POSMatches at least one POS pattern with the POS matcher.consensusManual annotation (consensus of both coders):- f = comment was coded as "informal flag for false information"- n = comment was coded as "NOT informal flag for false information"- u = uncertain- na/r = removed for being automated message

提供机构：

figshare

创建时间：

2020-12-01

5,000+

优质数据集

54 个

任务类型

进入经典数据集