five

User Perceived Disinformation on Reddit: Manual Classification

收藏
DataCite Commons2020-12-21 更新2024-07-28 收录
下载链接:
https://figshare.com/articles/dataset/User_Perceived_Disinformation_on_Reddit_Manual_Classification/13315259
下载链接
链接失效反馈
官方服务:
资源简介:
Manual annotation of Reddit comments as "flags" or "non-flags".<br><br>Annotations used in <b>training set</b> for ML model and validation set for POS matcher:(1.200 comments in total)<br>classified_regex_labeled_train.csv<br><br><br>Annotations used in <b>test set</b> for ML model and POS matcher:<br>(300 comments in total)<br>classified_regex_labeled_test.csv<br><br><br><b>Codebook </b>(for both files)<br><b>id</b>ID number starting from 1<b>indx</b>Index from larger file containing all matches<b>word</b>Which keyword does it match (disinformation, fake news, misleading, unreliable, propaganda, bullshit)<b>subm_title</b>Title of Reddit post / submission.<b>domain</b>Web domain of link shared in Reddit post.<b>comm_body</b>Full text of comment.<b>disinformation</b>Matched by POS matcher as "disinformation".<b>fakenews</b>Matched by POS matcher as "fake news"<b><br></b><b>bs</b>Matched by POS matcher as "bullshit"<b><br></b><b>misleading</b>Matched by POS matcher as "misleading/clickbait"<b><br></b><b>unreliable</b>Matched by POS matcher as "unreliable"<b><br></b><b>propaganda</b>Matched by POS matcher as "propaganda"<b><br></b><b>sample</b>Sample from keyword matching ("all") or sample from POS matches ("pos")<b>matches_POS</b>Matches at least one POS pattern with the POS matcher.<b>consensus</b>Manual annotation (consensus of both coders):- f = comment was coded as "informal flag for false information"- n = comment was coded as "NOT informal flag for false information"- u = uncertain- na/r = removed for being automated message<br><br><br>
提供机构:
figshare
创建时间:
2020-12-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作