CHECKED
Source: ModelScope community (updated 2025-10-03, indexed 2024-08-31)
Download link: https://modelscope.cn/datasets/OmniData/CHECKED
Resource description:
displayName: CHECKED
labelTypes:
- Chinese Corpus
license:
- CHECKED Custom
paperUrl: https://arxiv.org/pdf/2010.09029v2.pdf
publishDate: "2021"
publishUrl: https://github.com/cyang03/CHECKED
publisher:
- Syracuse University
tags:
- Text
- Fake News
taskTypes:
- Misinformation
---
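# Dataset Introduction
## Overview
The COVID-19 pandemic has affected everyone's life. To maintain social distancing and avoid exposure, work and daily life have gradually moved online. With this shift, the use of social media to follow COVID-19 news has increased, and COVID-19 misinformation frequently spreads on these platforms. In this work, we develop CHECKED, the first Chinese dataset on COVID-19 misinformation. CHECKED provides 2,104 verified COVID-19-related microblogs (Weibo posts) from December 2019 to August 2020, identified using a specific list of keywords. It also includes 1,868,175 reposts, 1,185,702 comments, and 56,852,736 likes, which show how these verified microblogs spread and how users reacted to them on Weibo. For each microblog, the dataset contains a rich set of multimedia information: ground-truth labels and textual, visual, temporal, and network information. Extensive experiments have been conducted to analyze the CHECKED data and to provide benchmark results for well-established methods when predicting fake news with CHECKED. We hope that CHECKED can facilitate research on COVID-19 misinformation.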
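To give a sense of scale, the engagement totals quoted in the overview can be turned into per-post averages. The following is a minimal sketch that uses only the figures stated above; the names `NUM_POSTS`, `TOTALS`, and `per_post_averages` are illustrative and not part of the dataset or its tooling.

```python
# Figures as stated in the dataset overview.
NUM_POSTS = 2_104
TOTALS = {
    "reposts": 1_868_175,
    "comments": 1_185_702,
    "likes": 56_852_736,
}


def per_post_averages(totals, num_posts):
    """Return the mean count per verified post for each engagement type."""
    return {name: count / num_posts for name, count in totals.items()}


averages = per_post_averages(TOTALS, NUM_POSTS)
for name, value in averages.items():
    print(f"{name}: {value:,.1f} per post")
```

These are simple means. They say nothing about how engagement is distributed across individual posts, which would require the per-post records themselves.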
## Citation
```
@article{yang2021checked,
title={CHECKED: Chinese COVID-19 fake news dataset},
author={Yang, Chen and Zhou, Xinyi and Zafarani, Reza},
journal={Social Network Analysis and Mining},
volume={11},
number={1},
pages={1--8},
year={2021},
publisher={Springer}
}
```
## Download dataset
:modelscope-code[]{type="git"}
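The directive above is a placeholder that the hosting site renders as a git download snippet. As a rough sketch of what that download might look like when scripted, the code below builds a `git clone` command for the dataset repository. The clone URL pattern (`https://www.modelscope.cn/datasets/<repo_id>.git`) is an assumption inferred from the dataset page URL, not something this card confirms, and the helper names are hypothetical.

```python
import subprocess

# Assumption: ModelScope dataset repositories can be cloned from
# <base>/datasets/<repo_id>.git. Verify on the dataset page before relying on it.
MODELSCOPE_BASE = "https://www.modelscope.cn"


def build_clone_command(repo_id, dest_dir):
    """Build the argument list for cloning a dataset repository with git."""
    url = f"{MODELSCOPE_BASE}/datasets/{repo_id}.git"
    return ["git", "clone", url, dest_dir]


def clone_dataset(repo_id, dest_dir):
    """Run the clone; raises CalledProcessError if git exits with an error."""
    subprocess.run(build_clone_command(repo_id, dest_dir), check=True)


# Example (not executed here, since it needs git and network access):
# clone_dataset("OmniData/CHECKED", "CHECKED")
```

Keeping command construction separate from execution makes the URL easy to inspect or adjust before anything is downloaded.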
Provider: maas
Created: 2024-07-03
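Summary
Dataset introduction

Background and challenges
Background overview
CHECKED is a Chinese fake news dataset of 2,104 verified COVID-19-related Weibo posts covering December 2019 to August 2020. It includes rich multimedia information and propagation data, and is intended to support research on fake news prediction.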