TextMining Dataset
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/10359503
下载链接
链接失效反馈官方服务:
资源简介:
Purpose of the Datasets:
These datasets serve as invaluable resources in the domain of information credibility assessment, fact-checking, and deception detection.
liar.csv Dataset
The liar.csv dataset, sourced from Papers With Code, is tailored explicitly for detecting deceptive information. It comprises features aimed at analyzing and identifying falsified or misleading content. The dataset contains various attributes useful for training models and conducting research in deception detection.
factcheck.csv Dataset
The factcheck.csv dataset aggregates information from FactCheck.org and includes columns such as:
author: Identifies the individual or entity responsible for creating or verifying content.
text: Contains the textual information or claim undergoing fact-checking.
source: Indicates the platform or origin of the information.
date: Represents the timestamp or publication date associated with the information.
target: Reflects the result or assessment derived from the fact-checking process regarding the accuracy of the information. This dataset provides valuable insights for assessing factual accuracy and information credibility.
politifact.csv Dataset
Derived from PolitiFact, the politifact.csv dataset encompasses:
id: Unique identifier assigned to each entry within the dataset.
target: Represents the evaluation or rating indicating the truthfulness level of statements (e.g., true, false, mostly true, etc.).
text: Contains the textual statement or claim made by an individual or entity.
name: Denotes the headline or title of the article or statement under assessment.
link: Provides a reference URL or link supporting the context of the statement.
target_numeric: Represents a numerical value corresponding to the truthfulness rating, facilitating computational analysis and modeling.
These datasets collectively offer a rich repository of information crucial for training algorithms, conducting analyses, and exploring the realm of fact-checking, deception detection, and information authenticity assessment.
创建时间:
2023-12-11



