five

TextMining Dataset

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/10359503
下载链接
链接失效反馈
官方服务:
资源简介:
Purpose of the Datasets: These datasets serve as invaluable resources in the domain of information credibility assessment, fact-checking, and deception detection. liar.csv Dataset The liar.csv dataset, sourced from Papers With Code, is tailored explicitly for detecting deceptive information. It comprises features aimed at analyzing and identifying falsified or misleading content. The dataset contains various attributes useful for training models and conducting research in deception detection. factcheck.csv Dataset The factcheck.csv dataset aggregates information from FactCheck.org and includes columns such as: author: Identifies the individual or entity responsible for creating or verifying content. text: Contains the textual information or claim undergoing fact-checking. source: Indicates the platform or origin of the information. date: Represents the timestamp or publication date associated with the information. target: Reflects the result or assessment derived from the fact-checking process regarding the accuracy of the information. This dataset provides valuable insights for assessing factual accuracy and information credibility. politifact.csv Dataset Derived from PolitiFact, the politifact.csv dataset encompasses: id: Unique identifier assigned to each entry within the dataset. target: Represents the evaluation or rating indicating the truthfulness level of statements (e.g., true, false, mostly true, etc.). text: Contains the textual statement or claim made by an individual or entity. name: Denotes the headline or title of the article or statement under assessment. link: Provides a reference URL or link supporting the context of the statement. target_numeric: Represents a numerical value corresponding to the truthfulness rating, facilitating computational analysis and modeling. These datasets collectively offer a rich repository of information crucial for training algorithms, conducting analyses, and exploring the realm of fact-checking, deception detection, and information authenticity assessment.
创建时间:
2023-12-11
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作