UNIDECOR

Name: UNIDECOR
Creator: Institute for Natural Language Processing, University of Stuttgart
Published: 2023-06-08 07:07:26
License: No description available

Source: arXiv; updated 2023-06-08; indexed 2024-06-21

Download link:

https://www.ims.uni-stuttgart.de/data/unidecor

Description:

UNIDECOR is a unified deception corpus created by the Institute for Natural Language Processing at the University of Stuttgart. It combines data from several domains, including social media comments, courtroom testimonies, opinion statements on specific topics, and deceptive dialogues from online strategy games. The corpus contains 164,085 records. Its aims are to understand how deception differs across datasets by analyzing correlations of linguistic cues, and to support cross-corpus modeling experiments that advance deception detection. The source datasets were collected with a variety of strategies, such as web crawling and crowdsourcing. The corpus is relevant to psychology, forensic science, law, and computational linguistics, and addresses the problem that people are easily deceived in digital media.
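As an illustration of what a cross-corpus experiment can look like, the sketch below holds out one source corpus at a time for testing and trains on the rest. This is one common setup, not necessarily the protocol used by the corpus authors. The records are toy data, and the field names `corpus`, `text`, and `label` are assumptions that do not reflect the actual UNIDECOR file layout.

```python
from collections import defaultdict

# Toy records. The field names ("corpus", "text", "label") are hypothetical
# and do not reflect the actual UNIDECOR file layout.
records = [
    {"corpus": "reviews", "text": "Best hotel ever, truly.", "label": "deceptive"},
    {"corpus": "reviews", "text": "The room was clean.", "label": "truthful"},
    {"corpus": "trials", "text": "I was at home all night.", "label": "deceptive"},
    {"corpus": "games", "text": "I will support you this turn.", "label": "truthful"},
]


def leave_one_corpus_out(rows):
    """Yield (held_out, train, test), holding out one source corpus per split."""
    by_corpus = defaultdict(list)
    for row in rows:
        by_corpus[row["corpus"]].append(row)
    # Sort the corpus names so the order of the splits is deterministic.
    for held_out in sorted(by_corpus):
        train = [
            row
            for name, group in by_corpus.items()
            if name != held_out
            for row in group
        ]
        test = by_corpus[held_out]
        yield held_out, train, test


splits = list(leave_one_corpus_out(records))
for held_out, train, test in splits:
    print(f"test on {held_out}: {len(train)} train / {len(test)} test")
```

A classifier trained on each `train` list and scored on the matching `test` list would show how well deception cues learned in some domains carry over to an unseen one.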

Provider:

Institute for Natural Language Processing, University of Stuttgart

Created:

2023-06-05
