
PANC

Indexed from arXiv on 2025-09-30
Download link:
https://github.com/jinmyeongAN/SCoRL
Description:
PANC combines three kinds of chats: full-length positive (grooming) chats from ChatCoder2, negative (normal) chat segments from PAN12, and positive segments obtained by splitting each full-length positive chat into multiple parts. The classes are heavily imbalanced: negative chats outnumber positive chats by more than 100 to 1. Every dialogue turn carries a strategy-level annotation. The dataset targets early detection of online grooming.
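
The segmentation and class imbalance described above can be illustrated with a minimal Python sketch. It is not taken from the dataset's authors: the function names `split_into_segments` and `inverse_frequency_weights` are hypothetical, fixed-length splitting is an assumption (the description does not say how the positive chats were divided), and inverse-frequency weighting is only one common way to compensate for a roughly 100:1 imbalance.

```python
from collections import Counter


def split_into_segments(turns, segment_len):
    """Split a list of dialogue turns into consecutive segments.

    Assumption: fixed-length, non-overlapping segments. The last
    segment may be shorter than segment_len.
    """
    if segment_len <= 0:
        raise ValueError("segment_len must be positive")
    return [turns[i:i + segment_len] for i in range(0, len(turns), segment_len)]


def inverse_frequency_weights(labels):
    """Return a per-class weight of n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


# Toy data only: five placeholder turns and a 100:1 label ratio.
toy_chat = ["turn1", "turn2", "turn3", "turn4", "turn5"]
segments = split_into_segments(toy_chat, 2)

toy_labels = [0] * 100 + [1]  # 0 = negative (normal), 1 = positive
weights = inverse_frequency_weights(toy_labels)
```

With weights like these, a loss function can count each rare positive example far more heavily than a negative one, so a classifier is not rewarded for predicting "negative" everywhere.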
Provided by:
Vogt et al.