
Synthetic Conversation Dataset

Source: arXiv (indexed 2025-09-30)
Download link:
https://github.com/belindal/ERASE
Description:
This dataset is built by simulating conversations between two large language models (LLMs) with distinct personas. The conversations reflect facts about the speakers' lives that change over time. Because the conversations are synthetic, the evolving facts and the reasoning they require can be tightly controlled and evaluated. The dataset contains 3 groups of conversations covering 1,008 questions in total. Its task is to evaluate language models on conversations that involve evolving state and reasoning.
Provided by:
Authors of the paper