five

CoDeTT

收藏
魔搭社区2026-04-15 更新2026-05-03 收录
下载链接:
https://modelscope.cn/datasets/wyawya/CoDeTT
下载链接
链接失效反馈
官方服务:
资源简介:
# CoDeTT: A Context-Aware Decision Benchmark for Turn-Taking Evaluation ## 🌐 Dataset Summary CoDeTT is a benchmark dataset for turn-taking decision evaluation in full-duplex spoken dialogue systems. It evaluates not only what action a model should take at the current moment, but also whether the underlying semantic intent is aligned. Core action space (4 classes): - `Maintain` - `Stop & Listen` - `Takeover` - `Dismiss` Fine-grained intent space: - 14 scenario labels across two system states: `SystemSpeaking` and `SystemIdle`. ## 🧩 Task Definition Given: - Current system state - Multi-turn dialogue history - Current target input Predict: - Turn-taking action at the current timestamp (4-way) - Optional fine-grained intent category (14-way) ## 🏷️ Label Taxonomy ### SystemSpeaking - `Maintain`: - `Backchannel` - `Invalidation` - `Side-talk` - `Distraction` - `Stop & Listen`: - `Interruption` - `Dismissal` - `Collaboration` ### SystemIdle - `Takeover`: - `Completion` - `Cooperation` - `Dismiss`: - `Incomplete` - `Invalidation` - `Dismissal` - `Exclusion` - `Side-talk` ## 🗂️ Data Structure (Full Directory Tree) ```text Benchmark_Datasets ├── .cache ├── audios │ ├── EN │ │ ├── real │ │ │ ├── System_Idle │ │ │ │ ├── Dismiss │ │ │ │ │ └── Incomplete │ │ │ │ └── Takeover │ │ │ │ └── Completion │ │ │ └── System_Speaking │ │ │ ├── Maintain │ │ │ │ └── Backchannel │ │ │ └── Stop_Listen │ │ │ └── Interruption │ │ └── syn │ │ ├── SystemIdle │ │ │ ├── Dismiss │ │ │ │ ├── Dismissal │ │ │ │ ├── Exclusion │ │ │ │ ├── Incomplete │ │ │ │ ├── Invalidation │ │ │ │ └── SideTalk │ │ │ └── Takeover │ │ │ ├── Completion │ │ │ └── Cooperation │ │ └── SystemSpeaking │ │ ├── Maintain │ │ │ ├── Backchannel │ │ │ ├── Distraction │ │ │ ├── Invalidation │ │ │ └── SideTalk │ │ └── StopandListen │ │ ├── Collaboration │ │ ├── Dismissal │ │ └── Interruption │ └── ZH │ ├── real │ │ ├── System_Idle │ │ │ ├── Dismiss │ │ │ │ └── Incomplete │ │ │ └── Takeover │ │ │ └── Completion │ │ └── System_Speaking │ │ ├── Maintain │ │ │ └── Backchannel │ │ └── Stop_Listen │ │ └── Interruption │ └── syn │ ├── SystemIdle │ │ ├── Dismiss │ │ │ ├── Dismissal │ │ │ ├── Exclusion │ │ │ ├── Incomplete │ │ │ ├── Invalidation │ │ │ └── SideTalk │ │ └── Takeover │ │ ├── Completion │ │ └── Cooperation │ └── SystemSpeaking │ ├── Maintain │ │ ├── Backchannel │ │ ├── Distraction │ │ ├── Invalidation │ │ └── SideTalk │ └── StopandListen │ ├── Collaboration │ ├── Dismissal │ └── Interruption └── jsonls ├── EN │ ├── real │ │ ├── SystemIdle_Dismiss_Incomplete.jsonl │ │ ├── SystemIdle_Takeover_Completion.jsonl │ │ ├── SystemSpeaking_Maintain_Backchannel.jsonl │ │ └── SystemSpeaking_StopandListen_Interruption.jsonl │ └── syn │ ├── SystemIdle_Dismiss_Dismissal.jsonl │ ├── SystemIdle_Dismiss_Exclusion.jsonl │ ├── SystemIdle_Dismiss_Incomplete.jsonl │ ├── SystemIdle_Dismiss_Invalidation.jsonl │ ├── SystemIdle_Dismiss_SideTalk.jsonl │ ├── SystemIdle_Takeover_Completion.jsonl │ ├── SystemIdle_Takeover_Cooperation.jsonl │ ├── SystemSpeaking_Maintain_Backchannel.jsonl │ ├── SystemSpeaking_Maintain_Distraction.jsonl │ ├── SystemSpeaking_Maintain_Invalidation.jsonl │ ├── SystemSpeaking_Maintain_SideTalk.jsonl │ ├── SystemSpeaking_StopandListen_Collaboration.jsonl │ ├── SystemSpeaking_StopandListen_Dismissal.jsonl │ └── SystemSpeaking_StopandListen_Interruption.jsonl └── ZH ├── real │ ├── SystemIdle_Dismiss_Incomplete.jsonl │ ├── SystemIdle_Takeover_Completion.jsonl │ ├── SystemSpeaking_Maintain_Backchannel.jsonl │ └── SystemSpeaking_StopandListen_Interruption.jsonl └── syn ├── SystemIdle_Dismiss_Dismissal.jsonl ├── SystemIdle_Dismiss_Exclusion.jsonl ├── SystemIdle_Dismiss_Incomplete.jsonl ├── SystemIdle_Dismiss_Invalidation.jsonl ├── SystemIdle_Dismiss_SideTalk.jsonl ├── SystemIdle_Takeover_Completion.jsonl ├── SystemIdle_Takeover_Cooperation.jsonl ├── SystemSpeaking_Maintain_Backchannel.jsonl ├── SystemSpeaking_Maintain_Distraction.jsonl ├── SystemSpeaking_Maintain_Invalidation.jsonl ├── SystemSpeaking_Maintain_SideTalk.jsonl ├── SystemSpeaking_StopandListen_Collaboration.jsonl ├── SystemSpeaking_StopandListen_Dismissal.jsonl └── SystemSpeaking_StopandListen_Interruption.jsonl ``` ## 📦 Quick Start ```bash mkdir -p ./CoDeTT && lz4 -dc ./CoDeTT.lz4 | tar -x -C ./CoDeTT ``` ## 📏 Evaluation Protocol ### Stage 1: Action Level All models are evaluated in the unified 4-action space (Takeover / Maintain / Stop & Listen / Dismiss), enabling fair comparison across different model paradigms. ### Stage 2: Intent Level Models with stronger semantic reasoning capabilities are further evaluated on direct prediction of all 14 fine-grained intent categories. ## 📊 Metrics - `ACC` (Accuracy): Action-level prediction accuracy - `SMR` (Semantic Misalignment Rate): The proportion of semantically incorrect intent predictions among action-correct samples ## 🚀 Use Cases - Turn-taking evaluation for full-duplex voice assistants and speech agents - Context understanding analysis for speech-language models in complex conversations - Multi-speaker interaction and floor-management research - Diagnosis of "action-correct but reason-wrong" behaviors - Bilingual and cross-lingual turn-taking benchmarking (English/Chinese) ## 📚 Citation If you use CoDeTT or reference its benchmark design, please cite: ```bibtex @article{shen2026codett, title={CoDeTT: A Context-Aware Decision Benchmark for Turn-Taking Evaluation}, author={Shen, Huan and Wang, Yingao and Huang, Shangkun and Zou, Wei and Chen, Yunzhang}, journal={arXiv preprint arXiv:2603.25434}, year={2026} } ```

# CoDeTT: 面向轮次交互评估的上下文感知决策基准数据集 ## 🌐 数据集概述 CoDeTT是一款面向全双工语音对话系统(full-duplex spoken dialogue systems)轮次交互决策评估的基准数据集。该数据集不仅可评估模型在当前时刻应采取的动作,还能检验其背后的语义意图是否匹配一致。 核心动作空间(共4类): - `维持当前动作(Maintain)` - `停止并倾听(Stop & Listen)` - `接管话语权(Takeover)` - `终止交互(Dismiss)` 细粒度意图空间包含覆盖两种系统状态的14个场景标签:系统发言中(SystemSpeaking)与系统空闲时(SystemIdle)。 ## 🧩 任务定义 输入: - 当前系统状态 - 多轮对话历史 - 当前目标输入 输出: - 当前时间戳下的对话轮次动作(4分类任务) - 可选的细粒度意图类别(14分类任务) ## 🏷️ 标签分类体系 ### 系统发言中(SystemSpeaking) - `维持当前动作(Maintain)`: - `反馈语(Backchannel)` - `无效回应(Invalidation)` - `旁支对话(Side-talk)` - `干扰性发言(Distraction)` - `停止并倾听(Stop & Listen)`: - `插话(Interruption)` - `终止交互(Dismissal)` - `协作性回应(Collaboration)` ### 系统空闲时(SystemIdle) - `接管话语权(Takeover)`: - `完成接续(Completion)` - `协作接续(Cooperation)` - `终止交互(Dismiss)`: - `未完成交互(Incomplete)` - `无效回应(Invalidation)` - `终止交互(Dismissal)` - `排除性回应(Exclusion)` - `旁支对话(Side-talk)` ## 🗂️ 数据结构(完整目录树) text Benchmark_Datasets ├── .cache ├── audios │ ├── EN │ │ ├── real │ │ │ ├── System_Idle │ │ │ │ ├── Dismiss │ │ │ │ │ └── Incomplete │ │ │ │ └── Takeover │ │ │ │ └── Completion │ │ │ └── System_Speaking │ │ │ ├── Maintain │ │ │ │ └── Backchannel │ │ │ └── Stop_Listen │ │ │ └── Interruption │ │ └── syn │ │ ├── SystemIdle │ │ │ ├── Dismiss │ │ │ │ ├── Dismissal │ │ │ │ ├── Exclusion │ │ │ │ ├── Incomplete │ │ │ │ ├── Invalidation │ │ │ │ └── SideTalk │ │ │ └── Takeover │ │ │ ├── Completion │ │ │ └── Cooperation │ │ └── SystemSpeaking │ │ ├── Maintain │ │ │ ├── Backchannel │ │ │ ├── Distraction │ │ │ ├── Invalidation │ │ │ └── SideTalk │ │ └── StopandListen │ │ ├── Collaboration │ │ ├── Dismissal │ │ └── Interruption │ └── ZH │ ├── real │ │ ├── System_Idle │ │ │ ├── Dismiss │ │ │ │ └── Incomplete │ │ │ └── Takeover │ │ │ └── Completion │ │ └── System_Speaking │ │ ├── Maintain │ │ │ └── Backchannel │ │ └── Stop_Listen │ │ └── Interruption │ └── syn │ ├── SystemIdle │ │ ├── Dismiss │ │ │ ├── Dismissal │ │ │ ├── Exclusion │ │ │ ├── Incomplete │ │ │ ├── Invalidation │ │ │ └── SideTalk │ │ └── Takeover │ │ ├── Completion │ │ └── Cooperation │ └── SystemSpeaking │ ├── Maintain │ │ ├── Backchannel │ │ ├── Distraction │ │ ├── Invalidation │ │ └── SideTalk │ └── StopandListen │ ├── Collaboration │ ├── Dismissal │ └── Interruption └── jsonls ├── EN │ ├── real │ │ ├── SystemIdle_Dismiss_Incomplete.jsonl │ │ ├── SystemIdle_Takeover_Completion.jsonl │ │ ├── SystemSpeaking_Maintain_Backchannel.jsonl │ │ └── SystemSpeaking_StopandListen_Interruption.jsonl │ └── syn │ ├── SystemIdle_Dismiss_Dismissal.jsonl │ ├── SystemIdle_Dismiss_Exclusion.jsonl │ ├── SystemIdle_Dismiss_Incomplete.jsonl │ ├── SystemIdle_Dismiss_Invalidation.jsonl │ ├── SystemIdle_Dismiss_SideTalk.jsonl │ ├── SystemIdle_Takeover_Completion.jsonl │ ├── SystemIdle_Takeover_Cooperation.jsonl │ ├── SystemSpeaking_Maintain_Backchannel.jsonl │ ├── SystemSpeaking_Maintain_Distraction.jsonl │ ├── SystemSpeaking_Maintain_Invalidation.jsonl │ ├── SystemSpeaking_Maintain_SideTalk.jsonl │ ├── SystemSpeaking_StopandListen_Collaboration.jsonl │ ├── SystemSpeaking_StopandListen_Dismissal.jsonl │ └── SystemSpeaking_StopandListen_Interruption.jsonl └── ZH ├── real │ ├── SystemIdle_Dismiss_Incomplete.jsonl │ ├── SystemIdle_Takeover_Completion.jsonl │ ├── SystemSpeaking_Maintain_Backchannel.jsonl │ └── SystemSpeaking_StopandListen_Interruption.jsonl └── syn ├── SystemIdle_Dismiss_Dismissal.jsonl ├── SystemIdle_Dismiss_Exclusion.jsonl ├── SystemIdle_Dismiss_Incomplete.jsonl ├── SystemIdle_Dismiss_Invalidation.jsonl ├── SystemIdle_Dismiss_SideTalk.jsonl ├── SystemIdle_Takeover_Completion.jsonl ├── SystemIdle_Takeover_Cooperation.jsonl ├── SystemSpeaking_Maintain_Backchannel.jsonl ├── SystemSpeaking_Maintain_Distraction.jsonl ├── SystemSpeaking_Maintain_Invalidation.jsonl ├── SystemSpeaking_Maintain_SideTalk.jsonl ├── SystemSpeaking_StopandListen_Collaboration.jsonl ├── SystemSpeaking_StopandListen_Dismissal.jsonl └── SystemSpeaking_StopandListen_Interruption.jsonl ## 📦 快速上手 bash mkdir -p ./CoDeTT && lz4 -dc ./CoDeTT.lz4 | tar -x -C ./CoDeTT ## 📏 评估协议 ### 阶段1:动作层级 所有模型将在统一的4类动作空间(接管话语权/维持当前动作/停止并倾听/终止交互)中进行评估,以实现不同模型范式间的公平对比。 ### 阶段2:意图层级 具备更强语义推理能力的模型,将进一步在全部14个细粒度意图类别上开展直接预测评估。 ## 📊 评估指标 - `ACC(准确率,Accuracy)`:动作层级预测准确率 - `SMR(语义失配率,Semantic Misalignment Rate)`:动作预测正确的样本中,语义意图预测错误的比例 ## 🚀 应用场景 - 全双工语音助手与AI智能体(AI Agent)的轮次交互评估 - 复杂对话场景下语音语言模型的上下文理解分析 - 多说话人交互与话语权管理研究 - “动作正确但推理错误”行为的诊断分析 - 双语及跨语言轮次交互基准测试(覆盖英语/中文) ## 📚 引用规范 若您使用CoDeTT或参考其基准数据集设计,请引用以下文献: bibtex @article{shen2026codett, title={CoDeTT: A Context-Aware Decision Benchmark for Turn-Taking Evaluation}, author={Shen, Huan and Wang, Yingao and Huang, Shangkun and Zou, Wei and Chen, Yunzhang}, journal={arXiv preprint arXiv:2603.25434}, year={2026} }
提供机构:
maas
创建时间:
2026-03-06
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作