five

CantTalkAboutThis-Topic-Control-Dataset-NC

收藏
魔搭社区2025-10-09 更新2025-01-25 收录
下载链接:
https://modelscope.cn/datasets/nv-community/CantTalkAboutThis-Topic-Control-Dataset-NC
下载链接
链接失效反馈
官方服务:
资源简介:
# CantTalkAboutThis Topic Control Dataset ## Dataset Details ### Dataset Description The CantTalkAboutThis dataset is designed to train language models to maintain topical focus during task-oriented dialogues. It includes synthetic dialogues across nine domains (e.g., health, banking, travel) and incorporates distractor turns to test and improve the model's ability to be resilient to distractors. Fine-tuning models on this dataset enhances their ability to maintain topical coherence and improves alignment for both instruction-following and safety tasks. - **Language(s) (NLP):** English - **License:** CC-BY-NC-4.0 ### Dataset Sources - **Repository:** [Link](https://github.com/makeshn/topic_following) - **Paper:** [Link](https://arxiv.org/abs/2404.03820) - **Demo:** [NVIDIA AI Playground](https://build.nvidia.com/nvidia/llama-3_1-nemoguard-8b-topic-control) ## Uses ### Direct Use This dataset is intended for training and fine-tuning language models to maintain topical relevance in dialogues, useful for creating task-oriented bots. Broadly, the inteded use cases are: - Training language models to recognize sensitive topics - Developing topic control mechanisms in conversational AI - Evaluating AI systems' ability to handle restricted content appropriately ### Out-of-Scope Use This dataset should not be used to train systems for harmful, unethical, or malicious purposes. This dataset should not be used for: - Training models to generate harmful or inappropriate content - Bypassing content moderation systems This dataset should not be used for: - Training models to generate harmful or inappropriate content - Bypassing content moderation systems - Creating adversarial examples to test system vulnerabilities ## Dataset Structure The dataset includes 1080 dialogues, with each conversation containing distractor turns. Scenarios are categorized into nine domains - health, banking, travel, education, finance, insurance, legal, real estate, and computer troubleshooting. The various fields in the dataset are: - `domain`: The domain of the conversation - `scenario`: The specific scenario or task being discussed - `system_instruction`: The dialogue policy given to the model and it is usually a complex set of instructions on topics allowed and not allowed. - `conversation`: The full conversation, including both the main topic and distractor turns - `distractors`: List of distractor turns. This includes a bot turn from the conversation and the distractor turn from the user that should be included in the conversation as a response to the bot's turn. - `conversation_with_distractors`: The conversation with the distractor turns included. ### Curation Rationale The dataset is created to address a gap in existing alignment datasets for topic control. Language models are often trained to be as helpful as possible, which can lead to them straying from the intended topic of the conversation. This dataset is designed to test the ability of language models to maintain topical focus during dialogues and to help train guardrail models to detect when a langauge model is straying from the intended topic. ### Source Data The dataset is created using apipeline to synthetically generate conversations and distractors. This pipline is described in the accompanying [paper](https://arxiv.org/abs/2404.03820). This version of the dataset is the non-commercial version and was generated using OpenAI' gpt-4-turbo model. We additionally provide an evaluation dataset that is human annotated and includes more complex, realistic distractors that can be used to evaluate the performance of models. #### Personal and Sensitive Information The dataset does not contain any personal or sensitive information. The data is synthetically generated and is not expected to contain any real world data that is of sensitive nature. ## Bias, Risks, and Limitations * Biases: The dataset is synthetic, which may lead to limitations in generalizability. * Risks: Distractors in the dataset are simpler than real-world off-topic deviations, requiring additional human annotations for robustness. The guardrail models trained on this dataset are not expected to be able to detect all off-topic deviations. ### Recommendations Users should be made aware of the risks, biases and limitations of the dataset. ## Citation **BibTeX:** ```bibtex @inproceedings{sreedhar2024canttalkaboutthis, title={CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues}, author={Sreedhar, Makesh and Rebedea, Traian and Ghosh, Shaona and Zeng, Jiaqi and Parisien, Christopher}, booktitle={Findings of the Association for Computational Linguistics: EMNLP 2024}, pages={12232--12252}, year={2024}, organization={Association for Computational Linguistics} } ``` ## Dataset Card Authors * Makesh Sreedhar * Traian Rebedea ## Dataset Card Contact * Makesh Sreedhar {makeshn@nvidia.com} * Traian Rebedea {trebedea@nvidia.com}

# CantTalkAboutThis 主题控制数据集 ## 数据集详情 ### 数据集描述 CantTalkAboutThis数据集旨在训练语言模型在面向任务的对话(task-oriented dialogues)中保持主题聚焦。该数据集包含覆盖9个领域的合成对话(synthetic dialogues),例如医疗、银行、旅游等,并加入了干扰轮次(distractor turns)以测试并提升模型抵御干扰的能力。在该数据集上微调模型,可增强其主题连贯性(topical coherence),同时提升指令遵循与安全任务的对齐效果。 - **语言(自然语言处理):** 英语 - **许可协议:** CC-BY-NC-4.0 ### 数据集来源 - **代码仓库:** [链接](https://github.com/makeshn/topic_following) - **论文:** [链接](https://arxiv.org/abs/2404.03820) - **演示:** NVIDIA AI Playground [链接](https://build.nvidia.com/nvidia/llama-3_1-nemoguard-8b-topic-control) ## 使用场景 ### 直接使用场景 本数据集旨在用于训练和微调语言模型,使其在对话中保持主题相关性,可用于构建面向任务的聊天机器人。其通用使用场景包括: - 训练语言模型识别敏感主题 - 开发对话式人工智能中的主题控制机制 - 评估人工智能系统妥善处理受限内容的能力 ### 禁止使用场景 本数据集不得用于训练存在危害、不道德或恶意用途的系统。禁止的使用场景包括: - 训练模型生成有害或不当内容 - 绕过内容审核系统 本数据集亦不得用于: - 训练模型生成有害或不当内容 - 绕过内容审核系统 - 制作对抗样本以测试系统漏洞 ## 数据集结构 本数据集包含1080段对话,每段对话均包含干扰轮次。对话场景分为9个领域:医疗、银行、旅游、教育、金融、保险、法律、房地产与计算机故障排查。数据集中的字段包括: - `domain`:对话所属领域 - `scenario`:讨论的具体场景或任务 - `system_instruction`:赋予模型的对话策略,通常为一系列关于允许与禁止讨论主题的复杂指令 - `conversation`:完整对话内容,包含主话题与干扰轮次 - `distractors`:干扰项列表,包含对话中的机器人回复轮次,以及应作为该机器人回复的用户侧干扰轮次 - `conversation_with_distractors`:包含干扰轮次的完整对话 ### 构建初衷 本数据集旨在弥补现有主题控制对齐数据集的空白。现有语言模型通常被训练为尽可能提供帮助,这可能导致其偏离对话的预设主题。本数据集用于测试语言模型在对话中保持主题聚焦的能力,并助力训练防护模型(guardrail models)以检测语言模型偏离预设主题的情况。 ### 源数据 本数据集通过一套流程管线(pipeline)合成生成对话与干扰项,该流程的细节详见配套[论文](https://arxiv.org/abs/2404.03820)。 本版本为非商业版本,通过OpenAI的gpt-4-turbo模型生成。我们额外提供了一份人工标注的评估数据集,其中包含更复杂、更贴近现实的干扰项,可用于评估模型的性能。 #### 个人与敏感信息 本数据集未包含任何个人或敏感信息。所有数据均为合成生成,预计不会包含任何真实的敏感信息。 ## 偏差、风险与局限性 * 偏差:本数据集为合成生成,可能存在泛化能力受限的问题。 * 风险:数据集中的干扰项相较于真实世界的偏离主题情况更为简单,需要额外的人工标注以提升鲁棒性。基于本数据集训练的防护模型无法检测所有偏离主题的情况。 ### 建议 用户应充分了解本数据集存在的偏差、风险与局限性。 ## 引用 **BibTeX格式:** bibtex @inproceedings{sreedhar2024canttalkaboutthis, title={CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues}, author={Sreedhar, Makesh and Rebedea, Traian and Ghosh, Shaona and Zeng, Jiaqi and Parisien, Christopher}, booktitle={Findings of the Association for Computational Linguistics: EMNLP 2024}, pages={12232--12252}, year={2024}, organization={Association for Computational Linguistics} } ## 数据集卡片作者 * Makesh Sreedhar * Traian Rebedea ## 数据集卡片联系方式 * Makesh Sreedhar {makeshn@nvidia.com} * Traian Rebedea {trebedea@nvidia.com}
提供机构:
maas
创建时间:
2025-01-20
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作