emozilla/soda_synthetic_dialogue
收藏数据集卡片 🥤SODA Synthetic Dialogue
数据集描述
- 语言: 英语
- 语言创建者: 机器生成
- 许可证: MIT
- 多语言性: 单语
- 数据集名称: SODA Synthetic Dialogue
- 数据集大小: 1M<n<10M
- 源数据集: 扩展自 allenai/soda
- 标签: open-assistant, conversations-summarization
- 任务类别: 对话
- 任务ID: 对话生成
数据集信息
-
特征:
- 名称: conversation
- 数据类型: string
-
配置名称: soda_synthetic_dialogue_dialogue_modeling
-
拆分:
- 训练集:
- 字节数: 1524941476
- 样本数: 1191582
- 测试集:
- 字节数: 190565707
- 样本数: 148968
- 验证集:
- 字节数: 187161878
- 样本数: 146346
- 训练集:
-
下载大小: 1952926579
-
数据集大小: 1902669061
数据集总结
🥤SODA Synthetic Dialogue 是一组由 Assistant 和 User 之间的合成对话组成的数据集。每个对话中,User 要求 Assistant 根据现有对话、故事片段或标题/主题执行总结或故事生成任务。
该数据集是通过合成 🥤Soda 中的对话并应用一组模板生成的。原始研究论文可以在 这里 找到。
示例:
User: Can you write the next few lines of dialogue for this scene: Cornell: I know what youre thinking, Dontrell. I can sense your feelings and know exactly what youre going to say or do next. Youre thinking about how much you miss your family, and how you wish you could see them again. Youre wondering if theres anything you can do to make that happen. And youre also feeling a little guilty, because you know that if you could see them again, it would only be for a short while before they died. Dontrell: Cornell, how can you possibly know all that? Its like you can read my mind! Assistant: Sure, the next dialogue for this scene could be: Cornell: I told you, I can sense your feelings. Its like I have a connection to your thoughts and emotions. Im not sure why or how it works, but ever since we met, Ive been able to understand what youre thinking and feeling pretty easily. Dontrell: Wow...thats amazing! And a little bit scary too. But it does explain why weve always been so close - even though weve only known each other for a few months. Cornell: Yeah, I guess our connection is pretty strong. But dont worry, Dontrell - Ill never use it to manipulate or control you in any way. I promise." Dontrell: Thats a relief! I was starting to feel a little bit like you could read my mind, and it was kind of creepy. But if you promise not to use your abilities to control me, then Im okay with it. Cornell: I swear, Dontrell. Ill never use my abilities to control or manipulate you in any way. You have my word. User: And a title? Assistant: Sure, a title for this dialogue could be: Cornell is intuitive. Cornell reads Dontrells mind. User: How about a short description? Assistant: Sure, a short description for this dialogue could be: Cornell knows what Dontrell is thinking. He can sense her feelings and know exactly what she is going to say or do next. User: What would be one theme of this story? Assistant: One theme of this story could be: being intuitive
使用
该数据集包含一个配置 dialogue_modeling,具有一个文本 conversation 特征。




