five

Synthetic-Context-Conversations

收藏
魔搭社区2025-12-03 更新2025-02-01 收录
下载链接:
https://modelscope.cn/datasets/prithivMLmods/Synthetic-Context-Conversations
下载链接
链接失效反馈
官方服务:
资源简介:
# Synthetic-Context-Conversations ## Overview The **Synthetic-Context-Conversations** dataset is a collection of synthetic conversations designed to simulate empathetic and context-rich dialogues. It is particularly useful for tasks such as text generation, summarization, and question answering. The dataset is available in English and contains between 10,000 to 100,000 entries. ## Dataset Details - **Modalities**: Text - **Languages**: English - **Size**: 10K-100K - **Formats**: Parquet - **License**: Apache-2.0 ## Dataset Structure The dataset is split into a single training set with 99,086 rows. Each row contains a conversation between a human and an AI, focusing on various emotional and contextual topics. ### Example Conversations ```json [ {"from": "human", "value": "I've been feeling so sad and overwhelmed lately, work has become such a massive source of stress for me."}, {"from": "get", "value": "Hey there, I'm here to listen and support you. It sounds like work has been really challenging lately, can..."} ] ``` ## Usage ### Console - **Size of downloaded dataset files**: 448 MB - **Size of the adis-connected Parquet files**: 210 MB - **Number of rows**: 99,086

# 合成语境对话数据集(Synthetic-Context-Conversations) ## 概述 **合成语境对话数据集(Synthetic-Context-Conversations)**是一类用于模拟共情式且富含语境的对话的合成对话集合,尤其适用于文本生成、摘要生成与问答等任务。该数据集以英语为载体,包含10000至100000条数据条目。 ## 数据集详情 - **模态**:文本 - **语言**:英语 - **规模**:10K-100K - **格式**:Parquet - **许可协议**:Apache-2.0 ## 数据集结构 该数据集仅划分为一个训练集,共包含99086条数据行。每条数据行对应一段人类与AI智能体(AI Agent)之间的对话,主题涵盖各类情感与语境相关话题。 ### 对话示例 json [ {"from": "human", "value": "最近我一直感到十分沮丧且不堪重负,工作已然成为我压力的重大来源。"}, {"from": "get", "value": "嗨,我在这里倾听并支持你。看起来最近工作对你来说颇具挑战,能否..."} ] ## 使用说明 ### 控制台信息 - **下载数据集文件总大小**:448 MB - **adis连接式Parquet文件大小**:210 MB - **数据行数**:99086
提供机构:
maas
创建时间:
2025-01-28
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作