Synthetic-Context-Conversations
收藏魔搭社区2025-12-03 更新2025-02-01 收录
下载链接:
https://modelscope.cn/datasets/prithivMLmods/Synthetic-Context-Conversations
下载链接
链接失效反馈官方服务:
资源简介:
# Synthetic-Context-Conversations
## Overview
The **Synthetic-Context-Conversations** dataset is a collection of synthetic conversations designed to simulate empathetic and context-rich dialogues. It is particularly useful for tasks such as text generation, summarization, and question answering. The dataset is available in English and contains between 10,000 to 100,000 entries.
## Dataset Details
- **Modalities**: Text
- **Languages**: English
- **Size**: 10K-100K
- **Formats**: Parquet
- **License**: Apache-2.0
## Dataset Structure
The dataset is split into a single training set with 99,086 rows. Each row contains a conversation between a human and an AI, focusing on various emotional and contextual topics.
### Example Conversations
```json
[
{"from": "human", "value": "I've been feeling so sad and overwhelmed lately, work has become such a massive source of stress for me."},
{"from": "get", "value": "Hey there, I'm here to listen and support you. It sounds like work has been really challenging lately, can..."}
]
```
## Usage
### Console
- **Size of downloaded dataset files**: 448 MB
- **Size of the adis-connected Parquet files**: 210 MB
- **Number of rows**: 99,086
# 合成语境对话数据集(Synthetic-Context-Conversations)
## 概述
**合成语境对话数据集(Synthetic-Context-Conversations)**是一类用于模拟共情式且富含语境的对话的合成对话集合,尤其适用于文本生成、摘要生成与问答等任务。该数据集以英语为载体,包含10000至100000条数据条目。
## 数据集详情
- **模态**:文本
- **语言**:英语
- **规模**:10K-100K
- **格式**:Parquet
- **许可协议**:Apache-2.0
## 数据集结构
该数据集仅划分为一个训练集,共包含99086条数据行。每条数据行对应一段人类与AI智能体(AI Agent)之间的对话,主题涵盖各类情感与语境相关话题。
### 对话示例
json
[
{"from": "human", "value": "最近我一直感到十分沮丧且不堪重负,工作已然成为我压力的重大来源。"},
{"from": "get", "value": "嗨,我在这里倾听并支持你。看起来最近工作对你来说颇具挑战,能否..."}
]
## 使用说明
### 控制台信息
- **下载数据集文件总大小**:448 MB
- **adis连接式Parquet文件大小**:210 MB
- **数据行数**:99086
提供机构:
maas
创建时间:
2025-01-28



