JINIAC/real-persona-chat
收藏Hugging Face2024-05-27 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/JINIAC/real-persona-chat
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-sa-4.0
dataset_info:
features:
- name: dialogue_id
dtype: int64
- name: interlocutors
list:
- name: id
dtype: string
- name: persona
sequence: string
- name: role
dtype: string
- name: conversations
list:
- name: from
dtype: string
- name: value
dtype: string
splits:
- name: train
num_bytes: 60465247
num_examples: 13583
download_size: 21258651
dataset_size: 60465247
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
以下のデータセットから、dialogue_idとutterances、話者情報(ペルソナ)を抽出し、ロールプレイを想定した形式に変更して作成しました。
https://github.com/nu-dialogue/real-persona-chat
## 文献
```
@inproceedings{yamashita-etal-2023-realpersonachat,
title = "{R}eal{P}ersona{C}hat: A Realistic Persona Chat Corpus with Interlocutors{'} Own Personalities",
author = "Yamashita, Sanae and
Inoue, Koji and
Guo, Ao and
Mochizuki, Shota and
Kawahara, Tatsuya and
Higashinaka, Ryuichiro",
booktitle = "Proceedings of the 37th Pacific Asia Conference on Language, Information and Computation",
year = "2023",
pages = "852--861"
}
@inproceedings{yamashita-etal-2024-realpersonachat-ja,
title = "{R}eal{P}ersona{C}hat: 話者本人のペルソナと性格特性を含んだ雑談対話コーパス",
author = "山下 紗苗 and 井上 昂治 and 郭 傲 and 望月 翔太 and 河原 達也 and 東中 竜一郎",
booktitle = "言語処理学会第30回年次大会発表論文集",
year = "2024",
pages = "2738--2743"
}
```
提供机构:
JINIAC
原始信息汇总
数据集概述
数据集信息
- 许可证: cc-by-sa-4.0
数据集特征
- dialogue_id: 整数类型 (int64)
- interlocutors:
- id: 字符串类型 (string)
- persona: 字符串序列 (sequence: string)
- role: 字符串类型 (string)
- conversations:
- from: 字符串类型 (string)
- value: 字符串类型 (string)
数据集分割
- train:
- num_bytes: 60465247
- num_examples: 13583
数据集大小
- download_size: 21258651
- dataset_size: 60465247



