NomaDamas/DSTC-11-Track-5
收藏Hugging Face2023-12-08 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/NomaDamas/DSTC-11-Track-5
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
dataset_info:
- config_name: default
features:
- name: log
list:
- name: speaker
dtype: string
- name: text
dtype: string
- name: target
dtype: bool
- name: knowledge
list:
- name: doc_id
dtype: int64
- name: doc_type
dtype: string
- name: domain
dtype: string
- name: entity_id
dtype: int64
- name: sent_id
dtype: int64
- name: response
dtype: string
splits:
- name: train
num_bytes: 22289817
num_examples: 28431
- name: test
num_bytes: 4412204
num_examples: 5475
- name: validation
num_bytes: 3371855
num_examples: 4173
download_size: 12543490
dataset_size: 30073876
- config_name: knowledge
features:
- name: domain
dtype: string
- name: entity_id
dtype: int64
- name: entity_name
dtype: string
- name: doc_type
dtype: string
- name: doc_id
dtype: string
- name: review_sent_id
dtype: string
- name: review_sentence
dtype: string
- name: review_metadata
struct:
- name: dishes
sequence: string
- name: drinks
sequence: string
- name: traveler_type
dtype: string
- name: faq_question
dtype: string
- name: faq_answer
dtype: string
splits:
- name: train
num_bytes: 2135411
num_examples: 10882
download_size: 535623
dataset_size: 2135411
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: test
path: data/test-*
- split: validation
path: data/validation-*
- config_name: knowledge
data_files:
- split: train
path: knowledge/train-*
---
提供机构:
NomaDamas
原始信息汇总
数据集概述
配置信息
-
default 配置
- 特征
- log
- speaker: 字符串类型
- text: 字符串类型
- target: 布尔类型
- knowledge
- doc_id: 64位整数类型
- doc_type: 字符串类型
- domain: 字符串类型
- entity_id: 64位整数类型
- sent_id: 64位整数类型
- response: 字符串类型
- log
- 分割
- train
- 字节数: 22289817
- 样本数: 28431
- test
- 字节数: 4412204
- 样本数: 5475
- validation
- 字节数: 3371855
- 样本数: 4173
- train
- 下载大小: 12543490
- 数据集大小: 30073876
- 特征
-
knowledge 配置
- 特征
- domain: 字符串类型
- entity_id: 64位整数类型
- entity_name: 字符串类型
- doc_type: 字符串类型
- doc_id: 字符串类型
- review_sent_id: 字符串类型
- review_sentence: 字符串类型
- review_metadata
- dishes: 字符串序列
- drinks: 字符串序列
- traveler_type: 字符串类型
- faq_question: 字符串类型
- faq_answer: 字符串类型
- 分割
- train
- 字节数: 2135411
- 样本数: 10882
- train
- 下载大小: 535623
- 数据集大小: 2135411
- 特征
数据文件路径
-
default 配置
- train: data/train-*
- test: data/test-*
- validation: data/validation-*
-
knowledge 配置
- train: knowledge/train-*



