nbalepur/doc_conflict_summary_split
收藏Hugging Face2024-06-19 更新2024-06-29 收录
下载链接:
https://hf-mirror.com/datasets/nbalepur/doc_conflict_summary_split
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: query
dtype: string
- name: doc_urls
sequence: string
- name: doc_stances
sequence: string
- name: doc_texts
sequence:
sequence: string
splits:
- name: ConflictingQA_train
num_bytes: 696584.0
num_examples: 5
- name: ConflictingQA_test
num_bytes: 39705288.0
num_examples: 285
- name: Debatepedia_train
num_bytes: 423969.5081967213
num_examples: 5
- name: Debatepedia_test
num_bytes: 15093314.49180328
num_examples: 178
- name: DiverseSumm_train
num_bytes: 222797.61047463174
num_examples: 5
- name: DiverseSumm_test
num_bytes: 27003070.38952537
num_examples: 606
download_size: 45506816
dataset_size: 83145024.0
configs:
- config_name: default
data_files:
- split: ConflictingQA_train
path: data/ConflictingQA_train-*
- split: ConflictingQA_test
path: data/ConflictingQA_test-*
- split: Debatepedia_train
path: data/Debatepedia_train-*
- split: Debatepedia_test
path: data/Debatepedia_test-*
- split: DiverseSumm_train
path: data/DiverseSumm_train-*
- split: DiverseSumm_test
path: data/DiverseSumm_test-*
---
数据集详情:
数据特征:
- 名称:查询(query),数据类型:字符串
- 名称:文档URL列表(doc_urls),数据类型:字符串序列(sequence)
- 名称:文档立场列表(doc_stances),数据类型:字符串序列(sequence)
- 名称:文档文本序列(doc_texts),数据类型:嵌套字符串序列(sequence of sequence of string)
数据拆分集:
- 名称:ConflictingQA_train,字节占用量:696584.0,样本数量:5
- 名称:ConflictingQA_test,字节占用量:39705288.0,样本数量:285
- 名称:Debatepedia_train,字节占用量:423969.5081967213,样本数量:5
- 名称:Debatepedia_test,字节占用量:15093314.49180328,样本数量:178
- 名称:DiverseSumm_train,字节占用量:222797.61047463174,样本数量:5
- 名称:DiverseSumm_test,字节占用量:27003070.38952537,样本数量:606
下载总大小:45506816 字节,数据集总占用大小:83145024.0 字节
配置项:
- 配置名称:default(默认配置),数据文件配置:
- 拆分集ConflictingQA_train:对应文件路径 data/ConflictingQA_train-*
- 拆分集ConflictingQA_test:对应文件路径 data/ConflictingQA_test-*
- 拆分集Debatepedia_train:对应文件路径 data/Debatepedia_train-*
- 拆分集Debatepedia_test:对应文件路径 data/Debatepedia_test-*
- 拆分集DiverseSumm_train:对应文件路径 data/DiverseSumm_train-*
- 拆分集DiverseSumm_test:对应文件路径 data/DiverseSumm_test-*
提供机构:
nbalepur



