
nbalepur/doc_conflict_summary_split

Source: Hugging Face. Updated: 2024-06-19. Indexed: 2024-06-29.
Download link:
https://hf-mirror.com/datasets/nbalepur/doc_conflict_summary_split
Resource overview:
---
dataset_info:
  features:
  - name: query
    dtype: string
  - name: doc_urls
    sequence: string
  - name: doc_stances
    sequence: string
  - name: doc_texts
    sequence:
      sequence: string
  splits:
  - name: ConflictingQA_train
    num_bytes: 696584.0
    num_examples: 5
  - name: ConflictingQA_test
    num_bytes: 39705288.0
    num_examples: 285
  - name: Debatepedia_train
    num_bytes: 423969.5081967213
    num_examples: 5
  - name: Debatepedia_test
    num_bytes: 15093314.49180328
    num_examples: 178
  - name: DiverseSumm_train
    num_bytes: 222797.61047463174
    num_examples: 5
  - name: DiverseSumm_test
    num_bytes: 27003070.38952537
    num_examples: 606
  download_size: 45506816
  dataset_size: 83145024.0
configs:
- config_name: default
  data_files:
  - split: ConflictingQA_train
    path: data/ConflictingQA_train-*
  - split: ConflictingQA_test
    path: data/ConflictingQA_test-*
  - split: Debatepedia_train
    path: data/Debatepedia_train-*
  - split: Debatepedia_test
    path: data/Debatepedia_test-*
  - split: DiverseSumm_train
    path: data/DiverseSumm_train-*
  - split: DiverseSumm_test
    path: data/DiverseSumm_test-*
---
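As a quick sanity check on the metadata, the per-split figures can be totalled and compared with the declared `dataset_size`. The sketch below only re-adds numbers copied from the `dataset_info` block; the names `SPLITS`, `DATASET_SIZE`, and `totals` are illustrative and not part of the dataset.

```python
# Split metadata copied from the dataset_info block: name -> (num_bytes, num_examples).
SPLITS = {
    "ConflictingQA_train": (696584.0, 5),
    "ConflictingQA_test": (39705288.0, 285),
    "Debatepedia_train": (423969.5081967213, 5),
    "Debatepedia_test": (15093314.49180328, 178),
    "DiverseSumm_train": (222797.61047463174, 5),
    "DiverseSumm_test": (27003070.38952537, 606),
}

# Declared total size in bytes, also copied from the metadata.
DATASET_SIZE = 83145024.0


def totals(splits):
    """Return (total_bytes, total_examples) summed over all splits."""
    total_bytes = sum(num_bytes for num_bytes, _ in splits.values())
    total_examples = sum(num_examples for _, num_examples in splits.values())
    return total_bytes, total_examples


total_bytes, total_examples = totals(SPLITS)

print(total_examples)  # 1084
# The fractional byte counts are compared with a tolerance to absorb float rounding.
print(abs(total_bytes - DATASET_SIZE) < 1.0)  # True
```

The split sizes add up to the declared `dataset_size`. Each train split holds only 5 examples, so nearly all of the 1084 examples sit in the test splits.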

Dataset details:

Features:
- query: string
- doc_urls (document URLs): sequence of strings
- doc_stances (document stances): sequence of strings
- doc_texts (document texts): nested sequence of strings (sequence of sequence of string)

Splits:
- ConflictingQA_train: 696584.0 bytes, 5 examples
- ConflictingQA_test: 39705288.0 bytes, 285 examples
- Debatepedia_train: 423969.5081967213 bytes, 5 examples
- Debatepedia_test: 15093314.49180328 bytes, 178 examples
- DiverseSumm_train: 222797.61047463174 bytes, 5 examples
- DiverseSumm_test: 27003070.38952537 bytes, 606 examples

Download size: 45506816 bytes. Dataset size: 83145024.0 bytes.

Configs:
- default, with the following data files:
  - split ConflictingQA_train: data/ConflictingQA_train-*
  - split ConflictingQA_test: data/ConflictingQA_test-*
  - split Debatepedia_train: data/Debatepedia_train-*
  - split Debatepedia_test: data/Debatepedia_test-*
  - split DiverseSumm_train: data/DiverseSumm_train-*
  - split DiverseSumm_test: data/DiverseSumm_test-*
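A minimal sketch of how one split might be loaded and its documents walked, using the `load_dataset` function from the Hugging Face `datasets` library. It rests on assumptions the metadata does not confirm: that `doc_urls`, `doc_stances`, and `doc_texts` are parallel lists (one entry per document), and that each `doc_texts` entry is a list of text segments that can be joined with newlines. The helper names `load_split` and `iter_documents` and the placeholder record are hypothetical.

```python
from typing import Iterator, Tuple

REPO_ID = "nbalepur/doc_conflict_summary_split"


def load_split(split: str = "ConflictingQA_train"):
    """Load one named split from the Hub (needs network access and `datasets`)."""
    from datasets import load_dataset  # imported lazily so the helper below works without it

    return load_dataset(REPO_ID, split=split)


def iter_documents(example: dict) -> Iterator[Tuple[str, str, str]]:
    """Yield (url, stance, text) for each document of one example.

    Assumption: the three doc_* fields are aligned by position, and each
    doc_texts entry is a list of string segments.
    """
    for url, stance, segments in zip(
        example["doc_urls"], example["doc_stances"], example["doc_texts"]
    ):
        yield url, stance, "\n".join(segments)


# Placeholder record with the same field names as the dataset.
# The values are invented for illustration and do NOT come from the dataset.
placeholder = {
    "query": "placeholder query",
    "doc_urls": ["https://example.com/a", "https://example.com/b"],
    "doc_stances": ["stance_a", "stance_b"],
    "doc_texts": [["first segment", "second segment"], ["only segment"]],
}

docs = list(iter_documents(placeholder))
for url, stance, text in docs:
    print(url, stance, len(text))
```

With network access, `iter_documents(load_split("Debatepedia_train")[0])` would walk the documents of the first real example in the same way. If the three fields turn out not to be aligned, `zip` silently truncates to the shortest list, so a length check is worth adding before relying on the pairing.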
Provider:
nbalepur