unigram/fol-02c

Name: unigram/fol-02c
Creator: unigram
Published: 2024-04-28 13:32:35
License: 暂无描述

Hugging Face2024-04-28 更新2024-06-12 收录

下载链接：

https://hf-mirror.com/datasets/unigram/fol-02c

下载链接

链接失效反馈

官方服务：

资源简介：

--- configs: - config_name: default data_files: - split: train path: data/train-* - split: validation path: data/validation-* - split: test path: data/test-* dataset_info: features: - name: premise dtype: string - name: hypothesis dtype: string - name: label dtype: string - name: premise_tptp dtype: string - name: hypothesis_tptp dtype: string - name: deberta_pred dtype: string - name: proof_inputs sequence: string - name: proof dtype: string splits: - name: train num_bytes: 405407571.52804404 num_examples: 26287 - name: validation num_bytes: 50677874.23597797 num_examples: 3286 - name: test num_bytes: 50677874.23597797 num_examples: 3286 download_size: 84656101 dataset_size: 506763319.99999994 --- # Dataset Card for "fol-02c" [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)

提供机构：

unigram

原始信息汇总

数据集概述

配置信息

默认配置：包含训练、验证和测试数据集。
- 训练数据：路径为 data/train-*。
- 验证数据：路径为 data/validation-*。
- 测试数据：路径为 data/test-*。

数据集特征

premise：数据类型为字符串。
hypothesis：数据类型为字符串。
label：数据类型为字符串。
premise_tptp：数据类型为字符串。
hypothesis_tptp：数据类型为字符串。
deberta_pred：数据类型为字符串。
proof_inputs：数据类型为序列字符串。
proof：数据类型为字符串。

数据集分割

训练集：包含26287个样本，总大小为405407571.52804404字节。
验证集：包含3286个样本，总大小为50677874.23597797字节。
测试集：包含3286个样本，总大小为50677874.23597797字节。

数据集大小

下载大小：84656101字节。
数据集总大小：506763319.99999994字节。

5,000+

优质数据集

54 个

任务类型

进入经典数据集