five

hitachi-nlp/JFLD_NLP_2024_proceeding_reproduction

收藏
Hugging Face2024-06-06 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/hitachi-nlp/JFLD_NLP_2024_proceeding_reproduction
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: - config_name: D1 features: - name: version dtype: string - name: hypothesis dtype: string - name: hypothesis_formula dtype: string - name: facts dtype: string - name: facts_formula dtype: string - name: proofs sequence: string - name: proofs_formula sequence: string - name: negative_hypothesis dtype: string - name: negative_hypothesis_formula dtype: string - name: negative_proofs sequence: string - name: negative_original_tree_depth dtype: int64 - name: original_tree_depth dtype: int64 - name: depth dtype: int64 - name: num_formula_distractors dtype: int64 - name: num_translation_distractors dtype: int64 - name: num_all_distractors dtype: int64 - name: proof_label dtype: string - name: negative_proof_label dtype: string - name: world_assump_label dtype: string - name: negative_world_assump_label dtype: string - name: prompt_serial dtype: string - name: proof_serial dtype: string - name: prompt_serial_formula dtype: string - name: proof_serial_formula dtype: string splits: - name: train num_bytes: 126055185 num_examples: 30000 - name: validation num_bytes: 21045114 num_examples: 5000 - name: test num_bytes: 21111881 num_examples: 5000 download_size: 59655862 dataset_size: 168212180 - config_name: D1_minus features: - name: version dtype: string - name: hypothesis dtype: string - name: hypothesis_formula dtype: string - name: facts dtype: string - name: facts_formula dtype: string - name: proofs sequence: string - name: proofs_formula sequence: string - name: negative_hypothesis dtype: 'null' - name: negative_hypothesis_formula dtype: 'null' - name: negative_proofs sequence: 'null' - name: negative_original_tree_depth dtype: 'null' - name: original_tree_depth dtype: int64 - name: depth dtype: int64 - name: num_formula_distractors dtype: int64 - name: num_translation_distractors dtype: int64 - name: num_all_distractors dtype: int64 - name: proof_label dtype: string - name: negative_proof_label dtype: 'null' - name: world_assump_label dtype: string - name: negative_world_assump_label dtype: 'null' - name: prompt_serial dtype: string - name: proof_serial dtype: string - name: prompt_serial_formula dtype: string - name: proof_serial_formula dtype: string splits: - name: train num_bytes: 27344588 num_examples: 30000 - name: validation num_bytes: 4570595 num_examples: 5000 - name: test num_bytes: 4526013 num_examples: 5000 download_size: 10959467 dataset_size: 36441196 - config_name: D3 features: - name: version dtype: string - name: hypothesis dtype: string - name: hypothesis_formula dtype: string - name: facts dtype: string - name: facts_formula dtype: string - name: proofs sequence: string - name: proofs_formula sequence: string - name: negative_hypothesis dtype: string - name: negative_hypothesis_formula dtype: string - name: negative_proofs sequence: string - name: negative_original_tree_depth dtype: int64 - name: original_tree_depth dtype: int64 - name: depth dtype: int64 - name: num_formula_distractors dtype: int64 - name: num_translation_distractors dtype: int64 - name: num_all_distractors dtype: int64 - name: proof_label dtype: string - name: negative_proof_label dtype: string - name: world_assump_label dtype: string - name: negative_world_assump_label dtype: string - name: prompt_serial dtype: string - name: proof_serial dtype: string - name: prompt_serial_formula dtype: string - name: proof_serial_formula dtype: string splits: - name: train num_bytes: 144287506 num_examples: 30000 - name: validation num_bytes: 23744356 num_examples: 5000 - name: test num_bytes: 24230088 num_examples: 5000 download_size: 68178239 dataset_size: 192261950 - config_name: D8 features: - name: version dtype: string - name: hypothesis dtype: string - name: hypothesis_formula dtype: string - name: facts dtype: string - name: facts_formula dtype: string - name: proofs sequence: string - name: proofs_formula sequence: string - name: negative_hypothesis dtype: string - name: negative_hypothesis_formula dtype: string - name: negative_proofs sequence: string - name: negative_original_tree_depth dtype: int64 - name: original_tree_depth dtype: int64 - name: depth dtype: int64 - name: num_formula_distractors dtype: int64 - name: num_translation_distractors dtype: int64 - name: num_all_distractors dtype: int64 - name: proof_label dtype: string - name: negative_proof_label dtype: string - name: world_assump_label dtype: string - name: negative_world_assump_label dtype: string - name: prompt_serial dtype: string - name: proof_serial dtype: string - name: prompt_serial_formula dtype: string - name: proof_serial_formula dtype: string splits: - name: train num_bytes: 182290396 num_examples: 30000 - name: validation num_bytes: 30145838 num_examples: 5000 - name: test num_bytes: 30190990 num_examples: 5000 download_size: 84423463 dataset_size: 242627224 configs: - config_name: D1 data_files: - split: train path: D1/train-* - split: validation path: D1/validation-* - split: test path: D1/test-* - config_name: D1_minus data_files: - split: train path: D1_minus/train-* - split: validation path: D1_minus/validation-* - split: test path: D1_minus/test-* - config_name: D3 data_files: - split: train path: D3/train-* - split: validation path: D3/validation-* - split: test path: D3/test-* - config_name: D8 data_files: - split: train path: D8/train-* - split: validation path: D8/validation-* - split: test path: D8/test-* --- # Dataset Card for "JFLD_NLP_2024_proceeding_reproduction" See [here](https://github.com/hitachi-nlp/FLD-corpus.git) for the details of this corpus. For the whole of the project, see [our project page](https://github.com/hitachi-nlp/FLD/). [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
hitachi-nlp
原始信息汇总

数据集概述

数据集配置

配置 D1

  • 特征:
    • version: string
    • hypothesis: string
    • hypothesis_formula: string
    • facts: string
    • facts_formula: string
    • proofs: sequence of string
    • proofs_formula: sequence of string
    • negative_hypothesis: string
    • negative_hypothesis_formula: string
    • negative_proofs: sequence of string
    • negative_original_tree_depth: int64
    • original_tree_depth: int64
    • depth: int64
    • num_formula_distractors: int64
    • num_translation_distractors: int64
    • num_all_distractors: int64
    • proof_label: string
    • negative_proof_label: string
    • world_assump_label: string
    • negative_world_assump_label: string
    • prompt_serial: string
    • proof_serial: string
    • prompt_serial_formula: string
    • proof_serial_formula: string
  • 分割:
    • train: 126055185 字节, 30000 样本
    • validation: 21045114 字节, 5000 样本
    • test: 21111881 字节, 5000 样本
  • 下载大小: 59655862 字节
  • 数据集大小: 168212180 字节

配置 D1_minus

  • 特征:
    • version: string
    • hypothesis: string
    • hypothesis_formula: string
    • facts: string
    • facts_formula: string
    • proofs: sequence of string
    • proofs_formula: sequence of string
    • negative_hypothesis: null
    • negative_hypothesis_formula: null
    • negative_proofs: sequence of null
    • negative_original_tree_depth: null
    • original_tree_depth: int64
    • depth: int64
    • num_formula_distractors: int64
    • num_translation_distractors: int64
    • num_all_distractors: int64
    • proof_label: string
    • negative_proof_label: null
    • world_assump_label: string
    • negative_world_assump_label: null
    • prompt_serial: string
    • proof_serial: string
    • prompt_serial_formula: string
    • proof_serial_formula: string
  • 分割:
    • train: 27344588 字节, 30000 样本
    • validation: 4570595 字节, 5000 样本
    • test: 4526013 字节, 5000 样本
  • 下载大小: 10959467 字节
  • 数据集大小: 36441196 字节

配置 D3

  • 特征:
    • version: string
    • hypothesis: string
    • hypothesis_formula: string
    • facts: string
    • facts_formula: string
    • proofs: sequence of string
    • proofs_formula: sequence of string
    • negative_hypothesis: string
    • negative_hypothesis_formula: string
    • negative_proofs: sequence of string
    • negative_original_tree_depth: int64
    • original_tree_depth: int64
    • depth: int64
    • num_formula_distractors: int64
    • num_translation_distractors: int64
    • num_all_distractors: int64
    • proof_label: string
    • negative_proof_label: string
    • world_assump_label: string
    • negative_world_assump_label: string
    • prompt_serial: string
    • proof_serial: string
    • prompt_serial_formula: string
    • proof_serial_formula: string
  • 分割:
    • train: 144287506 字节, 30000 样本
    • validation: 23744356 字节, 5000 样本
    • test: 24230088 字节, 5000 样本
  • 下载大小: 68178239 字节
  • 数据集大小: 192261950 字节

配置 D8

  • 特征:
    • version: string
    • hypothesis: string
    • hypothesis_formula: string
    • facts: string
    • facts_formula: string
    • proofs: sequence of string
    • proofs_formula: sequence of string
    • negative_hypothesis: string
    • negative_hypothesis_formula: string
    • negative_proofs: sequence of string
    • negative_original_tree_depth: int64
    • original_tree_depth: int64
    • depth: int64
    • num_formula_distractors: int64
    • num_translation_distractors: int64
    • num_all_distractors: int64
    • proof_label: string
    • negative_proof_label: string
    • world_assump_label: string
    • negative_world_assump_label: string
    • prompt_serial: string
    • proof_serial: string
    • prompt_serial_formula: string
    • proof_serial_formula: string
  • 分割:
    • train: 182290396 字节, 30000 样本
    • validation: 30145838 字节, 5000 样本
    • test: 30190990 字节, 5000 样本
  • 下载大小: 84423463 字节
  • 数据集大小: 242627224 字节

数据文件路径

  • D1:
    • train: D1/train-*
    • validation: D1/validation-*
    • test: D1/test-*
  • D1_minus:
    • train: D1_minus/train-*
    • validation: D1_minus/validation-*
    • test: D1_minus/test-*
  • D3:
    • train: D3/train-*
    • validation: D3/validation-*
    • test: D3/test-*
  • D8:
    • train: D8/train-*
    • validation: D8/validation-*
    • test: D8/test-*
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作