five

qbao775/PARARULE-Plus-Depth-5

收藏
Hugging Face2023-06-05 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/qbao775/PARARULE-Plus-Depth-5
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit task_categories: - text-classification - question-answering language: - en tags: - Reasoning - Multi-Step-Deductive-Reasoning - Logical-Reasoning size_categories: - 100K<n<1M --- # PARARULE-Plus-Depth-5 This is a branch which includes the dataset from PARARULE-Plus Depth=5. PARARULE Plus is a deep multi-step reasoning dataset over natural language. It can be seen as an improvement on the dataset of PARARULE (Peter Clark et al., 2020). Both PARARULE and PARARULE-Plus follow the closed-world assumption and negation as failure. The motivation is to generate deeper PARARULE training samples. We add more training samples for the case where the depth is greater than or equal to two to explore whether Transformer has reasoning ability. PARARULE Plus is a combination of two types of entities, animals and people, and corresponding relationships and attributes. From the depth of 2 to the depth of 5, we have around 100,000 samples in the depth of each layer, and there are nearly 400,000 samples in total. Here is the original links for PARARULE-Plus including paper, project and data. Paper: https://www.cs.ox.ac.uk/isg/conferences/tmp-proceedings/NeSy2022/paper15.pdf Project: https://github.com/Strong-AI-Lab/Multi-Step-Deductive-Reasoning-Over-Natural-Language Data: https://github.com/Strong-AI-Lab/PARARULE-Plus PARARULE-Plus has been collected and merged by [LogiTorch.ai](https://www.logitorch.ai/), [ReasoningNLP](https://github.com/FreedomIntelligence/ReasoningNLP), [Prompt4ReasoningPapers](https://github.com/zjunlp/Prompt4ReasoningPapers) and [OpenAI/Evals](https://github.com/openai/evals/pull/651). In this huggingface version, we pre-processed the dataset and use `1` to represent `true` and `0` to represent `false` to better help user train model. ## How to load the dataset? ``` from datasets import load_dataset dataset = load_dataset("qbao775/PARARULE-Plus-Depth-5") ``` ## How to train a model using the dataset? We provide an [example](https://github.com/Strong-AI-Lab/PARARULE-Plus/blob/main/README.md#an-example-script-to-load-pararule-plus-and-fine-tune-bert) that you can `git clone` the project and fine-tune the dataset locally. ## Citation ``` @inproceedings{bao2022multi, title={Multi-Step Deductive Reasoning Over Natural Language: An Empirical Study on Out-of-Distribution Generalisation}, author={Qiming Bao and Alex Yuxuan Peng and Tim Hartill and Neset Tan and Zhenyun Deng and Michael Witbrock and Jiamou Liu}, year={2022}, publisher={The 2nd International Joint Conference on Learning and Reasoning and 16th International Workshop on Neural-Symbolic Learning and Reasoning (IJCLR-NeSy 2022)} } ```
提供机构:
qbao775
原始信息汇总

数据集概述

数据集名称

  • PARARULE-Plus-Depth-5

数据集描述

  • PARARULE-Plus-Depth-5 是 PARARULE-Plus 的一个分支,专注于深度为5的多步骤推理数据集。它是 PARARULE 数据集的改进版本,旨在生成更深层次的训练样本,以探索Transformer的推理能力。

数据集特点

  • 遵循封闭世界假设和否定作为失败的原则。
  • 结合了动物和人两类实体,以及相应的关联和属性。
  • 深度从2到5,每个深度层约有100,000个样本,总计近400,000个样本。

数据集规模

  • 规模类别:100K<n<1M

数据集应用

  • 任务类别:文本分类、问答
  • 语言:英语
  • 标签:推理、多步骤演绎推理、逻辑推理

数据集处理

  • 在Huggingface版本中,数据集经过预处理,使用1表示true0表示false,以帮助用户更好地训练模型。

数据集加载

python from datasets import load_dataset dataset = load_dataset("qbao775/PARARULE-Plus-Depth-5")

数据集引用

@inproceedings{bao2022multi, title={Multi-Step Deductive Reasoning Over Natural Language: An Empirical Study on Out-of-Distribution Generalisation}, author={Qiming Bao and Alex Yuxuan Peng and Tim Hartill and Neset Tan and Zhenyun Deng and Michael Witbrock and Jiamou Liu}, year={2022}, publisher={The 2nd International Joint Conference on Learning and Reasoning and 16th International Workshop on Neural-Symbolic Learning and Reasoning (IJCLR-NeSy 2022)} }

5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作