Babelscape/LLM-Oasis_paraphrase_generation
收藏Hugging Face2024-12-02 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/Babelscape/LLM-Oasis_paraphrase_generation
下载链接
链接失效反馈官方服务:
资源简介:
LLM-Oasis_paraphrase_generation数据集是LLM-Oasis套件的一部分,包含从Wikipedia段落中提取的一组声明生成的改写文本。该数据集支持LLM-Oasis论文中第3.3节描述的改写生成步骤。数据集包含Wikipedia页面的标题、段落文本、从文本中提取的声明序列以及基于这些声明的改写文本。训练分割包含67,419个例子,验证分割包含13,848个例子。
LLM-Oasis_paraphrase_generation is part of the LLM-Oasis suite and contains paraphrases generated from a set of claims extracted from a Wikipedia passage. The dataset features include the title of the Wikipedia page, a passage of 5 sentences from the page, a sequence of claims extracted from the text, and a paraphrased version of the text based on the claims. The dataset is divided into a train split with 67,419 examples and a validation split with 13,848 examples. The dataset is licensed under CC BY-NC-SA 4.0.
提供机构:
Babelscape



