plaguss/dolly_tutorial

Name: plaguss/dolly_tutorial
Creator: plaguss
Published: 2023-07-26 19:10:51
License: 暂无描述

Hugging Face2023-07-26 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/plaguss/dolly_tutorial

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集是通过Argilla创建的，包含一个符合Argilla数据集格式的配置文件`argilla.yaml`，以及兼容HuggingFace `datasets`库的记录。数据集可以用于不同的NLP任务，具体取决于配置。数据集结构包括字段、问题、建议和指南。字段是数据集记录本身，目前仅支持文本字段。问题是向标注者提出的问题，可以是评分、文本、单选或多选类型。建议是标注过程中的辅助信息。数据集包含一个训练集分割。

提供机构：

plaguss

原始信息汇总

数据集概述

名称: dolly_tutorial

大小: 10K<n<100K

标签:

rlfh
argilla
human-feedback

创建工具: Argilla

数据集内容

配置文件: argilla.yaml，符合Argilla数据集格式。
数据记录: 兼容HuggingFace datasets格式。
标注指南: 如有定义，可在标注指南部分找到。

加载方式

使用Argilla: python import argilla as rg ds = rg.FeedbackDataset.from_huggingface("plaguss/dolly_tutorial")
使用datasets库: python from datasets import load_dataset ds = load_dataset("plaguss/dolly_tutorial")

数据结构

字段:
- category (TextField)
- instruction (TextField)
- context (TextField)
- response (TextField)
问题:
- final-instruction (TextQuestion)
- final-context (TextQuestion)
- final-response (TextQuestion)
建议:
- final-instruction-suggestion (text)
- final-context-suggestion (text)
- final-response-suggestion (text)
外部ID: external_id (可选)

数据实例

Argilla格式示例: json { "external_id": "6", "fields": { "category": "closed_qa", "context": "...", "instruction": "...", "response": "..." }, "id": "...", "metadata": {}, "responses": [], "suggestions": [] }
HuggingFace datasets格式示例: json { "category": "closed_qa", "context": "...", "external_id": "6", "final-context": null, "final-context-suggestion": null, "final-context-suggestion-metadata": { "agent": null, "score": null, "type": null }, "final-instruction": null, "final-instruction-suggestion": null, "final-instruction-suggestion-metadata": { "agent": null, "score": null, "type": null }, "final-response": null, "final-response-suggestion": null, "final-response-suggestion-metadata": { "agent": null, "score": null, "type": null }, "instruction": "...", "metadata": null, "response": "..." }

数据分割

分割: train

5,000+

优质数据集

54 个

任务类型

进入经典数据集