plaguss/argilla_sdk_docs_queries

Name: plaguss/argilla_sdk_docs_queries
Creator: plaguss
Published: 2024-06-25 14:38:29
License: 暂无描述

Hugging Face2024-06-25 更新2024-06-12 收录

下载链接：

https://hf-mirror.com/datasets/plaguss/argilla_sdk_docs_queries

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集是通过distilabel工具创建的，包含了一个`pipeline.yaml`文件，用于复现生成数据集的流程。数据集的结构包括多个字段，如`filename`、`repo_name`、`anchor`、`positive`、`negative`等，并且提供了一个示例JSON结构。数据集可以通过Hugging Face的`load_dataset`函数加载。

提供机构：

plaguss

原始信息汇总

数据集卡片 for argilla_sdk_docs_queries

数据集概述

该数据集包含一个 pipeline.yaml 文件，可以使用 distilabel CLI 在 distilabel 中重现生成该数据集的管道：

console distilabel pipeline run --config "https://huggingface.co/datasets/plaguss/argilla_sdk_docs_queries/raw/main/pipeline.yaml"

或者探索配置：

console distilabel pipeline info --config "https://huggingface.co/datasets/plaguss/argilla_sdk_docs_queries/raw/main/pipeline.yaml"

数据集结构

示例按照以下结构进行配置：

<details><summary> 配置: default </summary><hr>

json { "anchor": "# Welcome to Argilla. Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.. u003cdiv class="grid cards" markdownu003e. - Get started in 5 minutes!. ---. Install argilla with pip and deploy a Docker locally or for free on Hugging Face to get up and running in minutes.. :octicons-arrow-right-24: Quickstart. - Educational guides. ---", "distilabel_metadata": { "raw_output_generate_sentence_pair": "## Positive

Can Argillau0027s collaboration platform ensure high-quality outputs and full data ownership for AI engineers and domain experts?

Negative

The beautiful scenery of the Italian town of Argilla inspired her to write a novel about love and freedom." }, "filename": "argilla-python/docs/index.md", "model_name_query": "meta-llama/Meta-Llama-3-70B-Instruct", "negative": "The beautiful scenery of the Italian town of Argilla inspired her to write a novel about love and freedom.", "positive": "Can Argillau0027s collaboration platform ensure high-quality outputs and full data ownership for AI engineers and domain experts?" }

该子集可以加载为：

python from datasets import load_dataset

ds = load_dataset("plaguss/argilla_sdk_docs_queries", "default")

或者简单地加载，因为只有一个配置并且命名为 default：

python from datasets import load_dataset

ds = load_dataset("plaguss/argilla_sdk_docs_queries")

</details>

搜集汇总

数据集介绍