five

plaguss/argilla_sdk_docs_queries

收藏
Hugging Face2024-06-25 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/plaguss/argilla_sdk_docs_queries
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集是通过distilabel工具创建的,包含了一个`pipeline.yaml`文件,用于复现生成数据集的流程。数据集的结构包括多个字段,如`filename`、`repo_name`、`anchor`、`positive`、`negative`等,并且提供了一个示例JSON结构。数据集可以通过Hugging Face的`load_dataset`函数加载。

该数据集是通过distilabel工具创建的,包含了一个`pipeline.yaml`文件,用于复现生成数据集的流程。数据集的结构包括多个字段,如`filename`、`repo_name`、`anchor`、`positive`、`negative`等,并且提供了一个示例JSON结构。数据集可以通过Hugging Face的`load_dataset`函数加载。
提供机构:
plaguss
原始信息汇总

数据集卡片 for argilla_sdk_docs_queries

数据集概述

该数据集包含一个 pipeline.yaml 文件,可以使用 distilabel CLI 在 distilabel 中重现生成该数据集的管道:

console distilabel pipeline run --config "https://huggingface.co/datasets/plaguss/argilla_sdk_docs_queries/raw/main/pipeline.yaml"

或者探索配置:

console distilabel pipeline info --config "https://huggingface.co/datasets/plaguss/argilla_sdk_docs_queries/raw/main/pipeline.yaml"

数据集结构

示例按照以下结构进行配置:

<details><summary> 配置: default </summary><hr>

json { "anchor": "# Welcome to Argilla. Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.. u003cdiv class="grid cards" markdownu003e. - Get started in 5 minutes!. ---. Install argilla with pip and deploy a Docker locally or for free on Hugging Face to get up and running in minutes.. :octicons-arrow-right-24: Quickstart. - Educational guides. ---", "distilabel_metadata": { "raw_output_generate_sentence_pair": "## Positive

Can Argillau0027s collaboration platform ensure high-quality outputs and full data ownership for AI engineers and domain experts?

Negative

The beautiful scenery of the Italian town of Argilla inspired her to write a novel about love and freedom." }, "filename": "argilla-python/docs/index.md", "model_name_query": "meta-llama/Meta-Llama-3-70B-Instruct", "negative": "The beautiful scenery of the Italian town of Argilla inspired her to write a novel about love and freedom.", "positive": "Can Argillau0027s collaboration platform ensure high-quality outputs and full data ownership for AI engineers and domain experts?" }

该子集可以加载为:

python from datasets import load_dataset

ds = load_dataset("plaguss/argilla_sdk_docs_queries", "default")

或者简单地加载,因为只有一个配置并且命名为 default

python from datasets import load_dataset

ds = load_dataset("plaguss/argilla_sdk_docs_queries")

</details>

搜集汇总
数据集介绍
main_image_url
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作