ninaxu/faker-example

Name: ninaxu/faker-example
Creator: ninaxu
Published: 2024-07-18 19:11:39
License: 暂无描述

Hugging Face2024-07-18 更新2024-07-22 收录

下载链接：

https://hf-mirror.com/datasets/ninaxu/faker-example

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集是通过distilabel工具创建的，包含一个`pipeline.yaml`文件，可以用来重现生成数据集的管道。数据集的结构包括三个主要特征：`column_name`、`id_faker_code`和`column_content`，其中`column_content`是一个字符串序列。数据集主要用于展示如何使用distilabel生成数据，并且可以通过Hugging Face的`datasets`库加载。

This dataset has been created with distilabel and contains a `pipeline.yaml` which can be used to reproduce the pipeline that generated it. The dataset structure includes three main features: `column_name`, `id_faker_code`, and `column_content`, where `column_content` is a sequence of strings. The dataset is primarily used to demonstrate how to generate data using distilabel and can be loaded via the Hugging Face `datasets` library.

提供机构：

ninaxu

原始信息汇总

数据集卡片：faker-example

数据集概述

该数据集包含一个pipeline.yaml文件，可用于在distilabel中重现生成该数据集的管道。

数据集结构

配置：default

特征

column_name: 字符串类型
id_faker_code: 字符串类型
column_content: 字符串序列

数据分割

train:
- 字节数: 2452
- 样本数: 3

数据加载

python from datasets import load_dataset

ds = load_dataset("ninaxu/faker-example", "default")

或简化为： python from datasets import load_dataset

ds = load_dataset("ninaxu/faker-example")

数据集信息

下载大小: 4240 字节
数据集大小: 2452 字节
标签:
- synthetic
- distilabel
- rlaif

5,000+

优质数据集

54 个

任务类型

进入经典数据集