Tiamz/uplimit-synthetic-data-week-2-basic
Source: Hugging Face. Updated: 2025-04-05. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/Tiamz/uplimit-synthetic-data-week-2-basic
Resource description:
This is a synthetic dataset intended for training and testing models. Each record contains a user question and the corresponding model output, along with metadata such as the input and output token counts. The dataset was generated with distilabel, and the generation process can be reproduced from the pipeline.yaml file provided with it.
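As a rough illustration of how the dataset and its pipeline.yaml might be accessed, here is a minimal Python sketch. It assumes the Hugging Face `datasets` library is installed, that the dataset exposes a `train` split, and that pipeline.yaml sits at the repository root on the `main` branch; none of these details are stated in the listing above. The helper names `pipeline_config_url` and `load_rows` are hypothetical and exist only for this example.

```python
REPO_ID = "Tiamz/uplimit-synthetic-data-week-2-basic"


def pipeline_config_url(repo_id: str, base: str = "https://huggingface.co") -> str:
    """Build the raw-file URL for pipeline.yaml.

    Assumption: the file lives at the repository root on the `main` branch.
    """
    return f"{base}/datasets/{repo_id}/raw/main/pipeline.yaml"


def load_rows(repo_id: str = REPO_ID, split: str = "train"):
    """Download one split of the dataset (needs network access).

    Assumption: a split named `train` exists. The import is done lazily so
    that the URL helper above can be used without `datasets` installed.
    """
    from datasets import load_dataset

    return load_dataset(repo_id, split=split)


if __name__ == "__main__":
    ds = load_rows()
    # Inspect the actual column names rather than assuming them.
    print(ds.column_names)
    print(ds[0])
    print(pipeline_config_url(REPO_ID))
```

Printing `column_names` first is deliberate: the listing only says the records hold questions, model outputs, and token-count metadata, so the exact field names should be read from the data. To reproduce the generation, the printed URL can then be passed to the distilabel command-line tool, for example `distilabel pipeline run --config <url>`, provided a compatible distilabel version is installed.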
Provided by:
Tiamz



