HugScriptKitty/Coding-Fish
收藏Hugging Face2024-12-19 更新2024-12-21 收录
下载链接:
https://hf-mirror.com/datasets/HugScriptKitty/Coding-Fish
下载链接
链接失效反馈官方服务:
资源简介:
Coding-Fish数据集是一个使用distilabel工具创建的数据集,主要用于生成高质量的文本提示,特别是在软件开发和问题解决等技术领域。数据集包含三个主要特征:prompt(提示)、completion(完成)和system_prompt(系统提示)。数据集的结构为每个配置包含100个训练样本,总大小为528397字节。用户可以通过提供的`pipeline.yaml`文件复现生成数据集的流程。
The Coding-Fish dataset is a dataset created using the distilabel tool, primarily designed for generating high-quality text prompts, especially in technical domains such as software development and problem-solving. The dataset includes three main features: prompt, completion, and system_prompt. The dataset structure consists of 100 training examples per configuration, with a total size of 528397 bytes. Users can reproduce the pipeline that generated the dataset using the provided `pipeline.yaml` file.
提供机构:
HugScriptKitty



