QuixiAI/OpenCoder-LLM_opc-sft-stage1-DolphinLabeled
收藏Hugging Face2025-01-07 更新2025-08-09 收录
下载链接:
https://hf-mirror.com/datasets/QuixiAI/OpenCoder-LLM_opc-sft-stage1-DolphinLabeled
下载链接
链接失效反馈官方服务:
资源简介:
OpenCoder-LLM SFT DolphinLabeled数据集是用于过滤OpenCoder-LLM SFT数据集的数据集。它包括三个部分:Filtered_infinity_instruct(从infinity_instruct筛选出的代码相关内容),Realuser_instruct(从真实的用户对话中提取的双语代码相关指令),以及Largescale_diverse_instruct(基于种子如CommonCrawl和Source Code生成的多样化代码相关指令)。
The OpenCoder-LLM SFT DolphinLabeled dataset is designed to filter the OpenCoder-LLM SFT dataset. It consists of three parts: Filtered_infinity_instruct (code-related content filtered from infinity_instruct), Realuser_instruct (bilingual code-related instructions extracted from real user conversations), and Largescale_diverse_instruct (diverse code-related instructions generated based on seeds like CommonCrawl and Source Code).
提供机构:
QuixiAI



