mlfoundations-dev/instruction_filtering_fasttext_seed_data_code
收藏Hugging Face2025-02-14 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/mlfoundations-dev/instruction_filtering_fasttext_seed_data_code
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了对话指令种子(instruction_seed)、来源(source)、以及对应的70B蒸馏模型响应(r1_distill_70b_response)等字段。此外,每个样本还有一个原始行索引(__original_row_idx)和会话信息(conversations),其中会话信息包含发送者(from)和消息内容(value)。数据集被拆分为训练集(train),包含2500个示例,总大小为144MB。
The dataset includes fields such as conversation instruction seed (instruction_seed), source (source), and corresponding 70B distillation model responses (r1_distill_70b_response). Additionally, each sample has an original row index (__original_row_idx) and conversation information (conversations), which includes the sender (from) and message content (value). The dataset is split into a training set (train), containing 2500 examples, with a total size of 144MB.
提供机构:
mlfoundations-dev



