five

mlfoundations-dev/seed_math_multiple_samples_scale_up_all_real_run_2K

收藏
Hugging Face2025-02-18 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/mlfoundations-dev/seed_math_multiple_samples_scale_up_all_real_run_2K
下载链接
链接失效反馈
官方服务:
资源简介:
这是一个包含多个文本字段的数据集,主要用于训练语言模型。数据集由训练集组成,包含了大约6084个示例。每个示例都包含了指令种子(instruction_seed)、来源(source)、模型响应(r1_distill_70b_response)等信息。此外,每个示例中还可能包含原始行索引(__original_row_idx)、多数响应(_majority_responses)、经过验证的模型响应(verified_r1_distill_70b_response)以及对话信息(conversations),对话信息包括对话的发起者和对话内容。

This is a dataset with multiple text fields primarily intended for training language models. The dataset consists of a training set with approximately 6084 examples. Each example includes information such as instruction seed (instruction_seed), source (source), model response (r1_distill_70b_response), and more. Additionally, each example may contain the original row index (__original_row_idx), majority response (_majority_responses), verified model response (verified_r1_distill_70b_response), and conversation information (conversations), which includes the initiator of the conversation and the content of the conversation.
提供机构:
mlfoundations-dev
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作