mlfoundations-dev/seed_code_multiple_samples_random_scale_up_16k
收藏Hugging Face2025-02-19 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/mlfoundations-dev/seed_code_multiple_samples_random_scale_up_16k
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含文本生成响应和对话信息的的数据集,适用于文本处理或对话生成任务。数据集包含problem、source、domain等字段,以及文本生成响应字段如r1_distill_70b_response和verified_r1_distill_70b_response。此外,还有一个对话字段conversations,包含对话的发送者和内容。数据集分为训练集,共有16000个样本,总大小约为1.89GB。
This dataset includes text generation responses and conversation information, suitable for text processing or dialogue generation tasks. The dataset contains fields such as problem, source, domain, and text generation response fields like r1_distill_70b_response and verified_r1_distill_70b_response. Additionally, there is a conversation field, conversations, which includes the sender and content of the dialogue. The dataset is split into a training set with a total of 16,000 samples and is approximately 1.89GB in size.
提供机构:
mlfoundations-dev



