1231czx/7b_dpo_iter2_data_gen_by_sft1epoch_and_dpoiter1_sft1epoch
收藏Hugging Face2024-06-30 更新2024-07-06 收录
下载链接:
https://hf-mirror.com/datasets/1231czx/7b_dpo_iter2_data_gen_by_sft1epoch_and_dpoiter1_sft1epoch
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个字段,用于存储问题和相关的解决方案、代码、预测等信息。具体字段包括索引(idx)、问题(question)、真实推理链(gt_cot)、真实答案(gt)、类型(type)、解决方案(solution)、我的解决方案序列(my_solu)、代码序列(code)、预测序列(pred)和报告序列(report)。数据集被分为一个训练集,包含666,592个样本,总大小为1,639,931,205字节。
This dataset contains multiple fields for storing questions and related solutions, codes, predictions, etc. Specific fields include index (idx), question (question), ground truth reasoning chain (gt_cot), ground truth answer (gt), type (type), solution (solution), my solution sequence (my_solu), code sequence (code), prediction sequence (pred), and report sequence (report). The dataset is divided into a training set containing 666,592 samples with a total size of 1,639,931,205 bytes.
提供机构:
1231czx



