1231czx/add_dard_ms_trained_orm_1e6_test_on_ms_math
收藏Hugging Face2024-12-06 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/1231czx/add_dard_ms_trained_orm_1e6_test_on_ms_math
下载链接
链接失效反馈官方服务:
资源简介:
该数据集名为add_dard_ms_trained_orm_1e6_test_on_ms_math,包含了一个训练集(train)。数据集中的特征包括:提示文本(prompt)、答案序列(answers)、奖励值(rewards)和标签(label)。提示文本和答案序列为字符串类型,奖励值为浮点数类型,标签为整数类型。训练集包含500个示例,总文件大小为318,457,011字节。
The dataset named add_dard_ms_trained_orm_1e6_test_on_ms_math includes a training set (train). The features of the dataset consist of: prompt text (prompt), answer sequence (answers), reward values (rewards), and labels (label). The prompt text and answer sequence are of string type, the reward values are of float type, and the labels are of integer type. The training set contains 500 examples, with a total file size of 318,457,011 bytes.
提供机构:
1231czx



