mothnaZl/self_rewarding_ift_Qwen2.5-7B-Instruct
收藏Hugging Face2025-04-09 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/mothnaZl/self_rewarding_ift_Qwen2.5-7B-Instruct
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了一个字符串类型的gt标签,一段对话(包括对话内容和角色),一个浮点数表示的奖励值,以及对话的长度。数据集被划分为训练集,包含3210个样本,每个样本大约12050306字节。但没有具体描述数据集的应用场景或具体主题。
The dataset includes a string type gt label, a conversation consisting of content and role, a float type reward value, and the length of the conversation. The dataset is split into a training set with 3210 samples, each about 12050306 bytes in size. However, there is no specific description of the datasets application scenario or specific topic.
提供机构:
mothnaZl



