jihuny/tldr_1.4b_10k_gopt_policy_norm_penul_prev_1e6_iter0
Source: Hugging Face. Updated 2025-11-03; indexed 2025-11-15.
Download link:
https://hf-mirror.com/datasets/jihuny/tldr_1.4b_10k_gopt_policy_norm_penul_prev_1e6_iter0
Description:
The dataset contains features related to text selection: the prompt text (prompt), the chosen text (chosen), the rejected text (rejected), and their corresponding indices, scores, log probabilities, and finish reasons. It has a single training split of 10,000 examples.
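As a rough illustration of the record layout described above, the following minimal sketch pulls the three text fields out of one row. Only the field names prompt, chosen, and rejected come from the description; the helper names (to_preference_pair, load_train_split), the sample row, and the assumption that the other columns (indices, scores, log probabilities, finish reasons) can be ignored here are all hypothetical. The loading function assumes the Hugging Face `datasets` package and network access, and is not called in the example.

```python
from typing import Any, Dict, Mapping

# Field names taken from the dataset description; everything else is illustrative.
REQUIRED_FIELDS = ("prompt", "chosen", "rejected")


def to_preference_pair(row: Mapping[str, Any]) -> Dict[str, str]:
    """Keep only the prompt/chosen/rejected text of one row (hypothetical helper)."""
    missing = [name for name in REQUIRED_FIELDS if name not in row]
    if missing:
        raise KeyError(f"row is missing expected fields: {missing}")
    return {name: str(row[name]) for name in REQUIRED_FIELDS}


def load_train_split():
    """Load the training split from the Hugging Face Hub.

    Assumes the `datasets` package is installed and the Hub is reachable;
    imported lazily so the rest of this sketch runs without it.
    """
    from datasets import load_dataset

    return load_dataset(
        "jihuny/tldr_1.4b_10k_gopt_policy_norm_penul_prev_1e6_iter0",
        split="train",
    )


# A made-up row, NOT taken from the real data, used only to show the shape.
example_row = {
    "prompt": "Summarize: the post text goes here.",
    "chosen": "A short summary that was preferred.",
    "rejected": "A summary that was not preferred.",
    "extra_column": 0.5,  # stands in for scores, log probabilities, etc.
}

pair = to_preference_pair(example_row)
print(sorted(pair))
```

Calling to_preference_pair on each row returned by load_train_split() would give plain prompt/chosen/rejected records; the real column names for the remaining features should be checked against the dataset itself before relying on them.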
Provider:
jihuny



