jihuny/tldr_1.4b_10k_gopt_policy_unnorm_penul_last_1e-3_iter0
收藏Hugging Face2025-11-03 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/jihuny/tldr_1.4b_10k_gopt_policy_unnorm_penul_last_1e-3_iter0
下载链接
链接失效反馈官方服务:
资源简介:
这是一个关于文本选择和评估的数据集,包含文本提示、选择的文本和被拒绝的文本及其相关信息,如索引、得分、对数概率、完成原因、嵌入向量差异和排名。
This is a dataset about text selection and evaluation, containing text prompts, chosen texts and rejected texts along with their related information such as indices, scores, log probabilities, finish reasons, embedding vector differences, and rankings.
提供机构:
jihuny



