five

jihuny/tldr_1.4b_10k_gopt_policy_norm_penul_prev_1e3_iter0

收藏
Hugging Face2025-11-03 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/jihuny/tldr_1.4b_10k_gopt_policy_norm_penul_prev_1e3_iter0
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含了文本提示、选择的文本和拒绝的文本及其相关索引、分数、对数概率、完成原因和嵌入向量差异等信息,适用于文本选择或评估任务。数据集分为训练集,包含10000个示例。

The dataset includes text prompts, chosen and rejected texts along with their related indices, scores, log probabilities, completion reasons, and embedding vector differences, which is suitable for text selection or evaluation tasks. The dataset is split into a training set containing 10,000 examples.
提供机构:
jihuny
二维码
社区交流群
二维码
科研交流群
商业服务