jihuny/tldr_1.4b_10k_gopt_policy_norm_penul_mean_1e-3_iter0
收藏Hugging Face2025-11-03 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/jihuny/tldr_1.4b_10k_gopt_policy_norm_penul_mean_1e-3_iter0
下载链接
链接失效反馈官方服务:
资源简介:
这是一个用于文本选择的任务的数据集,包含了提示文本、选择的文本和被拒绝的文本等字段。数据集还提供了文本的索引、分数、对数概率以及嵌入向量的差异等信息。训练集包含10000个示例,文件大小为180MB。
This is a dataset for text selection tasks, including fields such as prompt text, chosen text, and rejected text. The dataset also provides information such as text indices, scores, log probabilities, and differences in embedding vectors. The training set contains 10,000 examples and is 180MB in size.
提供机构:
jihuny



