jihuny/tldr_1.4b_10k_gopt_policy_unnorm_penul_last_1e3_iter0
收藏Hugging Face2025-11-03 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/jihuny/tldr_1.4b_10k_gopt_policy_unnorm_penul_last_1e3_iter0
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了文本提示和对应的多个选项,每个选项有选择的得分、日志概率、完成原因等信息。此外,还包括了嵌入向量差和修正嵌入向量差等特征。数据集被分为训练集,共有10000个示例。
The dataset includes text prompts and corresponding multiple options, each with selection scores, log probabilities, completion reasons, etc. It also includes features like embedding difference and corrected embedding difference. The dataset is split into a training set with a total of 10,000 examples.
提供机构:
jihuny



