InsultedByMathematics/all-online_alpha_1e-4_beta_3e-3-base-as-reference_update_387_eval
收藏Hugging Face2025-02-10 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/InsultedByMathematics/all-online_alpha_1e-4_beta_3e-3-base-as-reference_update_387_eval
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个文本生成相关的数据集,包含多个候选响应、提示文本以及对应的奖励值等信息。数据集使用了LLAMA模型,并记录了每个候选响应、拒绝响应和中立响应的token数量和对应的对数概率。数据集分为测试集,测试集名称为test_prefs,包含1678个示例,数据大小为132,683,235字节。
This dataset is related to text generation, containing multiple candidate responses, prompt text, and corresponding reward values. The dataset utilizes the LLAMA model and records the token count and log probabilities for each candidate, reject, and neutral response. The dataset is split into a test set named test_prefs, which includes 1678 examples and has a size of 132,683,235 bytes.
提供机构:
InsultedByMathematics



