five

xudongwu/RPL_Q0.5B_U10_beta0.10rho0.00K4_sf1.00

收藏
Hugging Face2026-04-29 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/xudongwu/RPL_Q0.5B_U10_beta0.10rho0.00K4_sf1.00
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: config_name: Q0.5B features: - name: prompt dtype: string - name: chosen dtype: string - name: rejected dtype: string - name: response dtype: string - name: reward_score dtype: float64 - name: gpt_score dtype: float64 splits: - name: default num_bytes: 1523040 num_examples: 256 download_size: 767384 dataset_size: 1523040 configs: - config_name: Q0.5B data_files: - split: default path: Q0.5B/default-* ---
提供机构:
xudongwu
二维码
社区交流群
二维码
科研交流群
商业服务