abhisheksagar/llm-PairRM-preference-pairs
收藏Hugging Face2025-04-11 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/abhisheksagar/llm-PairRM-preference-pairs
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含五个字段:提示(prompt)、选中(chosen)、拒绝(rejected)、选中评分(chosen_rating)和拒绝评分(rejected_rating)。提示是字符串类型,用于提供某些情境或问题。选中字段和拒绝字段分别记录了对应的选项,两者也都是字符串类型。选中评分和拒绝评分是整型,用于表示对选中或拒绝选项的评分。数据集的训练集有500个示例,总大小为1953858字节。
The dataset includes five fields: prompt, chosen, rejected, chosen_rating, and rejected_rating. The prompt is a string type used to provide certain contexts or questions. The chosen and rejected fields record the corresponding options, both are string types. Chosen_rating and rejected_rating are integers used to represent the ratings for the chosen or rejected options. The training set of the dataset has 500 examples with a total size of 1953858 bytes.
提供机构:
abhisheksagar



