Rickythechicken/llmtwin-dpo
收藏Hugging Face2025-12-11 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/Rickythechicken/llmtwin-dpo
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含三个字符串类型的特征:prompt、rejected和chosen。数据集分为train和test两个部分,其中训练集包含763个示例,测试集包含41个示例。
The dataset includes three string-type features: prompt, rejected, and chosen. It is divided into train and test splits, with the training set containing 763 examples and the test set containing 41 examples.
提供机构:
Rickythechicken



