gupta-tanish/Ultrafeedback-SWEPO-3-responses
收藏Hugging Face2025-01-27 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/gupta-tanish/Ultrafeedback-SWEPO-3-responses
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了用于训练和测试的文本数据,其中每个示例由一个提示(prompt)和与之相关的多个角色(role)和内容(content)组成。数据集分为偏好(train_prefs和test_prefs)、SFT(train_sft和test_sft)以及生成(train_gen和test_gen)等不同的部分,每个部分都有对应的训练和测试数据。数据集中的评分(score)可能用于评估不同内容的响应质量。
The dataset consists of text data for training and testing, where each example includes a prompt and multiple associated roles and contents. The dataset is split into different sections for preferences (train_prefs and test_prefs), SFT (train_sft and test_sft), and generation (train_gen and test_gen), each with corresponding training and test data. The scores in the dataset may be used to evaluate the quality of responses for different contents.
提供机构:
gupta-tanish



