DAMO-NLP-SG/Mistral-7B-LongPO-256K-tokenized
收藏Hugging Face2025-02-17 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/DAMO-NLP-SG/Mistral-7B-LongPO-256K-tokenized
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了一系列的字段,如选定的输入ID、注意力掩码、标签等,以及被拒绝的对应字段。这表明数据集可能用于某种比较或选择任务。数据集分为训练集,其大小为204,966,255,944字节,共有15,841个示例。不过,具体的数据集内容和用途在README中并未明确描述。
The dataset includes fields such as chosen input IDs, attention masks, labels, and corresponding fields for the rejected options. This suggests that the dataset might be used for some sort of comparison or selection task. The dataset is split into a training set, which is 204,966,255,944 bytes in size and contains 15,841 examples. However, the specific content and purpose of the dataset are not explicitly described in the README.
提供机构:
DAMO-NLP-SG



