JaehyeokLee/dn_sft_part_1
Source: Hugging Face. Updated 2025-02-28. Indexed 2025-04-12.
Download link:
https://hf-mirror.com/datasets/JaehyeokLee/dn_sft_part_1
Overview:
The dataset consists of text pairs: each record holds an anchor text and a related positive text, along with a subset label and the character counts of the anchor and positive texts. It currently has a single split, train, with about 1,000,603 examples and a total size of roughly 4 GB.
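The record layout described above can be sketched in Python. This is a minimal sketch, not the publisher's own loading code: it assumes the columns are literally named `anchor`, `positive`, and `subset`, and because the listing does not give the names of the character-count columns, it recomputes the lengths with `len()` instead of reading them. The helper names `load_train_stream` and `iter_pairs` are hypothetical, and the sample rows are made up for illustration.

```python
from typing import Dict, Iterable, Iterator, Tuple

DATASET_ID = "JaehyeokLee/dn_sft_part_1"  # repository name taken from the listing


def load_train_stream():
    """Stream the train split rather than downloading roughly 4 GB up front.

    Needs the Hugging Face `datasets` package and network access.
    """
    from datasets import load_dataset

    return load_dataset(DATASET_ID, split="train", streaming=True)


def iter_pairs(
    rows: Iterable[Dict[str, str]],
) -> Iterator[Tuple[str, str, str, int, int]]:
    """Yield (subset, anchor, positive, anchor_chars, positive_chars) per row.

    Column names are assumed from the description; the character counts are
    recomputed here because their column names are not stated.
    """
    for row in rows:
        anchor, positive = row["anchor"], row["positive"]
        yield row["subset"], anchor, positive, len(anchor), len(positive)


# Made-up rows in the assumed shape, so the sketch runs without a download.
sample_rows = [
    {"anchor": "What is a text pair?", "positive": "An anchor and a match.", "subset": "demo"},
    {"anchor": "abc", "positive": "de", "subset": "demo"},
]

for record in iter_pairs(sample_rows):
    print(record)
```

To try it on the real data, pass `load_train_stream()` to `iter_pairs` in place of `sample_rows`; if the actual column names differ from the assumed ones, the lookups in `iter_pairs` need adjusting.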
Provided by:
JaehyeokLee



