corniclr25/stack-mined-go-v1
收藏Hugging Face2024-11-19 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/corniclr25/stack-mined-go-v1
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含四个主要特征:查询(query)、文档(document)、负样本(negatives)和元数据(metadata)。元数据进一步包含目标(objective)结构,该结构下又分为配对(paired)、自我(self)和三元组(triplet)三个子结构。数据集分为一个训练集(train),包含7,000,000个样本,总大小为67,930,273,171字节。下载大小为24,368,886,917字节。
The dataset contains four main features: query, document, negatives, and metadata. The metadata further includes an objective structure, which is divided into three substructures: paired, self, and triplet. The dataset is divided into one training set (train) containing 7,000,000 samples, with a total size of 67,930,273,171 bytes. The download size is 24,368,886,917 bytes.
提供机构:
corniclr25



