corniclr25/stack-mined-python-v1
Source: Hugging Face | Updated: 2024-11-19 | Indexed: 2024-12-14
Download link:
https://hf-mirror.com/datasets/corniclr25/stack-mined-python-v1
Description:
The dataset has four main features: query, document, negatives, and metadata. The metadata defines an objective structure with three substructures: paired, self, and triplet. The dataset ships as a single training split of 10,000,000 samples, with a total size of 107,750,038,700 bytes and a download size of 41,516,082,386 bytes.
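To put the byte counts in perspective, the short sketch below converts them into GiB and derives the average size per sample and the download-to-total ratio. It uses only the three figures stated in the description. The function name `summarize` and the dictionary keys are illustrative choices, not part of the dataset or of any library.

```python
# Figures taken verbatim from the dataset description.
NUM_SAMPLES = 10_000_000
TOTAL_BYTES = 107_750_038_700
DOWNLOAD_BYTES = 41_516_082_386


def summarize(num_samples: int, total_bytes: int, download_bytes: int) -> dict:
    """Derive human-readable size statistics (hypothetical helper)."""
    gib = 2 ** 30  # bytes per gibibyte
    return {
        "total_gib": total_bytes / gib,
        "download_gib": download_bytes / gib,
        "avg_bytes_per_sample": total_bytes / num_samples,
        "download_ratio": download_bytes / total_bytes,
    }


if __name__ == "__main__":
    for name, value in summarize(NUM_SAMPLES, TOTAL_BYTES, DOWNLOAD_BYTES).items():
        print(f"{name}: {value:.2f}")
```

On these numbers the download is a little over a third of the total size, and each sample averages roughly 10.8 kB, which is worth checking against available disk space before fetching the full training split.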
Provider:
corniclr25



