tom-010/enwiki-abstracts-jina-v3-2410
Source: Hugging Face | Updated: 2024-11-04 | Indexed: 2024-12-14
Download link:
https://hf-mirror.com/datasets/tom-010/enwiki-abstracts-jina-v3-2410
Description:
The dataset has three features: title, abstract, and embeddings. It ships a single split, train, with 6,758,661 examples. The dataset size is 31,417,237,836 bytes (about 31.4 GB) and the download size is 34,386,243,251 bytes (about 34.4 GB). It defines one configuration, default, whose data files match the path data/train-*.
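As a rough illustration of the layout described above, the following Python sketch streams a few rows instead of downloading the full dataset. It assumes the Hugging Face `datasets` library is installed and that the repository is reachable under the ID shown in the title. The helper names `bytes_per_example` and `peek` are hypothetical and exist only for this example.

```python
# Figures copied from the dataset description above.
REPO_ID = "tom-010/enwiki-abstracts-jina-v3-2410"
NUM_EXAMPLES = 6_758_661
DATASET_SIZE_BYTES = 31_417_237_836
DOWNLOAD_SIZE_BYTES = 34_386_243_251


def bytes_per_example(total_bytes=DATASET_SIZE_BYTES, num_examples=NUM_EXAMPLES):
    """Average stored size of one example, in bytes (hypothetical helper)."""
    return total_bytes / num_examples


def peek(n=3):
    """Return the first n rows of the train split via streaming.

    Assumption: the `datasets` library is installed and the repo is public.
    Streaming avoids fetching the whole download up front.
    """
    from datasets import load_dataset  # imported lazily; needs network access

    stream = load_dataset(REPO_ID, split="train", streaming=True)
    return list(stream.take(n))
```

Calling `peek(3)` should yield dictionaries keyed by title, abstract, and embeddings, while `bytes_per_example()` gives a quick estimate of per-row storage (roughly 4.6 KB) when planning disk or memory budgets.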
Provider:
tom-010



