DylanJHJ/beir-corpus
Source: Hugging Face. Updated 2025-12-03; indexed 2025-12-20.
Download link:
https://hf-mirror.com/datasets/DylanJHJ/beir-corpus
Resource description:
---
dataset_info:
  features:
  - name: docid
    dtype: string
  - name: title
    dtype: string
  - name: text
    dtype: string
  - name: source
    dtype: string
  splits:
  - name: beir.arguana
    num_bytes: 9457486
    num_examples: 8674
  - name: beir.climate_fever
    num_bytes: 3138495721
    num_examples: 5416593
  - name: beir.dbpedia_entity
    num_bytes: 1676319011
    num_examples: 4635922
  - name: beir.fever
    num_bytes: 3138438344
    num_examples: 5416568
  - name: beir.fiqa
    num_bytes: 45764316
    num_examples: 57638
  - name: beir.hotpotqa
    num_bytes: 1663152751
    num_examples: 5233329
  - name: beir.nfcorpus
    num_bytes: 5885762
    num_examples: 3633
  - name: beir.nq
    num_bytes: 1402869607
    num_examples: 2681468
  - name: beir.quora
    num_bytes: 46013118
    num_examples: 522931
  - name: beir.scidocs
    num_bytes: 32467743
    num_examples: 25657
  - name: beir.scifact
    num_bytes: 7916434
    num_examples: 5183
  - name: beir.trec_covid
    num_bytes: 196556433
    num_examples: 171332
  - name: beir.webis_touche2020
    num_bytes: 681128863
    num_examples: 382545
  download_size: 7195507243
  dataset_size: 12044465589
configs:
- config_name: default
  data_files:
  - split: beir.arguana
    path: data/beir.arguana-*
  - split: beir.climate_fever
    path: data/beir.climate_fever-*
  - split: beir.dbpedia_entity
    path: data/beir.dbpedia_entity-*
  - split: beir.fever
    path: data/beir.fever-*
  - split: beir.fiqa
    path: data/beir.fiqa-*
  - split: beir.hotpotqa
    path: data/beir.hotpotqa-*
  - split: beir.nfcorpus
    path: data/beir.nfcorpus-*
  - split: beir.nq
    path: data/beir.nq-*
  - split: beir.quora
    path: data/beir.quora-*
  - split: beir.scidocs
    path: data/beir.scidocs-*
  - split: beir.scifact
    path: data/beir.scifact-*
  - split: beir.trec_covid
    path: data/beir.trec_covid-*
  - split: beir.webis_touche2020
    path: data/beir.webis_touche2020-*
---
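Dataset information:
Features:
- Name: docid (document ID), data type: string
- Name: title, data type: string
- Name: text, data type: string
- Name: source, data type: string
Splits:
- Split name: beir.arguana, total bytes: 9457486, number of examples: 8674
- Split name: beir.climate_fever, total bytes: 3138495721, number of examples: 5416593
- Split name: beir.dbpedia_entity, total bytes: 1676319011, number of examples: 4635922
- Split name: beir.fever, total bytes: 3138438344, number of examples: 5416568
- Split name: beir.fiqa, total bytes: 45764316, number of examples: 57638
- Split name: beir.hotpotqa, total bytes: 1663152751, number of examples: 5233329
- Split name: beir.nfcorpus, total bytes: 5885762, number of examples: 3633
- Split name: beir.nq, total bytes: 1402869607, number of examples: 2681468
- Split name: beir.quora, total bytes: 46013118, number of examples: 522931
- Split name: beir.scidocs, total bytes: 32467743, number of examples: 25657
- Split name: beir.scifact, total bytes: 7916434, number of examples: 5183
- Split name: beir.trec_covid, total bytes: 196556433, number of examples: 171332
- Split name: beir.webis_touche2020, total bytes: 681128863, number of examples: 382545
Total download size: 7195507243 bytes; total dataset size: 12044465589 bytes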
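
As a quick sanity check on these figures, the per-split byte counts should add up to the reported dataset size. The sketch below copies the numbers from the card; the names `SPLITS` and `to_gib` are illustrative and not part of the dataset.

```python
# Per-split (num_bytes, num_examples), copied from the dataset card.
SPLITS = {
    "beir.arguana": (9457486, 8674),
    "beir.climate_fever": (3138495721, 5416593),
    "beir.dbpedia_entity": (1676319011, 4635922),
    "beir.fever": (3138438344, 5416568),
    "beir.fiqa": (45764316, 57638),
    "beir.hotpotqa": (1663152751, 5233329),
    "beir.nfcorpus": (5885762, 3633),
    "beir.nq": (1402869607, 2681468),
    "beir.quora": (46013118, 522931),
    "beir.scidocs": (32467743, 25657),
    "beir.scifact": (7916434, 5183),
    "beir.trec_covid": (196556433, 171332),
    "beir.webis_touche2020": (681128863, 382545),
}

DOWNLOAD_SIZE = 7195507243  # bytes, as reported on the card
DATASET_SIZE = 12044465589  # bytes, as reported on the card


def to_gib(num_bytes: int) -> float:
    """Convert a byte count to gibibytes (2**30 bytes)."""
    return num_bytes / 2**30


# Sum the two columns across all 13 splits.
total_bytes = sum(size for size, _ in SPLITS.values())
total_docs = sum(count for _, count in SPLITS.values())

print(f"splits sum to dataset_size: {total_bytes == DATASET_SIZE}")
print(f"total documents: {total_docs:,}")
print(f"download: {to_gib(DOWNLOAD_SIZE):.2f} GiB, dataset: {to_gib(DATASET_SIZE):.2f} GiB")
```

The split sizes sum exactly to the reported dataset size, so the card's totals are internally consistent.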
Configs:
- Config name: default; data files:
  - Split: beir.arguana, path: data/beir.arguana-*
  - Split: beir.climate_fever, path: data/beir.climate_fever-*
  - Split: beir.dbpedia_entity, path: data/beir.dbpedia_entity-*
  - Split: beir.fever, path: data/beir.fever-*
  - Split: beir.fiqa, path: data/beir.fiqa-*
  - Split: beir.hotpotqa, path: data/beir.hotpotqa-*
  - Split: beir.nfcorpus, path: data/beir.nfcorpus-*
  - Split: beir.nq, path: data/beir.nq-*
  - Split: beir.quora, path: data/beir.quora-*
  - Split: beir.scidocs, path: data/beir.scidocs-*
  - Split: beir.scifact, path: data/beir.scifact-*
  - Split: beir.trec_covid, path: data/beir.trec_covid-*
  - Split: beir.webis_touche2020, path: data/beir.webis_touche2020-*
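
With a single `default` config and one split per BEIR subset, an individual corpus can probably be loaded by split name through the Hugging Face `datasets` library. This is a minimal sketch and has not been run against the repository. `split_name` and `load_corpus` are hypothetical helpers introduced here, the repository ID is taken from the page title, and the load call assumes `datasets` is installed and the network is reachable.

```python
# Subset names as they appear after the "beir." prefix on the card.
BEIR_SUBSETS = (
    "arguana", "climate_fever", "dbpedia_entity", "fever", "fiqa",
    "hotpotqa", "nfcorpus", "nq", "quora", "scidocs", "scifact",
    "trec_covid", "webis_touche2020",
)

REPO_ID = "DylanJHJ/beir-corpus"  # taken from the page title


def split_name(subset: str) -> str:
    """Map a BEIR subset name to the split name used on the card."""
    if subset not in BEIR_SUBSETS:
        raise ValueError(f"unknown BEIR subset: {subset!r}")
    return f"beir.{subset}"


def load_corpus(subset: str, streaming: bool = True):
    """Load one corpus split (hypothetical helper, not part of the dataset).

    Streaming defaults to True because several splits exceed 1 GB.
    """
    # Imported lazily so split_name works without the library installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split=split_name(subset), streaming=streaming)
```

For example, `load_corpus("scifact")` would request the `beir.scifact` split; each record is expected to carry the `docid`, `title`, `text`, and `source` fields listed under the features.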
Provider:
DylanJHJ



