hgissbkh/Mmarco-reranking
收藏Hugging Face2024-05-22 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/hgissbkh/Mmarco-reranking
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: acge_text_embedding
features:
- name: query
dtype: string
- name: docs
sequence: string
- name: query_enc
sequence: float64
- name: docs_enc
sequence:
sequence: float64
- name: cos_scores
sequence: float64
- name: target
sequence: int64
splits:
- name: train
num_bytes: 1470202760
num_examples: 100
download_size: 1116166126
dataset_size: 1470202760
- config_name: gte-large-zh
features:
- name: query
dtype: string
- name: docs
sequence: string
- name: query_enc
sequence: float64
- name: docs_enc
sequence:
sequence: float64
- name: cos_scores
sequence: float64
- name: target
sequence: int64
splits:
- name: train
num_bytes: 855028616
num_examples: 100
download_size: 648073470
dataset_size: 855028616
- config_name: multilingual-e5-base
features:
- name: query
dtype: string
- name: docs
sequence: string
- name: query_enc
sequence: float64
- name: docs_enc
sequence:
sequence: float64
- name: cos_scores
sequence: float64
- name: target
sequence: int64
splits:
- name: train
num_bytes: 649970568
num_examples: 100
download_size: 490095378
dataset_size: 649970568
- config_name: multilingual-e5-large
features:
- name: query
dtype: string
- name: docs
sequence: string
- name: query_enc
sequence: float64
- name: docs_enc
sequence:
sequence: float64
- name: cos_scores
sequence: float64
- name: target
sequence: int64
splits:
- name: train
num_bytes: 855028616
num_examples: 100
download_size: 646642321
dataset_size: 855028616
- config_name: multilingual-e5-small
features:
- name: query
dtype: string
- name: docs
sequence: string
- name: query_enc
sequence: float64
- name: docs_enc
sequence:
sequence: float64
- name: cos_scores
sequence: float64
- name: target
sequence: int64
splits:
- name: train
num_bytes: 342383496
num_examples: 100
download_size: 252619651
dataset_size: 342383496
- config_name: stella-mrl-large-zh-v3.5-1792d
features:
- name: query
dtype: string
- name: docs
sequence: string
- name: query_enc
sequence: float64
- name: docs_enc
sequence:
sequence: float64
- name: cos_scores
sequence: float64
- name: target
sequence: int64
splits:
- name: train
num_bytes: 1470202760
num_examples: 100
download_size: 1116553570
dataset_size: 1470202760
configs:
- config_name: acge_text_embedding
data_files:
- split: train
path: acge_text_embedding/train-*
- config_name: gte-large-zh
data_files:
- split: train
path: gte-large-zh/train-*
- config_name: multilingual-e5-base
data_files:
- split: train
path: multilingual-e5-base/train-*
- config_name: multilingual-e5-large
data_files:
- split: train
path: multilingual-e5-large/train-*
- config_name: multilingual-e5-small
data_files:
- split: train
path: multilingual-e5-small/train-*
- config_name: stella-mrl-large-zh-v3.5-1792d
data_files:
- split: train
path: stella-mrl-large-zh-v3.5-1792d/train-*
---
提供机构:
hgissbkh
原始信息汇总
数据集概述
1. acge_text_embedding
- 特征:
- query: string
- docs: sequence of string
- query_enc: sequence of float64
- docs_enc: sequence of float64
- cos_scores: sequence of float64
- target: sequence of int64
- 分割:
- train: 100 examples, 1470202760 bytes
- 下载大小: 1116166126 bytes
- 数据集大小: 1470202760 bytes
2. gte-large-zh
- 特征:
- query: string
- docs: sequence of string
- query_enc: sequence of float64
- docs_enc: sequence of float64
- cos_scores: sequence of float64
- target: sequence of int64
- 分割:
- train: 100 examples, 855028616 bytes
- 下载大小: 648073470 bytes
- 数据集大小: 855028616 bytes
3. multilingual-e5-base
- 特征:
- query: string
- docs: sequence of string
- query_enc: sequence of float64
- docs_enc: sequence of float64
- cos_scores: sequence of float64
- target: sequence of int64
- 分割:
- train: 100 examples, 649970568 bytes
- 下载大小: 490095378 bytes
- 数据集大小: 649970568 bytes
4. multilingual-e5-large
- 特征:
- query: string
- docs: sequence of string
- query_enc: sequence of float64
- docs_enc: sequence of float64
- cos_scores: sequence of float64
- target: sequence of int64
- 分割:
- train: 100 examples, 855028616 bytes
- 下载大小: 646642321 bytes
- 数据集大小: 855028616 bytes
5. multilingual-e5-small
- 特征:
- query: string
- docs: sequence of string
- query_enc: sequence of float64
- docs_enc: sequence of float64
- cos_scores: sequence of float64
- target: sequence of int64
- 分割:
- train: 100 examples, 342383496 bytes
- 下载大小: 252619651 bytes
- 数据集大小: 342383496 bytes
6. stella-mrl-large-zh-v3.5-1792d
- 特征:
- query: string
- docs: sequence of string
- query_enc: sequence of float64
- docs_enc: sequence of float64
- cos_scores: sequence of float64
- target: sequence of int64
- 分割:
- train: 100 examples, 1470202760 bytes
- 下载大小: 1116553570 bytes
- 数据集大小: 1470202760 bytes



