andreribeiro87/mmarco-more-hard-negatives
收藏Hugging Face2025-12-18 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/andreribeiro87/mmarco-more-hard-negatives
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc
task_categories:
- question-answering
language:
- pt
pretty_name: mmarco-with-hard-negatives
size_categories:
- 100M<n<1B
---
# mMarco with more hard negatives
At least 5 hard negatives per each pair (query, answer) on training set
On eval set 30 hard negatives for each pair (query, answer).
Here it is a Dataset mined to finetune a reranker.
Feel free to resplit the dataset.
Who else have a lot of gpu's and a cpu with over 128 cores take a look on this :)
## Model Card Authors
André Ribeiro [@andreribeiro87](https://huggingface.co/andreribeiro87)
Rúben Garrido [@RGarrido03](https://huggingface.co/RGarrido03)
提供机构:
andreribeiro87



