illuin-conteb/covid-qa

Name: illuin-conteb/covid-qa
Creator: illuin-conteb
Published: 2025-06-02 10:45:09
License: 暂无描述

Hugging Face2025-06-02 更新2025-10-18 收录

下载链接：

https://hf-mirror.com/datasets/illuin-conteb/covid-qa

下载链接

链接失效反馈

官方服务：

资源简介：

ConTEB数据集是ConTEB（上下文感知文本嵌入基准）的一部分，旨在评估上下文嵌入模型的能力。该数据集关注健康护理主题，特别是源自关于COVID-19疫情的文章。数据集基于COVID-QA数据集构建，包含经过精心挑选的原始文档、从这些文档中提取的片段以及查询。共有115个文档，3351个片段和1111个查询，平均每个文档有153.9个标记。

ConTEB dataset is part of ConTEB (Context-aware Text Embedding Benchmark), designed to evaluate the capabilities of contextual embedding models. The dataset focuses on the theme of Healthcare, particularly from articles about the COVID-19 pandemic. It is built upon the COVID-QA dataset and includes a curated set of original documents, chunks derived from them, and queries. There are 115 documents, 3351 chunks, and 1111 queries, with an average of 153.9 tokens per document.

提供机构：

illuin-conteb

5,000+

优质数据集

54 个

任务类型

进入经典数据集