five

illuin-conteb/covid-qa

收藏
Hugging Face2025-06-02 更新2025-10-18 收录
下载链接:
https://hf-mirror.com/datasets/illuin-conteb/covid-qa
下载链接
链接失效反馈
官方服务:
资源简介:
ConTEB数据集是ConTEB(上下文感知文本嵌入基准)的一部分,旨在评估上下文嵌入模型的能力。该数据集关注健康护理主题,特别是源自关于COVID-19疫情的文章。数据集基于COVID-QA数据集构建,包含经过精心挑选的原始文档、从这些文档中提取的片段以及查询。共有115个文档,3351个片段和1111个查询,平均每个文档有153.9个标记。

ConTEB dataset is part of ConTEB (Context-aware Text Embedding Benchmark), designed to evaluate the capabilities of contextual embedding models. The dataset focuses on the theme of Healthcare, particularly from articles about the COVID-19 pandemic. It is built upon the COVID-QA dataset and includes a curated set of original documents, chunks derived from them, and queries. There are 115 documents, 3351 chunks, and 1111 queries, with an average of 153.9 tokens per document.
提供机构:
illuin-conteb
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作