fine-tuned/NFCorpus-256-24-gpt-4o-2024-05-13-396610
Source: Hugging Face. Updated: 2024-05-23. Indexed: 2024-06-12.
Download link:
https://hf-mirror.com/datasets/fine-tuned/NFCorpus-256-24-gpt-4o-2024-05-13-396610
Resource overview:
The medical information retrieval dataset is a generated dataset designed to support the development of domain-specific embedding models for retrieval tasks. It is associated with a specific model and can be used for training or evaluation through the Hugging Face `datasets` library. Its features relate to medical and nutrition topics, queries, documents, and relevance, and it is categorized under the feature-extraction and sentence-similarity tasks.
Provider:
fine-tuned
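Summary of original information
NFCorpus-256-24-gpt-4o-2024-05-13-396610 Dataset Overview
Dataset description
- Name: medical information retrieval
- Purpose: support the development of domain-specific embedding models for retrieval tasks.
Language and categories
- Language: English (en)
- Task categories:
  - Feature extraction (feature-extraction)
  - Sentence similarity (sentence-similarity)
Tags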
- sentence-transformers
- feature-extraction
- sentence-similarity
- mteb
- Medical
- Nutrition
- Queries
- Documents
- Relevance
Dataset size
- Size: fewer than 1,000 records (n<1K)
Related model
Usage
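- Loading: use the Hugging Face `datasets` library to load the dataset.

```python
from datasets import load_dataset

# Download the dataset from the Hugging Face Hub (or reuse the local cache)
dataset = load_dataset("fine-tuned/NFCorpus-256-24-gpt-4o-2024-05-13-396610")

# Print the first record of the "test" split
print(dataset["test"][0])
```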



