fine-tuned/NFCorpus-256-24-gpt-4o-2024-05-13-546049
收藏Hugging Face2024-05-23 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/fine-tuned/NFCorpus-256-24-gpt-4o-2024-05-13-546049
下载链接
链接失效反馈官方服务:
资源简介:
数据集“medical information retrieval”是一个生成的数据集,旨在支持特定领域嵌入模型的开发,用于检索任务。
The dataset medical information retrieval is designed to support the development of domain-specific embedding models for retrieval tasks, particularly in the medical and nutrition domains. It includes queries, documents, and relevance information. The dataset is categorized under feature extraction, sentence similarity, and is tagged with sentence-transformers, mteb, Medical, Nutrition, and Relevance. The dataset size is less than 1K. The associated model trained using this dataset is named NFCorpus-256-24-gpt-4o-2024-05-13-546049.
提供机构:
fine-tuned
原始信息汇总
数据集概述
数据集名称
- 名称: medical information retrieval
数据集描述
- 描述: 该数据集是一个生成的数据集,旨在支持特定领域嵌入模型的发展,用于检索任务。
数据集用途
- 用途: 用于模型训练或评估。
数据集使用方法
-
使用方法: 通过Hugging Face
datasets库加载数据集,示例代码如下: python from datasets import load_datasetdataset = load_dataset("fine-tuned/NFCorpus-256-24-gpt-4o-2024-05-13-546049") print(dataset[test][0])
数据集特征
- 语言: 英语 (
en) - 任务类别:
- 特征提取 (
feature-extraction) - 句子相似度 (
sentence-similarity)
- 特征提取 (
- 标签:
- sentence-transformers
- feature-extraction
- sentence-similarity
- mteb
- Medical
- Nutrition
- Queries
- Documents
- Relevance
- 大小类别: n<1K
许可证
- 许可证: Apache-2.0



