fine-tuned/SciFact-32000-384-gpt-4o-2024-05-13-83349675
收藏Hugging Face2024-05-29 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/fine-tuned/SciFact-32000-384-gpt-4o-2024-05-13-83349675
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
task_categories:
- feature-extraction
- sentence-similarity
language:
- en
tags:
- sentence-transformers
- feature-extraction
- sentence-similarity
- mteb
- Health
- Medicine
- Treatment
- Diagnosis
- Research
pretty_name: medical domain
size_categories:
- n<1K
---
# SciFact-32000-384-gpt-4o-2024-05-13-83349675 Dataset
## Dataset Description
The dataset "medical domain" is a generated dataset designed to support the development of domain specific embedding models for retrieval tasks.
## Associated Model
This dataset was used to train the [**SciFact-32000-384-gpt-4o-2024-05-13-83349675**](https://huggingface.co/fine-tuned/SciFact-32000-384-gpt-4o-2024-05-13-83349675) model.
## How to Use
To use this dataset for model training or evaluation, you can load it using the Hugging Face `datasets` library as follows:
```python
from datasets import load_dataset
dataset = load_dataset("fine-tuned/SciFact-32000-384-gpt-4o-2024-05-13-83349675")
print(dataset['test'][0])
```
提供机构:
fine-tuned
原始信息汇总
数据集概述
基本信息
- 许可证: Apache-2.0
- 任务类别:
- 特征提取
- 句子相似度
- 语言: 英语
- 标签:
- sentence-transformers
- 特征提取
- 句子相似度
- mteb
- 健康
- 医学
- 治疗
- 诊断
- 研究
- 美观名称: 医疗领域
- 大小类别: n<1K
数据集描述
"医疗领域"数据集是一个生成的数据集,旨在支持特定领域嵌入模型的发展,用于检索任务。
关联模型
该数据集用于训练SciFact-32000-384-gpt-4o-2024-05-13-83349675模型。
使用方法
使用此数据集进行模型训练或评估,可以通过Hugging Face的datasets库加载:
python
from datasets import load_dataset
dataset = load_dataset("fine-tuned/SciFact-32000-384-gpt-4o-2024-05-13-83349675") print(dataset[test][0])



