fine-tuned/SCIDOCS-32000-384-gpt-4o-2024-05-13-78042877
收藏Hugging Face2024-05-29 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/fine-tuned/SCIDOCS-32000-384-gpt-4o-2024-05-13-78042877
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
task_categories:
- feature-extraction
- sentence-similarity
language:
- en
tags:
- sentence-transformers
- feature-extraction
- sentence-similarity
- mteb
- Research
- Academic
- Papers
- Studies
- Publications
pretty_name: arxiv paper domain
size_categories:
- n<1K
---
# SCIDOCS-32000-384-gpt-4o-2024-05-13-78042877 Dataset
## Dataset Description
The dataset "arxiv paper domain" is a generated dataset designed to support the development of domain specific embedding models for retrieval tasks.
## Associated Model
This dataset was used to train the [**SCIDOCS-32000-384-gpt-4o-2024-05-13-78042877**](https://huggingface.co/fine-tuned/SCIDOCS-32000-384-gpt-4o-2024-05-13-78042877) model.
## How to Use
To use this dataset for model training or evaluation, you can load it using the Hugging Face `datasets` library as follows:
```python
from datasets import load_dataset
dataset = load_dataset("fine-tuned/SCIDOCS-32000-384-gpt-4o-2024-05-13-78042877")
print(dataset['test'][0])
```
The dataset arxiv paper domain is a generated dataset designed to support the development of domain specific embedding models for retrieval tasks. This dataset is associated with the [**SCIDOCS-32000-384-gpt-4o-2024-05-13-78042877**](https://huggingface.co/fine-tuned/SCIDOCS-32000-384-gpt-4o-2024-05-13-78042877) model and can be loaded and used via the Hugging Face `datasets` library.
提供机构:
fine-tuned
原始信息汇总
SCIDOCS-32000-384-gpt-4o-2024-05-13-78042877 Dataset
数据集描述
- 名称: arxiv paper domain
- 目的: 支持开发特定领域嵌入模型,用于检索任务。
相关模型
- 训练模型: SCIDOCS-32000-384-gpt-4o-2024-05-13-78042877
使用方法
- 加载数据集: 使用Hugging Face
datasets库加载数据集,示例代码如下: python from datasets import load_dataset dataset = load_dataset("fine-tuned/SCIDOCS-32000-384-gpt-4o-2024-05-13-78042877") print(dataset[test][0])



