fine-tuned/jinaai_jina-embeddings-v2-base-en-scientific-papers-from-arxiv
收藏Hugging Face2024-05-10 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/fine-tuned/jinaai_jina-embeddings-v2-base-en-scientific-papers-from-arxiv
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
task_categories:
- feature-extraction
- sentence-similarity
language:
- en
tags:
- sentence-transformers
- feature-extraction
- sentence-similarity
- mteb
- Science
- Research
- Academic
- Papers
- Arxiv
pretty_name: academic research papers search engine
size_categories:
- n<1K
---
# jinaai_jina-embeddings-v2-base-en-scientific-papers-from-arxiv Dataset
## Dataset Description
The dataset "academic research papers search engine" is a generated dataset designed to support the development of domain specific embedding models for retrieval tasks.
## Associated Model
This dataset was used to train the [**jinaai_jina-embeddings-v2-base-en-scientific-papers-from-arxiv**](https://huggingface.co/fine-tuned/jinaai_jina-embeddings-v2-base-en-scientific-papers-from-arxiv) model.
## How to Use
To use this dataset for model training or evaluation, you can load it using the Hugging Face `datasets` library as follows:
```python
from datasets import load_dataset
dataset = load_dataset("fine-tuned/jinaai_jina-embeddings-v2-base-en-scientific-papers-from-arxiv")
print(dataset['test'][0])
```
提供机构:
fine-tuned
原始信息汇总
数据集概述
数据集名称
- 名称: academic research papers search engine
- 完整名称: jinaai_jina-embeddings-v2-base-en-scientific-papers-from-arxiv
数据集描述
- 目的: 支持开发特定领域的嵌入模型,用于检索任务。
数据集特征
- 语言: 英语 (en)
- 任务类别:
- 特征提取 (feature-extraction)
- 句子相似度 (sentence-similarity)
- 标签:
- sentence-transformers
- feature-extraction
- sentence-similarity
- mteb
- Science
- Research
- Academic
- Papers
- Arxiv
- 大小类别: n<1K
关联模型
- 模型名称: jinaai_jina-embeddings-v2-base-en-scientific-papers-from-arxiv
- 模型链接: jinaai_jina-embeddings-v2-base-en-scientific-papers-from-arxiv
使用方法
-
加载方式: 使用Hugging Face的
datasets库加载数据集,示例代码如下: python from datasets import load_datasetdataset = load_dataset("fine-tuned/jinaai_jina-embeddings-v2-base-en-scientific-papers-from-arxiv") print(dataset[test][0])



