five

fine-tuned/jinaai_jina-embeddings-v2-base-en-scientific-papers-from-arxiv

收藏
Hugging Face2024-05-10 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/fine-tuned/jinaai_jina-embeddings-v2-base-en-scientific-papers-from-arxiv
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 task_categories: - feature-extraction - sentence-similarity language: - en tags: - sentence-transformers - feature-extraction - sentence-similarity - mteb - Science - Research - Academic - Papers - Arxiv pretty_name: academic research papers search engine size_categories: - n<1K --- # jinaai_jina-embeddings-v2-base-en-scientific-papers-from-arxiv Dataset ## Dataset Description The dataset "academic research papers search engine" is a generated dataset designed to support the development of domain specific embedding models for retrieval tasks. ## Associated Model This dataset was used to train the [**jinaai_jina-embeddings-v2-base-en-scientific-papers-from-arxiv**](https://huggingface.co/fine-tuned/jinaai_jina-embeddings-v2-base-en-scientific-papers-from-arxiv) model. ## How to Use To use this dataset for model training or evaluation, you can load it using the Hugging Face `datasets` library as follows: ```python from datasets import load_dataset dataset = load_dataset("fine-tuned/jinaai_jina-embeddings-v2-base-en-scientific-papers-from-arxiv") print(dataset['test'][0]) ```
提供机构:
fine-tuned
原始信息汇总

数据集概述

数据集名称

  • 名称: academic research papers search engine
  • 完整名称: jinaai_jina-embeddings-v2-base-en-scientific-papers-from-arxiv

数据集描述

  • 目的: 支持开发特定领域的嵌入模型,用于检索任务。

数据集特征

  • 语言: 英语 (en)
  • 任务类别:
    • 特征提取 (feature-extraction)
    • 句子相似度 (sentence-similarity)
  • 标签:
    • sentence-transformers
    • feature-extraction
    • sentence-similarity
    • mteb
    • Science
    • Research
    • Academic
    • Papers
    • Arxiv
  • 大小类别: n<1K

关联模型

使用方法

  • 加载方式: 使用Hugging Face的datasets库加载数据集,示例代码如下: python from datasets import load_dataset

    dataset = load_dataset("fine-tuned/jinaai_jina-embeddings-v2-base-en-scientific-papers-from-arxiv") print(dataset[test][0])

二维码
社区交流群
二维码
科研交流群
商业服务