five

fine-tuned/scidocs-c

收藏
Hugging Face2024-05-14 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/fine-tuned/scidocs-c
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 task_categories: - feature-extraction - sentence-similarity language: - en tags: - sentence-transformers - feature-extraction - sentence-similarity - mteb - Science - Research - Academic - Papers - Arxiv pretty_name: academic research papers search engine size_categories: - n<1K --- # scidocs-c Dataset ## Dataset Description The dataset "academic research papers search engine" is a generated dataset designed to support the development of domain specific embedding models for retrieval tasks. ## Associated Model This dataset was used to train the [**scidocs-c**](https://huggingface.co/fine-tuned/scidocs-c) model. ## How to Use To use this dataset for model training or evaluation, you can load it using the Hugging Face `datasets` library as follows: ```python from datasets import load_dataset dataset = load_dataset("fine-tuned/scidocs-c") print(dataset['test'][0]) ```
提供机构:
fine-tuned
原始信息汇总

scidocs-c Dataset 概述

数据集描述

  • 名称: academic research papers search engine
  • 目的: 支持特定领域嵌入模型的开发,用于检索任务。

语言和类别

  • 语言: 英语 (en)
  • 任务类别:
    • 特征提取 (feature-extraction)
    • 句子相似度 (sentence-similarity)

相关模型

  • 训练模型: scidocs-c
  • 模型链接: scidocs-c

使用方法

  • 使用 Hugging Face datasets 库加载数据集: python from datasets import load_dataset dataset = load_dataset("fine-tuned/scidocs-c") print(dataset[test][0])

许可

  • 许可证: Apache-2.0
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作