
fine-tuned/jinaai_jina-embeddings-v2-base-code-stackoverflow

Source: Hugging Face | Updated: 2024-05-13 | Indexed: 2024-06-12
Download link:
https://hf-mirror.com/datasets/fine-tuned/jinaai_jina-embeddings-v2-base-code-stackoverflow
Resource description:
---
license: apache-2.0
task_categories:
- feature-extraction
- sentence-similarity
language:
- en
tags:
- sentence-transformers
- feature-extraction
- sentence-similarity
- mteb
- Programming
- Development
- Code
- Software
- Technology
pretty_name: code snippet search engine
size_categories:
- n<1K
---

# jinaai_jina-embeddings-v2-base-code-stackoverflow Dataset

## Dataset Description

The dataset "code snippet search engine" is a generated dataset designed to support the development of domain specific embedding models for retrieval tasks.

## Associated Model

This dataset was used to train the [**jinaai_jina-embeddings-v2-base-code-stackoverflow**](https://huggingface.co/fine-tuned/jinaai_jina-embeddings-v2-base-code-stackoverflow) model.

## How to Use

To use this dataset for model training or evaluation, you can load it using the Hugging Face `datasets` library as follows:

```python
from datasets import load_dataset

dataset = load_dataset("fine-tuned/jinaai_jina-embeddings-v2-base-code-stackoverflow")
print(dataset['test'][0])
```
Provider:
fine-tuned
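Summary of original information

jinaai_jina-embeddings-v2-base-code-stackoverflow Dataset Overview

Dataset Description

  • Name: code snippet search engine
  • Purpose: supports the development of domain-specific embedding models, primarily for retrieval tasks.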
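
To make the retrieval use case concrete, here is a minimal sketch of ranking code snippets against a query by embedding similarity. Cosine similarity is a common choice for this and is an assumption here, since the dataset card does not name a scoring function. The two-dimensional vectors and the `snippet_*` identifiers are hypothetical stand-ins for real model embeddings.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


def rank(query_vec: Sequence[float],
         corpus: Dict[str, Sequence[float]]) -> List[Tuple[str, float]]:
    """Return (doc_id, score) pairs sorted from most to least similar."""
    scored = [(doc_id, cosine_similarity(query_vec, vec))
              for doc_id, vec in corpus.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical toy embeddings; a real setup would obtain these from an
# embedding model rather than writing them by hand.
corpus = {
    "snippet_a": [1.0, 0.0],
    "snippet_b": [0.0, 1.0],
    "snippet_c": [1.0, 1.0],
}
query = [1.0, 0.1]

for doc_id, score in rank(query, corpus):
    print(f"{doc_id}: {score:.3f}")
```

In practice the vectors would have hundreds of dimensions and the ranking would usually be delegated to a vector index, but the scoring idea is the same.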

Dataset Features

  • Language: English (en)
  • Task categories:
    • Feature extraction (feature-extraction)
    • Sentence similarity (sentence-similarity)
  • Tags:
    • sentence-transformers
    • feature-extraction
    • sentence-similarity
    • mteb
    • Programming
    • Development
    • Code
    • Software
    • Technology
  • Size category: fewer than 1K (n<1K)

Usage

  • Load the dataset with the Hugging Face datasets library (Python):

      from datasets import load_dataset

      dataset = load_dataset("fine-tuned/jinaai_jina-embeddings-v2-base-code-stackoverflow")
      print(dataset['test'][0])
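
The loading snippet indexes a split named `test`. As a hedged sketch, the helper below reads the first row while tolerating a missing split; `first_example` is a hypothetical name, not part of the `datasets` library. It accepts any mapping from split names to row sequences, so it can be tried on a plain dictionary without downloading anything. The column names in the toy data are invented for illustration and may not match the real dataset.

```python
from typing import Any, Mapping, Sequence, Tuple


def first_example(splits: Mapping[str, Sequence[Any]],
                  preferred: str = "test") -> Tuple[str, Any]:
    """Return (split_name, first_row), preferring `preferred` when present."""
    if not splits:
        raise ValueError("no splits available")
    # Fall back to the first available split if the preferred one is missing.
    name = preferred if preferred in splits else next(iter(splits))
    rows = splits[name]
    if len(rows) == 0:
        raise ValueError(f"split {name!r} is empty")
    return name, rows[0]


# Stand-in data with hypothetical column names; the object returned by
# load_dataset(...) is also a mapping of split names and can be passed
# in the same way.
toy = {"train": [{"query": "reverse a list", "pos": "xs[::-1]"}]}
print(first_example(toy))
```

Checking which splits exist before indexing avoids a `KeyError` when a dataset ships only some of the usual splits.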

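Related Models

  • fine-tuned/jinaai_jina-embeddings-v2-base-code-stackoverflow (https://huggingface.co/fine-tuned/jinaai_jina-embeddings-v2-base-code-stackoverflow): the model this dataset was used to train.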
