Supabase/dbpedia-openai-3-large-1M
收藏Hugging Face2024-02-06 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Supabase/dbpedia-openai-3-large-1M
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
dataset_info:
features:
- name: _id
dtype: string
- name: title
dtype: string
- name: text
dtype: string
- name: embedding
sequence: float32
splits:
- name: train
num_bytes: 17782586772
num_examples: 1000000
download_size: 17782586772
dataset_size: 1000000
language:
- en
pretty_name: OpenAI text-embedding-3-large with 1M DBPedia Entities
size_categories:
- 1M<n<10M
---
1 million OpenAI Embeddings - 3072 dimensions
Created: February 2024.
Text used for Embedding: title (string) + text (string)
Embedding Model: text-embedding-3-large
## Credits:
This dataset was generated from the first 1M entries of https://huggingface.co/datasets/BeIR/dbpedia-entity
提供机构:
Supabase
原始信息汇总
数据集概述
数据集信息
- 特征:
_id: 字符串类型title: 字符串类型text: 字符串类型embedding: 浮点数序列类型
- 分割:
train: 包含1,000,000个样本,总大小为17,782,586,772字节
- 下载大小: 17,782,586,772字节
- 数据集大小: 1,000,000个样本
- 语言: 英语
- 名称: OpenAI text-embedding-3-large with 1M DBPedia Entities
- 大小类别: 1M<n<10M
其他信息
- 创建日期: 2024年2月
- 用于嵌入的文本:
title(字符串) +text(字符串) - 嵌入模型: text-embedding-3-large
- 数据来源: 从https://huggingface.co/datasets/BeIR/dbpedia-entity的前1,000,000条记录生成



