ivanleomk/wikipedia-embeddings-trial
Source: Hugging Face. Updated 2023-12-05; indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/ivanleomk/wikipedia-embeddings-trial
Resource description:
---
dataset_info:
  features:
  - name: text
    dtype: string
  - name: embedding
    sequence: float64
  splits:
  - name: train
    num_bytes: 41921160
    num_examples: 6400
  download_size: 42703209
  dataset_size: 41921160
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
---
This dataset pairs each text entry with its embedding. The text feature is a string, and the embedding feature is a sequence of float64 values. The dataset has a single split, train, with 6400 examples totaling 41921160 bytes; the download size is 42703209 bytes. The training data files match the path pattern data/train-*.
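As a minimal sketch of how the schema described above might be used, the snippet below defines a check that a row has a string `text` and a float-valued `embedding`, plus a loader built on `load_dataset` from the `datasets` package. The helper names `row_matches_schema` and `load_train_split` are hypothetical, and the loader assumes the `datasets` package is installed and the repository is reachable over the network.

```python
REPO_ID = "ivanleomk/wikipedia-embeddings-trial"
EXPECTED_TRAIN_ROWS = 6400  # num_examples for the train split, from the card


def row_matches_schema(row):
    """Return True if the row has a string `text` and a list of floats `embedding`."""
    text = row.get("text")
    embedding = row.get("embedding")
    if not isinstance(text, str):
        return False
    if not isinstance(embedding, (list, tuple)):
        return False
    return all(isinstance(value, float) for value in embedding)


def load_train_split(repo_id=REPO_ID):
    """Load the train split (hypothetical helper; needs `datasets` and network access)."""
    # Imported lazily so the schema check above works without the package.
    from datasets import load_dataset
    return load_dataset(repo_id, split="train")


# Usage (requires network access):
#   train = load_train_split()
#   assert len(train) == EXPECTED_TRAIN_ROWS
#   assert row_matches_schema(train[0])
```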
Provided by:
ivanleomk
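Summary of original information
Dataset overview
Data features
- Name: text
- Data type: string
- Name: embedding
- Sequence type: float64
Data splits
- Name: train
- Bytes: 41921160
- Examples: 6400
Dataset size
- Download size: 42703209 bytes
- Actual size: 41921160 bytes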
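The figures above allow a quick back-of-the-envelope check. The sketch below uses only the numbers stated in the card to work out the average size of one example and the gap between the download size and the actual size. The variable names are illustrative.

```python
NUM_BYTES = 41921160      # actual (dataset) size in bytes, from the card
NUM_EXAMPLES = 6400       # examples in the train split
DOWNLOAD_SIZE = 42703209  # download size in bytes

# Average size of one (text, embedding) example.
avg_bytes_per_example = NUM_BYTES / NUM_EXAMPLES

# How much larger the download is than the reported dataset size.
download_overhead = DOWNLOAD_SIZE - NUM_BYTES

print(f"{avg_bytes_per_example:.2f}")  # 6550.18
print(download_overhead)               # 782049
```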
Configuration
- Config name: default
- Data files:
  - Split: train
  - Path: data/train-*
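To illustrate how a pattern such as data/train-* selects files, here is a small sketch using Python's standard `fnmatch` module. The file names are hypothetical and not taken from the repository, and `fnmatch` only approximates whatever pattern resolution the Hub applies.

```python
from fnmatch import fnmatch

PATTERN = "data/train-*"  # path pattern from the default config

# Hypothetical file names, for illustration only.
candidates = [
    "data/train-00000-of-00001.parquet",
    "data/test-00000-of-00001.parquet",
    "README.md",
]

# Keep only the names that match the train pattern.
matched = [name for name in candidates if fnmatch(name, PATTERN)]
print(matched)  # ['data/train-00000-of-00001.parquet']
```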



