sivan22/yalkut-yosef-embeddings
Source: Hugging Face · Updated 2024-05-26 · Indexed 2024-06-12
Download link:
https://hf-mirror.com/datasets/sivan22/yalkut-yosef-embeddings
Resource description:
---
language:
- he
dataset_info:
  features:
  - name: 'Unnamed: 0'
    dtype: int64
  - name: bookname
    dtype: string
  - name: topic
    dtype: string
  - name: siman
    dtype: string
  - name: sek
    dtype: string
  - name: text
    dtype: string
  - name: embeddings
    sequence: float64
  splits:
  - name: train
    num_bytes: 83344888
    num_examples: 9299
  download_size: 64144813
  dataset_size: 83344888
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
---
Each record holds a book name (bookname), topic, chapter (siman), section (sek), text content (text), and an embedding vector (embeddings, a sequence of float64 values), plus an int64 column named Unnamed: 0. The dataset has a single train split of 9,299 examples, and its text is in Hebrew (he).
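Since every record carries a precomputed embedding next to its text, one natural use is similarity search. The following is a minimal sketch, not the dataset author's method: it ranks records by cosine similarity to a query vector using only the standard library. The helper names `cosine_similarity` and `top_k` are hypothetical, and the rows are made-up placeholders that only mimic the column names from the schema above. The card does not say which model produced the embeddings or how many dimensions they have, so the 3-dimensional vectors here are purely illustrative.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length sequences of floats."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def top_k(rows, query_vector, k=3):
    """Return (score, row) pairs for the k rows closest to query_vector."""
    scored = [
        (cosine_similarity(row["embeddings"], query_vector), row)
        for row in rows
    ]
    # Sort on the score only, so rows themselves are never compared.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


# Placeholder rows (NOT real dataset content); only the keys follow the schema.
toy_rows = [
    {"bookname": "book-A", "topic": "topic-1", "siman": "1", "sek": "1",
     "text": "placeholder text 1", "embeddings": [1.0, 0.0, 0.0]},
    {"bookname": "book-A", "topic": "topic-1", "siman": "1", "sek": "2",
     "text": "placeholder text 2", "embeddings": [0.0, 1.0, 0.0]},
    {"bookname": "book-B", "topic": "topic-2", "siman": "5", "sek": "1",
     "text": "placeholder text 3", "embeddings": [0.9, 0.1, 0.0]},
]

results = top_k(toy_rows, query_vector=[1.0, 0.0, 0.0], k=2)
for score, row in results:
    print(f"{score:.3f}  {row['bookname']} {row['siman']}:{row['sek']}  {row['text']}")
```

With the Hugging Face `datasets` library installed, the real rows could presumably be obtained with `load_dataset("sivan22/yalkut-yosef-embeddings", split="train")` and passed to `top_k` in place of `toy_rows`. The query vector would have to come from the same embedding model as the stored vectors, which this page does not identify.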
Provider:
sivan22
Original information summary
Dataset overview
Language
- Dataset language: Hebrew (he)
Dataset information
Features
- Unnamed: 0: int64
- bookname: string
- topic: string
- siman: string
- sek: string
- text: string
- embeddings: sequence of float64
Splits
- train:
  - Bytes: 83344888
  - Examples: 9299
Data size
- Download size: 64144813
- Dataset size: 83344888
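The size figures above can be turned into two quick sanity checks: the average size of one example and the ratio of download size to dataset size. This is a small sketch that assumes the numbers are byte counts, as is conventional for Hugging Face dataset metadata. The variable names are illustrative.

```python
# Figures copied from the dataset metadata (assumed to be in bytes).
NUM_BYTES = 83344888       # num_bytes / dataset_size of the train split
NUM_EXAMPLES = 9299        # num_examples of the train split
DOWNLOAD_SIZE = 64144813   # download_size

# Average footprint of a single example.
bytes_per_example = NUM_BYTES / NUM_EXAMPLES

# Fraction of the dataset size that actually has to be downloaded.
download_ratio = DOWNLOAD_SIZE / NUM_BYTES

print(f"average bytes per example: {bytes_per_example:.0f}")
print(f"download size / dataset size: {download_ratio:.1%}")
```

Each example averages a little under 9 KB, and the download is about 77% of the stated dataset size.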
Configuration
- config_name: default
- data_files:
  - split: train
  - path: data/train-*



