SaeedRahmani/persian_poems_stanzas_vocabbase
收藏Hugging Face2024-03-06 更新2024-06-22 收录
下载链接:
https://hf-mirror.com/datasets/SaeedRahmani/persian_poems_stanzas_vocabbase
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: text
dtype: string
splits:
- name: vocab_size_100
num_bytes: 38861
num_examples: 2326
- name: vocab_size_500
num_bytes: 1040187
num_examples: 22466
- name: vocab_size_1000
num_bytes: 3715139
num_examples: 72944
- name: vocab_size_2000
num_bytes: 9926663
num_examples: 188120
- name: vocab_size_5000
num_bytes: 24740219
num_examples: 460281
- name: vocab_size_10000
num_bytes: 37788996
num_examples: 699075
- name: vocab_size_20000
num_bytes: 49308928
num_examples: 908630
- name: vocab_size_50000
num_bytes: 60453788
num_examples: 1109769
- name: vocab_size_100000
num_bytes: 66270656
num_examples: 1213977
- name: vocab_size_150000
num_bytes: 68900773
num_examples: 1259552
- name: vocab_size_200000
num_bytes: 71330036
num_examples: 1301730
- name: vocab_size_264362
num_bytes: 74771010
num_examples: 1360129
download_size: 246876501
dataset_size: 468285256
configs:
- config_name: default
data_files:
- split: vocab_size_100
path: data/vocab_size_100-*
- split: vocab_size_500
path: data/vocab_size_500-*
- split: vocab_size_1000
path: data/vocab_size_1000-*
- split: vocab_size_2000
path: data/vocab_size_2000-*
- split: vocab_size_5000
path: data/vocab_size_5000-*
- split: vocab_size_10000
path: data/vocab_size_10000-*
- split: vocab_size_20000
path: data/vocab_size_20000-*
- split: vocab_size_50000
path: data/vocab_size_50000-*
- split: vocab_size_100000
path: data/vocab_size_100000-*
- split: vocab_size_150000
path: data/vocab_size_150000-*
- split: vocab_size_200000
path: data/vocab_size_200000-*
- split: vocab_size_264362
path: data/vocab_size_264362-*
---
提供机构:
SaeedRahmani
原始信息汇总
数据集概述
特征
- 名称: text
- 数据类型: string
数据分割
- vocab_size_100
- 字节数: 38861
- 样本数: 2326
- vocab_size_500
- 字节数: 1040187
- 样本数: 22466
- vocab_size_1000
- 字节数: 3715139
- 样本数: 72944
- vocab_size_2000
- 字节数: 9926663
- 样本数: 188120
- vocab_size_5000
- 字节数: 24740219
- 样本数: 460281
- vocab_size_10000
- 字节数: 37788996
- 样本数: 699075
- vocab_size_20000
- 字节数: 49308928
- 样本数: 908630
- vocab_size_50000
- 字节数: 60453788
- 样本数: 1109769
- vocab_size_100000
- 字节数: 66270656
- 样本数: 1213977
- vocab_size_150000
- 字节数: 68900773
- 样本数: 1259552
- vocab_size_200000
- 字节数: 71330036
- 样本数: 1301730
- vocab_size_264362
- 字节数: 74771010
- 样本数: 1360129
数据集大小
- 下载大小: 246876501 字节
- 数据集大小: 468285256 字节
配置
- 配置名称: default
- 数据文件:
- vocab_size_100: data/vocab_size_100-*
- vocab_size_500: data/vocab_size_500-*
- vocab_size_1000: data/vocab_size_1000-*
- vocab_size_2000: data/vocab_size_2000-*
- vocab_size_5000: data/vocab_size_5000-*
- vocab_size_10000: data/vocab_size_10000-*
- vocab_size_20000: data/vocab_size_20000-*
- vocab_size_50000: data/vocab_size_50000-*
- vocab_size_100000: data/vocab_size_100000-*
- vocab_size_150000: data/vocab_size_150000-*
- vocab_size_200000: data/vocab_size_200000-*
- vocab_size_264362: data/vocab_size_264362-*
- 数据文件:



