Team-PIXEL/PIXELSum_zh_wiki_for_TA
收藏Hugging Face2024-01-21 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Team-PIXEL/PIXELSum_zh_wiki_for_TA
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
dataset_info:
features:
- name: text
struct:
- name: bytes
dtype: binary
- name: path
dtype: 'null'
- name: target
dtype: string
- name: num_text_patches
dtype: int64
splits:
- name: train
num_bytes: 103154872722
num_examples: 2555904
download_size: 102774842417
dataset_size: 103154872722
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
The dataset includes features such as text (containing bytes and path), target (string type), and number of text patches (integer type). It is divided into a training set with approximately 2.55 million examples, totaling 103154872722 bytes. Both the download size and the actual size of the dataset are 103154872722 bytes. The dataset configuration is set to default, with training data files located at data/train-* path.
提供机构:
Team-PIXEL
原始信息汇总
数据集概述
许可证
- Apache 2.0
数据集信息
-
特征
- text
- bytes: 二进制数据类型
- path: 空值数据类型
- target: 字符串数据类型
- num_text_patches: 64位整数数据类型
- text
-
分割
- train
- 字节数: 103,154,872,722
- 样本数: 2,555,904
- train
数据大小
- 下载大小: 102,774,842,417 字节
- 数据集大小: 103,154,872,722 字节
配置
- default
- 数据文件
- train:
data/train-*
- train:
- 数据文件



