rakshya34/my-tokenized-test-dataset
Hugging Face | Updated 2024-05-31 | Indexed 2024-06-12
Download link:
https://hf-mirror.com/datasets/rakshya34/my-tokenized-test-dataset
Resource description:
---
dataset_info:
  features:
  - name: text
    dtype: string
  - name: input_ids
    sequence: int32
  - name: attention_mask
    sequence: int8
  splits:
  - name: test
    num_bytes: 832470563
    num_examples: 268189
  download_size: 100312490
  dataset_size: 832470563
configs:
- config_name: default
  data_files:
  - split: test
    path: data/test-*
---
The dataset has three features: text (string), input_ids (sequence of int32), and attention_mask (sequence of int8). It contains a single test split of 268189 examples totalling 832470563 bytes; because test is the only split listed, this is also the total dataset size. The download size is 100312490 bytes. The only configuration is default, and its test split reads the data files matching the path data/test-*.
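The size figures above can be put in perspective with a little arithmetic. The sketch below uses only the three numbers from the metadata. It rests on an assumption the card does not state: that the gap between the two sizes comes from compression of the downloaded files.

```python
# Values copied from the dataset_info metadata.
NUM_EXAMPLES = 268189
DATASET_SIZE_BYTES = 832470563
DOWNLOAD_SIZE_BYTES = 100312490

# Average footprint of one example (text + input_ids + attention_mask).
avg_bytes_per_example = DATASET_SIZE_BYTES / NUM_EXAMPLES

# How many times larger the dataset size is than the download size.
# Assumption: the difference is due to compression of the downloaded files.
size_ratio = DATASET_SIZE_BYTES / DOWNLOAD_SIZE_BYTES

print(f"average bytes per example: {avg_bytes_per_example:.0f}")
print(f"dataset size / download size: {size_ratio:.1f}")
```

Each example therefore averages roughly 3.1 KB, and the dataset size is about 8.3 times the download size.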
Provider:
rakshya34
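Summary of original information
Dataset overview
Dataset features
- text: string
- input_ids: sequence of integers, type int32
- attention_mask: sequence of integers, type int8
Dataset splits
- test:
  - Number of examples: 268189
  - Data size: 832470563 bytes
Dataset size and download information
- Download size: 100312490 bytes
- Total dataset size: 832470563 bytes
Configuration
- config_name: default
- data_files:
  - split: test
  - path: data/test-*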
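
As a rough illustration of how the features and configuration listed above might be used, the sketch below loads the test split with the Hugging Face `datasets` library and checks one record. The helper names `validate_example` and `load_test_split` are hypothetical. The check also assumes, as is typical for tokenizer output but not stated on the card, that input_ids and attention_mask have equal length and that the mask holds only 0 or 1.

```python
from typing import Any, Mapping

REPO_ID = "rakshya34/my-tokenized-test-dataset"  # repository name from the card


def validate_example(example: Mapping[str, Any]) -> bool:
    """Return True if a record has the three card features in a consistent shape.

    Assumption (not stated on the card): input_ids and attention_mask are
    aligned token-for-token and the mask only contains 0 or 1.
    """
    if not isinstance(example.get("text"), str):
        return False
    ids = example.get("input_ids")
    mask = example.get("attention_mask")
    if ids is None or mask is None or len(ids) != len(mask):
        return False
    return all(m in (0, 1) for m in mask)


def load_test_split(streaming: bool = True):
    """Load the only split listed on the card (requires `pip install datasets`)."""
    from datasets import load_dataset  # imported lazily; needs network access
    return load_dataset(REPO_ID, split="test", streaming=streaming)
```

With streaming enabled, `next(iter(load_test_split()))` would fetch a single record without downloading the full dataset, and that record can then be passed to `validate_example`.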



