kliu-flappy/fineweb-edu-gpt2-100m-difficulty-ordered-test
Source: Hugging Face | Updated: 2026-03-02 | Indexed: 2026-03-29
Download link:
https://hf-mirror.com/datasets/kliu-flappy/fineweb-edu-gpt2-100m-difficulty-ordered-test
Description:
---
dataset_info:
  features:
  - name: token_count
    dtype: int64
  - name: input_ids
    list: uint16
  - name: pad_mask
    list: uint16
  - name: sequence_ids
    list: uint16
  - name: token_density
    dtype: float64
  - name: tokenizer
    dtype: string
  - name: max_length
    dtype: int64
  - name: total_tokens
    dtype: int64
  - name: train_portion
    dtype: float64
  - name: test_portion
    dtype: float64
  - name: difficulty_score
    dtype: float64
  - name: difficulty_grade
    dtype: float64
  - name: text
    dtype: string
  splits:
  - name: train
    num_bytes: 1011035897
    num_examples: 194932
  download_size: 401942381
  dataset_size: 1011035897
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
---
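
The schema above could be inspected with a short script like the following. This is a minimal sketch, not the dataset author's own loading code: it assumes the `datasets` library is installed and that the repository ID taken from the download link resolves on the Hub. The helper names `stream_train_split`, `scalar_fields`, and `preview` are hypothetical and introduced only for illustration; the column names are copied from the feature list.

```python
from itertools import islice
from typing import Any, Dict, Iterable, List

# Repository ID taken from the download link in this listing.
REPO_ID = "kliu-flappy/fineweb-edu-gpt2-100m-difficulty-ordered-test"

# Scalar columns from the feature list; the list-typed columns
# (input_ids, pad_mask, sequence_ids) and `text` are left out.
SCALAR_COLUMNS = [
    "token_count", "token_density", "tokenizer", "max_length",
    "total_tokens", "train_portion", "test_portion",
    "difficulty_score", "difficulty_grade",
]


def stream_train_split() -> Iterable[Dict[str, Any]]:
    """Stream the single `train` split without downloading it in full.

    Assumption: network access and `pip install datasets`.
    """
    from datasets import load_dataset  # imported lazily on purpose

    return load_dataset(REPO_ID, split="train", streaming=True)


def scalar_fields(row: Dict[str, Any]) -> Dict[str, Any]:
    """Keep only the scalar metadata columns that are present in a row."""
    return {name: row[name] for name in SCALAR_COLUMNS if name in row}


def preview(rows: Iterable[Dict[str, Any]], n: int = 3) -> List[Dict[str, Any]]:
    """Return the scalar metadata of the first `n` rows."""
    return [scalar_fields(row) for row in islice(rows, n)]
```

Calling `preview(stream_train_split())` would fetch the first three rows and return their scalar metadata. Streaming is used here because the split is listed at roughly 1 GB (`dataset_size: 1011035897`), so a quick look at the columns does not require the full download.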
Provider:
kliu-flappy



