Kyle1668/dclm-dedup-25B_20251112-0611
收藏Hugging Face2025-11-12 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/Kyle1668/dclm-dedup-25B_20251112-0611
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: id
dtype: string
- name: word_filter
dtype: bool
- name: word_filter_metadata
struct:
- name: keywords
dtype: string
- name: combined_filter
dtype: bool
splits:
- name: train
num_bytes: 1304545429
num_examples: 20462594
download_size: 763483584
dataset_size: 1304545429
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
提供机构:
Kyle1668



