xiaozeroone/c4_derived
收藏Hugging Face2023-10-08 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/xiaozeroone/c4_derived
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: default
data_files:
- split: c4
path: data/c4-*
- split: biomedical
path: data/biomedical-*
- split: counterfactual
path: data/counterfactual-*
- split: academic
path: data/academic-*
dataset_info:
features:
- name: text
dtype: string
- name: url
dtype: string
splits:
- name: c4
num_bytes: 1820234
num_examples: 1000
- name: biomedical
num_bytes: 1803036
num_examples: 989
- name: counterfactual
num_bytes: 1813882
num_examples: 985
- name: academic
num_bytes: 1199491
num_examples: 986
download_size: 4124290
dataset_size: 6636643
---
# Dataset Card for "c4_derived"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
xiaozeroone
原始信息汇总
数据集卡片 "c4_derived"
配置
- 默认配置
- 数据文件:
- 分割: c4
- 路径: data/c4-*
- 分割: biomedical
- 路径: data/biomedical-*
- 分割: counterfactual
- 路径: data/counterfactual-*
- 分割: academic
- 路径: data/academic-*
- 分割: c4
- 数据文件:
数据集信息
-
特征:
- 名称: text
- 数据类型: string
- 名称: url
- 数据类型: string
- 名称: text
-
分割:
- 名称: c4
- 字节数: 1820234
- 样本数: 1000
- 名称: biomedical
- 字节数: 1803036
- 样本数: 989
- 名称: counterfactual
- 字节数: 1813882
- 样本数: 985
- 名称: academic
- 字节数: 1199491
- 样本数: 986
- 名称: c4
-
下载大小: 4124290 字节
-
数据集大小: 6636643 字节



