gayanin/woz-noised
收藏Hugging Face2024-02-12 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/gayanin/woz-noised
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: babylon-01
features:
- name: 'Unnamed: 0'
dtype: int64
- name: refs
dtype: string
- name: trans
dtype: string
splits:
- name: train
num_bytes: 8112324
num_examples: 60913
- name: test
num_bytes: 1008294
num_examples: 7614
- name: validation
num_bytes: 1006154
num_examples: 7615
download_size: 5223245
dataset_size: 10126772
- config_name: babylon-02
features:
- name: 'Unnamed: 0'
dtype: int64
- name: refs
dtype: string
- name: trans
dtype: string
splits:
- name: train
num_bytes: 8216996
num_examples: 60913
- name: test
num_bytes: 1020688
num_examples: 7614
- name: validation
num_bytes: 1019701
num_examples: 7615
download_size: 5413512
dataset_size: 10257385
- config_name: babylon-03
features:
- name: 'Unnamed: 0'
dtype: int64
- name: refs
dtype: string
- name: trans
dtype: string
splits:
- name: train
num_bytes: 8324195
num_examples: 60913
- name: test
num_bytes: 1033187
num_examples: 7614
- name: validation
num_bytes: 1032177
num_examples: 7615
download_size: 5592910
dataset_size: 10389559
- config_name: babylon-04
features:
- name: 'Unnamed: 0'
dtype: int64
- name: refs
dtype: string
- name: trans
dtype: string
splits:
- name: train
num_bytes: 8428874
num_examples: 60913
- name: test
num_bytes: 1046773
num_examples: 7614
- name: validation
num_bytes: 1044412
num_examples: 7615
download_size: 5751362
dataset_size: 10520059
- config_name: gcd-01
features:
- name: 'Unnamed: 0'
dtype: int64
- name: refs
dtype: string
- name: trans
dtype: string
splits:
- name: train
num_bytes: 8011683
num_examples: 60913
- name: test
num_bytes: 994916
num_examples: 7614
- name: validation
num_bytes: 993581
num_examples: 7615
download_size: 5158129
dataset_size: 10000180
- config_name: gcd-02
features:
- name: 'Unnamed: 0'
dtype: int64
- name: refs
dtype: string
- name: trans
dtype: string
splits:
- name: train
num_bytes: 8016470
num_examples: 60913
- name: test
num_bytes: 995014
num_examples: 7614
- name: validation
num_bytes: 994504
num_examples: 7615
download_size: 5275632
dataset_size: 10005988
- config_name: gcd-03
features:
- name: 'Unnamed: 0'
dtype: int64
- name: refs
dtype: string
- name: trans
dtype: string
splits:
- name: train
num_bytes: 8020814
num_examples: 60913
- name: test
num_bytes: 996773
num_examples: 7614
- name: validation
num_bytes: 993740
num_examples: 7615
download_size: 5370129
dataset_size: 10011327
- config_name: gcd-04
features:
- name: 'Unnamed: 0'
dtype: int64
- name: refs
dtype: string
- name: trans
dtype: string
splits:
- name: train
num_bytes: 8026821
num_examples: 60913
- name: test
num_bytes: 996875
num_examples: 7614
- name: validation
num_bytes: 994555
num_examples: 7615
download_size: 5437522
dataset_size: 10018251
- config_name: kaggle-01
features:
- name: 'Unnamed: 0'
dtype: int64
- name: refs
dtype: string
- name: trans
dtype: string
splits:
- name: train
num_bytes: 8010348
num_examples: 60913
- name: test
num_bytes: 995757
num_examples: 7614
- name: validation
num_bytes: 993878
num_examples: 7615
download_size: 5154817
dataset_size: 9999983
- config_name: kaggle-03
features:
- name: 'Unnamed: 0'
dtype: int64
- name: refs
dtype: string
- name: trans
dtype: string
splits:
- name: train
num_bytes: 8020978
num_examples: 60913
- name: test
num_bytes: 995483
num_examples: 7614
- name: validation
num_bytes: 994210
num_examples: 7615
download_size: 5367278
dataset_size: 10010671
- config_name: kaggle-04
features:
- name: 'Unnamed: 0'
dtype: int64
- name: refs
dtype: string
- name: trans
dtype: string
splits:
- name: train
num_bytes: 8023749
num_examples: 60913
- name: test
num_bytes: 997411
num_examples: 7614
- name: validation
num_bytes: 994380
num_examples: 7615
download_size: 5439468
dataset_size: 10015540
configs:
- config_name: babylon-01
data_files:
- split: train
path: babylon-01/train-*
- split: test
path: babylon-01/test-*
- split: validation
path: babylon-01/validation-*
- config_name: babylon-02
data_files:
- split: train
path: babylon-02/train-*
- split: test
path: babylon-02/test-*
- split: validation
path: babylon-02/validation-*
- config_name: babylon-03
data_files:
- split: train
path: babylon-03/train-*
- split: test
path: babylon-03/test-*
- split: validation
path: babylon-03/validation-*
- config_name: babylon-04
data_files:
- split: train
path: babylon-04/train-*
- split: test
path: babylon-04/test-*
- split: validation
path: babylon-04/validation-*
- config_name: gcd-01
data_files:
- split: train
path: gcd-01/train-*
- split: test
path: gcd-01/test-*
- split: validation
path: gcd-01/validation-*
- config_name: gcd-02
data_files:
- split: train
path: gcd-02/train-*
- split: test
path: gcd-02/test-*
- split: validation
path: gcd-02/validation-*
- config_name: gcd-03
data_files:
- split: train
path: gcd-03/train-*
- split: test
path: gcd-03/test-*
- split: validation
path: gcd-03/validation-*
- config_name: gcd-04
data_files:
- split: train
path: gcd-04/train-*
- split: test
path: gcd-04/test-*
- split: validation
path: gcd-04/validation-*
- config_name: kaggle-01
data_files:
- split: train
path: kaggle-01/train-*
- split: test
path: kaggle-01/test-*
- split: validation
path: kaggle-01/validation-*
- config_name: kaggle-03
data_files:
- split: train
path: kaggle-03/train-*
- split: test
path: kaggle-03/test-*
- split: validation
path: kaggle-03/validation-*
- config_name: kaggle-04
data_files:
- split: train
path: kaggle-04/train-*
- split: test
path: kaggle-04/test-*
- split: validation
path: kaggle-04/validation-*
---
提供机构:
gayanin
原始信息汇总
数据集概述
数据集配置
babylon-01
- 特征:
Unnamed: 0: int64refs: stringtrans: string
- 分割:
train: 8112324 字节, 60913 样本test: 1008294 字节, 7614 样本validation: 1006154 字节, 7615 样本
- 下载大小: 5223245 字节
- 数据集大小: 10126772 字节
- 数据文件路径:
train: babylon-01/train-*test: babylon-01/test-*validation: babylon-01/validation-*
babylon-02
- 特征:
Unnamed: 0: int64refs: stringtrans: string
- 分割:
train: 8216996 字节, 60913 样本test: 1020688 字节, 7614 样本validation: 1019701 字节, 7615 样本
- 下载大小: 5413512 字节
- 数据集大小: 10257385 字节
- 数据文件路径:
train: babylon-02/train-*test: babylon-02/test-*validation: babylon-02/validation-*
babylon-03
- 特征:
Unnamed: 0: int64refs: stringtrans: string
- 分割:
train: 8324195 字节, 60913 样本test: 1033187 字节, 7614 样本validation: 1032177 字节, 7615 样本
- 下载大小: 5592910 字节
- 数据集大小: 10389559 字节
- 数据文件路径:
train: babylon-03/train-*test: babylon-03/test-*validation: babylon-03/validation-*
babylon-04
- 特征:
Unnamed: 0: int64refs: stringtrans: string
- 分割:
train: 8428874 字节, 60913 样本test: 1046773 字节, 7614 样本validation: 1044412 字节, 7615 样本
- 下载大小: 5751362 字节
- 数据集大小: 10520059 字节
- 数据文件路径:
train: babylon-04/train-*test: babylon-04/test-*validation: babylon-04/validation-*
gcd-01
- 特征:
Unnamed: 0: int64refs: stringtrans: string
- 分割:
train: 8011683 字节, 60913 样本test: 994916 字节, 7614 样本validation: 993581 字节, 7615 样本
- 下载大小: 5158129 字节
- 数据集大小: 10000180 字节
- 数据文件路径:
train: gcd-01/train-*test: gcd-01/test-*validation: gcd-01/validation-*
gcd-02
- 特征:
Unnamed: 0: int64refs: stringtrans: string
- 分割:
train: 8016470 字节, 60913 样本test: 995014 字节, 7614 样本validation: 994504 字节, 7615 样本
- 下载大小: 5275632 字节
- 数据集大小: 10005988 字节
- 数据文件路径:
train: gcd-02/train-*test: gcd-02/test-*validation: gcd-02/validation-*
gcd-03
- 特征:
Unnamed: 0: int64refs: stringtrans: string
- 分割:
train: 8020814 字节, 60913 样本test: 996773 字节, 7614 样本validation: 993740 字节, 7615 样本
- 下载大小: 5370129 字节
- 数据集大小: 10011327 字节
- 数据文件路径:
train: gcd-03/train-*test: gcd-03/test-*validation: gcd-03/validation-*
gcd-04
- 特征:
Unnamed: 0: int64refs: stringtrans: string
- 分割:
train: 8026821 字节, 60913 样本test: 996875 字节, 7614 样本validation: 994555 字节, 7615 样本
- 下载大小: 5437522 字节
- 数据集大小: 10018251 字节
- 数据文件路径:
train: gcd-04/train-*test: gcd-04/test-*validation: gcd-04/validation-*
kaggle-01
- 特征:
Unnamed: 0: int64refs: stringtrans: string
- 分割:
train: 8010348 字节, 60913 样本test: 995757 字节, 7614 样本validation: 993878 字节, 7615 样本
- 下载大小: 5154817 字节
- 数据集大小: 9999983 字节
- 数据文件路径:
train: kaggle-01/train-*test: kaggle-01/test-*validation: kaggle-01/validation-*
kaggle-03
- 特征:
Unnamed: 0: int64refs: stringtrans: string
- 分割:
train: 8020978 字节, 60913 样本test: 995483 字节, 7614 样本validation: 994210 字节, 7615 样本
- 下载大小: 5367278 字节
- 数据集大小: 10010671 字节
- 数据文件路径:
train: kaggle-03/train-*test: kaggle-03/test-*validation: kaggle-03/validation-*
kaggle-04
- 特征:
Unnamed: 0: int64refs: stringtrans: string
- 分割:
train: 8023749 字节, 60913 样本test: 997411 字节, 7614 样本validation: 994380 字节, 7615 样本
- 下载大小: 5439468 字节
- 数据集大小: 10015540 字节
- 数据文件路径:
train: kaggle-04/train-*test: kaggle-04/test-*validation: kaggle-04/validation-*



