
gayanin/woz-noised

Hugging Face | Updated 2024-02-12 | Indexed 2024-03-04
Download link:
https://hf-mirror.com/datasets/gayanin/woz-noised
Resource description:
---
dataset_info:
- config_name: babylon-01
  features:
  - name: 'Unnamed: 0'
    dtype: int64
  - name: refs
    dtype: string
  - name: trans
    dtype: string
  splits:
  - name: train
    num_bytes: 8112324
    num_examples: 60913
  - name: test
    num_bytes: 1008294
    num_examples: 7614
  - name: validation
    num_bytes: 1006154
    num_examples: 7615
  download_size: 5223245
  dataset_size: 10126772
- config_name: babylon-02
  features:
  - name: 'Unnamed: 0'
    dtype: int64
  - name: refs
    dtype: string
  - name: trans
    dtype: string
  splits:
  - name: train
    num_bytes: 8216996
    num_examples: 60913
  - name: test
    num_bytes: 1020688
    num_examples: 7614
  - name: validation
    num_bytes: 1019701
    num_examples: 7615
  download_size: 5413512
  dataset_size: 10257385
- config_name: babylon-03
  features:
  - name: 'Unnamed: 0'
    dtype: int64
  - name: refs
    dtype: string
  - name: trans
    dtype: string
  splits:
  - name: train
    num_bytes: 8324195
    num_examples: 60913
  - name: test
    num_bytes: 1033187
    num_examples: 7614
  - name: validation
    num_bytes: 1032177
    num_examples: 7615
  download_size: 5592910
  dataset_size: 10389559
- config_name: babylon-04
  features:
  - name: 'Unnamed: 0'
    dtype: int64
  - name: refs
    dtype: string
  - name: trans
    dtype: string
  splits:
  - name: train
    num_bytes: 8428874
    num_examples: 60913
  - name: test
    num_bytes: 1046773
    num_examples: 7614
  - name: validation
    num_bytes: 1044412
    num_examples: 7615
  download_size: 5751362
  dataset_size: 10520059
- config_name: gcd-01
  features:
  - name: 'Unnamed: 0'
    dtype: int64
  - name: refs
    dtype: string
  - name: trans
    dtype: string
  splits:
  - name: train
    num_bytes: 8011683
    num_examples: 60913
  - name: test
    num_bytes: 994916
    num_examples: 7614
  - name: validation
    num_bytes: 993581
    num_examples: 7615
  download_size: 5158129
  dataset_size: 10000180
- config_name: gcd-02
  features:
  - name: 'Unnamed: 0'
    dtype: int64
  - name: refs
    dtype: string
  - name: trans
    dtype: string
  splits:
  - name: train
    num_bytes: 8016470
    num_examples: 60913
  - name: test
    num_bytes: 995014
    num_examples: 7614
  - name: validation
    num_bytes: 994504
    num_examples: 7615
  download_size: 5275632
  dataset_size: 10005988
- config_name: gcd-03
  features:
  - name: 'Unnamed: 0'
    dtype: int64
  - name: refs
    dtype: string
  - name: trans
    dtype: string
  splits:
  - name: train
    num_bytes: 8020814
    num_examples: 60913
  - name: test
    num_bytes: 996773
    num_examples: 7614
  - name: validation
    num_bytes: 993740
    num_examples: 7615
  download_size: 5370129
  dataset_size: 10011327
- config_name: gcd-04
  features:
  - name: 'Unnamed: 0'
    dtype: int64
  - name: refs
    dtype: string
  - name: trans
    dtype: string
  splits:
  - name: train
    num_bytes: 8026821
    num_examples: 60913
  - name: test
    num_bytes: 996875
    num_examples: 7614
  - name: validation
    num_bytes: 994555
    num_examples: 7615
  download_size: 5437522
  dataset_size: 10018251
- config_name: kaggle-01
  features:
  - name: 'Unnamed: 0'
    dtype: int64
  - name: refs
    dtype: string
  - name: trans
    dtype: string
  splits:
  - name: train
    num_bytes: 8010348
    num_examples: 60913
  - name: test
    num_bytes: 995757
    num_examples: 7614
  - name: validation
    num_bytes: 993878
    num_examples: 7615
  download_size: 5154817
  dataset_size: 9999983
- config_name: kaggle-03
  features:
  - name: 'Unnamed: 0'
    dtype: int64
  - name: refs
    dtype: string
  - name: trans
    dtype: string
  splits:
  - name: train
    num_bytes: 8020978
    num_examples: 60913
  - name: test
    num_bytes: 995483
    num_examples: 7614
  - name: validation
    num_bytes: 994210
    num_examples: 7615
  download_size: 5367278
  dataset_size: 10010671
- config_name: kaggle-04
  features:
  - name: 'Unnamed: 0'
    dtype: int64
  - name: refs
    dtype: string
  - name: trans
    dtype: string
  splits:
  - name: train
    num_bytes: 8023749
    num_examples: 60913
  - name: test
    num_bytes: 997411
    num_examples: 7614
  - name: validation
    num_bytes: 994380
    num_examples: 7615
  download_size: 5439468
  dataset_size: 10015540
configs:
- config_name: babylon-01
  data_files:
  - split: train
    path: babylon-01/train-*
  - split: test
    path: babylon-01/test-*
  - split: validation
    path: babylon-01/validation-*
- config_name: babylon-02
  data_files:
  - split: train
    path: babylon-02/train-*
  - split: test
    path: babylon-02/test-*
  - split: validation
    path: babylon-02/validation-*
- config_name: babylon-03
  data_files:
  - split: train
    path: babylon-03/train-*
  - split: test
    path: babylon-03/test-*
  - split: validation
    path: babylon-03/validation-*
- config_name: babylon-04
  data_files:
  - split: train
    path: babylon-04/train-*
  - split: test
    path: babylon-04/test-*
  - split: validation
    path: babylon-04/validation-*
- config_name: gcd-01
  data_files:
  - split: train
    path: gcd-01/train-*
  - split: test
    path: gcd-01/test-*
  - split: validation
    path: gcd-01/validation-*
- config_name: gcd-02
  data_files:
  - split: train
    path: gcd-02/train-*
  - split: test
    path: gcd-02/test-*
  - split: validation
    path: gcd-02/validation-*
- config_name: gcd-03
  data_files:
  - split: train
    path: gcd-03/train-*
  - split: test
    path: gcd-03/test-*
  - split: validation
    path: gcd-03/validation-*
- config_name: gcd-04
  data_files:
  - split: train
    path: gcd-04/train-*
  - split: test
    path: gcd-04/test-*
  - split: validation
    path: gcd-04/validation-*
- config_name: kaggle-01
  data_files:
  - split: train
    path: kaggle-01/train-*
  - split: test
    path: kaggle-01/test-*
  - split: validation
    path: kaggle-01/validation-*
- config_name: kaggle-03
  data_files:
  - split: train
    path: kaggle-03/train-*
  - split: test
    path: kaggle-03/test-*
  - split: validation
    path: kaggle-03/validation-*
- config_name: kaggle-04
  data_files:
  - split: train
    path: kaggle-04/train-*
  - split: test
    path: kaggle-04/test-*
  - split: validation
    path: kaggle-04/validation-*
---
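The metadata above defines named configurations, each with train, test and validation splits. A minimal sketch of loading one split, assuming the Hugging Face `datasets` library is installed and the repository is reachable over the network; the helper name `load_woz_noised` is hypothetical and not part of the dataset or the library.

```python
# Split names as they appear in the metadata.
SPLITS = ("train", "test", "validation")


def load_woz_noised(config_name: str, split: str = "train"):
    """Load one split of one configuration (hypothetical helper)."""
    if split not in SPLITS:
        raise ValueError(f"unknown split: {split!r}")
    # Imported lazily so this module can be imported without the library.
    from datasets import load_dataset

    # The second positional argument selects the config, e.g. "babylon-01".
    return load_dataset("gayanin/woz-noised", config_name, split=split)
```

For instance, `load_woz_noised("gcd-01", "validation")` would be expected to return a dataset with the columns `Unnamed: 0`, `refs` and `trans` listed in the metadata.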
Provider:
gayanin
Summary of original information

Dataset overview

Dataset configurations

babylon-01

  • Features:
    • Unnamed: 0: int64
    • refs: string
    • trans: string
  • Splits:
    • train: 8112324 bytes, 60913 examples
    • test: 1008294 bytes, 7614 examples
    • validation: 1006154 bytes, 7615 examples
  • Download size: 5223245 bytes
  • Dataset size: 10126772 bytes
  • Data file paths:
    • train: babylon-01/train-*
    • test: babylon-01/test-*
    • validation: babylon-01/validation-*
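The figures for babylon-01 can be cross-checked with simple arithmetic: the three split sizes should add up to the reported dataset size, and the example counts give the split proportions. A small sketch using only the numbers listed above; the variable names are illustrative.

```python
# Split statistics for babylon-01, copied from the listing above.
SPLIT_STATS = {
    "train": {"num_bytes": 8112324, "num_examples": 60913},
    "test": {"num_bytes": 1008294, "num_examples": 7614},
    "validation": {"num_bytes": 1006154, "num_examples": 7615},
}
DATASET_SIZE = 10126772  # reported dataset size in bytes

total_bytes = sum(s["num_bytes"] for s in SPLIT_STATS.values())
total_examples = sum(s["num_examples"] for s in SPLIT_STATS.values())

# Share of examples in each split.
fractions = {
    name: s["num_examples"] / total_examples for name, s in SPLIT_STATS.items()
}

print("bytes match reported size:", total_bytes == DATASET_SIZE)
for name, frac in fractions.items():
    print(f"{name}: {frac:.1%} of {total_examples} examples")
```

The split byte counts sum exactly to the reported dataset size, and the example counts correspond to roughly an 80/10/10 split.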

babylon-02

  • Features:
    • Unnamed: 0: int64
    • refs: string
    • trans: string
  • Splits:
    • train: 8216996 bytes, 60913 examples
    • test: 1020688 bytes, 7614 examples
    • validation: 1019701 bytes, 7615 examples
  • Download size: 5413512 bytes
  • Dataset size: 10257385 bytes
  • Data file paths:
    • train: babylon-02/train-*
    • test: babylon-02/test-*
    • validation: babylon-02/validation-*

babylon-03

  • Features:
    • Unnamed: 0: int64
    • refs: string
    • trans: string
  • Splits:
    • train: 8324195 bytes, 60913 examples
    • test: 1033187 bytes, 7614 examples
    • validation: 1032177 bytes, 7615 examples
  • Download size: 5592910 bytes
  • Dataset size: 10389559 bytes
  • Data file paths:
    • train: babylon-03/train-*
    • test: babylon-03/test-*
    • validation: babylon-03/validation-*

babylon-04

  • Features:
    • Unnamed: 0: int64
    • refs: string
    • trans: string
  • Splits:
    • train: 8428874 bytes, 60913 examples
    • test: 1046773 bytes, 7614 examples
    • validation: 1044412 bytes, 7615 examples
  • Download size: 5751362 bytes
  • Dataset size: 10520059 bytes
  • Data file paths:
    • train: babylon-04/train-*
    • test: babylon-04/test-*
    • validation: babylon-04/validation-*
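Across the four babylon configurations the example counts are identical while the train split grows in bytes with each numeric suffix. The sketch below only quantifies that growth from the listed figures; what the suffix actually encodes is not stated in the source, so no interpretation is assumed here.

```python
# Train-split sizes for the four babylon configs, copied from the listings above.
TRAIN_BYTES = {
    "babylon-01": 8112324,
    "babylon-02": 8216996,
    "babylon-03": 8324195,
    "babylon-04": 8428874,
}
TRAIN_EXAMPLES = 60913  # the same for every config

names = sorted(TRAIN_BYTES)

# Average recorded bytes per training example in each config.
bytes_per_example = {n: TRAIN_BYTES[n] / TRAIN_EXAMPLES for n in names}

# Increase in train bytes from one config to the next.
deltas = [TRAIN_BYTES[b] - TRAIN_BYTES[a] for a, b in zip(names, names[1:])]

for n in names:
    print(f"{n}: {bytes_per_example[n]:.2f} bytes per example")
print("step increases:", deltas)
```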

gcd-01

  • Features:
    • Unnamed: 0: int64
    • refs: string
    • trans: string
  • Splits:
    • train: 8011683 bytes, 60913 examples
    • test: 994916 bytes, 7614 examples
    • validation: 993581 bytes, 7615 examples
  • Download size: 5158129 bytes
  • Dataset size: 10000180 bytes
  • Data file paths:
    • train: gcd-01/train-*
    • test: gcd-01/test-*
    • validation: gcd-01/validation-*

gcd-02

  • Features:
    • Unnamed: 0: int64
    • refs: string
    • trans: string
  • Splits:
    • train: 8016470 bytes, 60913 examples
    • test: 995014 bytes, 7614 examples
    • validation: 994504 bytes, 7615 examples
  • Download size: 5275632 bytes
  • Dataset size: 10005988 bytes
  • Data file paths:
    • train: gcd-02/train-*
    • test: gcd-02/test-*
    • validation: gcd-02/validation-*

gcd-03

  • Features:
    • Unnamed: 0: int64
    • refs: string
    • trans: string
  • Splits:
    • train: 8020814 bytes, 60913 examples
    • test: 996773 bytes, 7614 examples
    • validation: 993740 bytes, 7615 examples
  • Download size: 5370129 bytes
  • Dataset size: 10011327 bytes
  • Data file paths:
    • train: gcd-03/train-*
    • test: gcd-03/test-*
    • validation: gcd-03/validation-*

gcd-04

  • Features:
    • Unnamed: 0: int64
    • refs: string
    • trans: string
  • Splits:
    • train: 8026821 bytes, 60913 examples
    • test: 996875 bytes, 7614 examples
    • validation: 994555 bytes, 7615 examples
  • Download size: 5437522 bytes
  • Dataset size: 10018251 bytes
  • Data file paths:
    • train: gcd-04/train-*
    • test: gcd-04/test-*
    • validation: gcd-04/validation-*

kaggle-01

  • Features:
    • Unnamed: 0: int64
    • refs: string
    • trans: string
  • Splits:
    • train: 8010348 bytes, 60913 examples
    • test: 995757 bytes, 7614 examples
    • validation: 993878 bytes, 7615 examples
  • Download size: 5154817 bytes
  • Dataset size: 9999983 bytes
  • Data file paths:
    • train: kaggle-01/train-*
    • test: kaggle-01/test-*
    • validation: kaggle-01/validation-*

kaggle-03

  • Features:
    • Unnamed: 0: int64
    • refs: string
    • trans: string
  • Splits:
    • train: 8020978 bytes, 60913 examples
    • test: 995483 bytes, 7614 examples
    • validation: 994210 bytes, 7615 examples
  • Download size: 5367278 bytes
  • Dataset size: 10010671 bytes
  • Data file paths:
    • train: kaggle-03/train-*
    • test: kaggle-03/test-*
    • validation: kaggle-03/validation-*

kaggle-04

  • Features:
    • Unnamed: 0: int64
    • refs: string
    • trans: string
  • Splits:
    • train: 8023749 bytes, 60913 examples
    • test: 997411 bytes, 7614 examples
    • validation: 994380 bytes, 7615 examples
  • Download size: 5439468 bytes
  • Dataset size: 10015540 bytes
  • Data file paths:
    • train: kaggle-04/train-*
    • test: kaggle-04/test-*
    • validation: kaggle-04/validation-*
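The configuration names follow a `prefix-NN` pattern, and the listing is not a complete grid: there is no kaggle-02 entry. A short sketch, assuming only the names listed above, that groups the configs by prefix and reports which numeric suffixes are absent; the function names are illustrative.

```python
from collections import defaultdict

# Config names copied from the listing above.
CONFIG_NAMES = [
    "babylon-01", "babylon-02", "babylon-03", "babylon-04",
    "gcd-01", "gcd-02", "gcd-03", "gcd-04",
    "kaggle-01", "kaggle-03", "kaggle-04",
]


def group_by_prefix(names):
    """Map each prefix (e.g. 'gcd') to its sorted numeric suffixes."""
    groups = defaultdict(list)
    for name in names:
        prefix, _, suffix = name.rpartition("-")
        groups[prefix].append(int(suffix))
    return {prefix: sorted(nums) for prefix, nums in groups.items()}


def missing_suffixes(groups):
    """For each prefix, list suffixes that other prefixes have but it lacks."""
    seen = sorted({n for nums in groups.values() for n in nums})
    return {p: [n for n in seen if n not in nums] for p, nums in groups.items()}


groups = group_by_prefix(CONFIG_NAMES)
gaps = missing_suffixes(groups)
print(groups)
print(gaps)
```

Code that iterates over all prefix and suffix combinations should therefore take the names from the listing rather than generate them.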