gayanin/kaggle-native-mixed
收藏Hugging Face2024-01-29 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/gayanin/kaggle-native-mixed
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: prob-0.1
features:
- name: refs
dtype: string
- name: trans
dtype: string
splits:
- name: train
num_bytes: 559822
num_examples: 5272
- name: test
num_bytes: 70347
num_examples: 659
- name: validation
num_bytes: 71743
num_examples: 660
download_size: 319724
dataset_size: 701912
- config_name: prob-0.2
features:
- name: refs
dtype: string
- name: trans
dtype: string
splits:
- name: train
num_bytes: 561099
num_examples: 5272
- name: test
num_bytes: 70430
num_examples: 659
- name: validation
num_bytes: 71672
num_examples: 660
download_size: 346018
dataset_size: 703201
- config_name: prob-0.3
features:
- name: refs
dtype: string
- name: trans
dtype: string
splits:
- name: train
num_bytes: 561587
num_examples: 5272
- name: test
num_bytes: 70197
num_examples: 659
- name: validation
num_bytes: 71684
num_examples: 660
download_size: 361331
dataset_size: 703468
- config_name: prob-0.4
features:
- name: refs
dtype: string
- name: trans
dtype: string
splits:
- name: train
num_bytes: 562321
num_examples: 5272
- name: test
num_bytes: 70580
num_examples: 659
- name: validation
num_bytes: 71924
num_examples: 660
download_size: 372313
dataset_size: 704825
- config_name: prob-0.5
features:
- name: refs
dtype: string
- name: trans
dtype: string
splits:
- name: train
num_bytes: 562216
num_examples: 5272
- name: test
num_bytes: 70353
num_examples: 659
- name: validation
num_bytes: 72357
num_examples: 660
download_size: 380547
dataset_size: 704926
configs:
- config_name: prob-0.1
data_files:
- split: train
path: prob-0.1/train-*
- split: test
path: prob-0.1/test-*
- split: validation
path: prob-0.1/validation-*
- config_name: prob-0.2
data_files:
- split: train
path: prob-0.2/train-*
- split: test
path: prob-0.2/test-*
- split: validation
path: prob-0.2/validation-*
- config_name: prob-0.3
data_files:
- split: train
path: prob-0.3/train-*
- split: test
path: prob-0.3/test-*
- split: validation
path: prob-0.3/validation-*
- config_name: prob-0.4
data_files:
- split: train
path: prob-0.4/train-*
- split: test
path: prob-0.4/test-*
- split: validation
path: prob-0.4/validation-*
- config_name: prob-0.5
data_files:
- split: train
path: prob-0.5/train-*
- split: test
path: prob-0.5/test-*
- split: validation
path: prob-0.5/validation-*
---
提供机构:
gayanin
原始信息汇总
数据集概述
数据集配置
-
prob-0.1
- 特征
refs: 字符串类型trans: 字符串类型
- 分割
train: 559822 字节, 5272 个样本test: 70347 字节, 659 个样本validation: 71743 字节, 660 个样本
- 下载大小: 319724 字节
- 数据集大小: 701912 字节
- 特征
-
prob-0.2
- 特征
refs: 字符串类型trans: 字符串类型
- 分割
train: 561099 字节, 5272 个样本test: 70430 字节, 659 个样本validation: 71672 字节, 660 个样本
- 下载大小: 346018 字节
- 数据集大小: 703201 字节
- 特征
-
prob-0.3
- 特征
refs: 字符串类型trans: 字符串类型
- 分割
train: 561587 字节, 5272 个样本test: 70197 字节, 659 个样本validation: 71684 字节, 660 个样本
- 下载大小: 361331 字节
- 数据集大小: 703468 字节
- 特征
-
prob-0.4
- 特征
refs: 字符串类型trans: 字符串类型
- 分割
train: 562321 字节, 5272 个样本test: 70580 字节, 659 个样本validation: 71924 字节, 660 个样本
- 下载大小: 372313 字节
- 数据集大小: 704825 字节
- 特征
-
prob-0.5
- 特征
refs: 字符串类型trans: 字符串类型
- 分割
train: 562216 字节, 5272 个样本test: 70353 字节, 659 个样本validation: 72357 字节, 660 个样本
- 下载大小: 380547 字节
- 数据集大小: 704926 字节
- 特征
数据文件路径
-
prob-0.1
train: prob-0.1/train-*test: prob-0.1/test-*validation: prob-0.1/validation-*
-
prob-0.2
train: prob-0.2/train-*test: prob-0.2/test-*validation: prob-0.2/validation-*
-
prob-0.3
train: prob-0.3/train-*test: prob-0.3/test-*validation: prob-0.3/validation-*
-
prob-0.4
train: prob-0.4/train-*test: prob-0.4/test-*validation: prob-0.4/validation-*
-
prob-0.5
train: prob-0.5/train-*test: prob-0.5/test-*validation: prob-0.5/validation-*



