JsSparkYyx/NLP524
收藏Hugging Face2023-11-17 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/JsSparkYyx/NLP524
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
dataset_info:
- config_name: mnli
features:
- name: source
dtype: string
- name: target
dtype: string
splits:
- name: train
num_bytes: 109962823
num_examples: 392702
- name: test
num_bytes: 5527941
num_examples: 19643
- name: valid
num_bytes: 5548772
num_examples: 19647
download_size: 53460884
dataset_size: 121039536
- config_name: qnli
features:
- name: source
dtype: string
- name: target
dtype: string
splits:
- name: train
num_bytes: 39384063
num_examples: 104743
- name: test
num_bytes: 2088746
num_examples: 5463
- name: valid
num_bytes: 2086659
num_examples: 5463
download_size: 19044246
dataset_size: 43559468
- config_name: qqp
features:
- name: source
dtype: string
- name: target
dtype: string
splits:
- name: train
num_bytes: 65589038
num_examples: 363846
- name: test
num_bytes: 71200676
num_examples: 390965
- name: valid
num_bytes: 7285839
num_examples: 40430
download_size: 67404067
dataset_size: 144075553
- config_name: sst2
features:
- name: source
dtype: string
- name: target
dtype: string
splits:
- name: train
num_bytes: 8730332
num_examples: 67349
- name: test
num_bytes: 327721
num_examples: 1821
- name: valid
num_bytes: 158588
num_examples: 872
download_size: 3370766
dataset_size: 9216641
configs:
- config_name: mnli
data_files:
- split: train
path: mnli/train-*
- split: test
path: mnli/test-*
- split: valid
path: mnli/valid-*
- config_name: qnli
data_files:
- split: train
path: qnli/train-*
- split: test
path: qnli/test-*
- split: valid
path: qnli/valid-*
- config_name: qqp
data_files:
- split: train
path: qqp/train-*
- split: test
path: qqp/test-*
- split: valid
path: qqp/valid-*
- config_name: sst2
data_files:
- split: train
path: sst2/train-*
- split: test
path: sst2/test-*
- split: valid
path: sst2/valid-*
---
提供机构:
JsSparkYyx
原始信息汇总
数据集概述
数据集配置
1. MNLI
- 特征:
source: stringtarget: string
- 分割:
train:- 字节数: 109962823
- 样本数: 392702
test:- 字节数: 5527941
- 样本数: 19643
valid:- 字节数: 5548772
- 样本数: 19647
- 下载大小: 53460884
- 数据集大小: 121039536
2. QNLI
- 特征:
source: stringtarget: string
- 分割:
train:- 字节数: 39384063
- 样本数: 104743
test:- 字节数: 2088746
- 样本数: 5463
valid:- 字节数: 2086659
- 样本数: 5463
- 下载大小: 19044246
- 数据集大小: 43559468
3. QQP
- 特征:
source: stringtarget: string
- 分割:
train:- 字节数: 65589038
- 样本数: 363846
test:- 字节数: 71200676
- 样本数: 390965
valid:- 字节数: 7285839
- 样本数: 40430
- 下载大小: 67404067
- 数据集大小: 144075553
4. SST2
- 特征:
source: stringtarget: string
- 分割:
train:- 字节数: 8730332
- 样本数: 67349
test:- 字节数: 327721
- 样本数: 1821
valid:- 字节数: 158588
- 样本数: 872
- 下载大小: 3370766
- 数据集大小: 9216641
数据文件路径
1. MNLI
train: mnli/train-*test: mnli/test-*valid: mnli/valid-*
2. QNLI
train: qnli/train-*test: qnli/test-*valid: qnli/valid-*
3. QQP
train: qqp/train-*test: qqp/test-*valid: qqp/valid-*
4. SST2
train: sst2/train-*test: sst2/test-*valid: sst2/valid-*



