five

JsSparkYyx/NLP524

收藏
Hugging Face2023-11-17 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/JsSparkYyx/NLP524
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 dataset_info: - config_name: mnli features: - name: source dtype: string - name: target dtype: string splits: - name: train num_bytes: 109962823 num_examples: 392702 - name: test num_bytes: 5527941 num_examples: 19643 - name: valid num_bytes: 5548772 num_examples: 19647 download_size: 53460884 dataset_size: 121039536 - config_name: qnli features: - name: source dtype: string - name: target dtype: string splits: - name: train num_bytes: 39384063 num_examples: 104743 - name: test num_bytes: 2088746 num_examples: 5463 - name: valid num_bytes: 2086659 num_examples: 5463 download_size: 19044246 dataset_size: 43559468 - config_name: qqp features: - name: source dtype: string - name: target dtype: string splits: - name: train num_bytes: 65589038 num_examples: 363846 - name: test num_bytes: 71200676 num_examples: 390965 - name: valid num_bytes: 7285839 num_examples: 40430 download_size: 67404067 dataset_size: 144075553 - config_name: sst2 features: - name: source dtype: string - name: target dtype: string splits: - name: train num_bytes: 8730332 num_examples: 67349 - name: test num_bytes: 327721 num_examples: 1821 - name: valid num_bytes: 158588 num_examples: 872 download_size: 3370766 dataset_size: 9216641 configs: - config_name: mnli data_files: - split: train path: mnli/train-* - split: test path: mnli/test-* - split: valid path: mnli/valid-* - config_name: qnli data_files: - split: train path: qnli/train-* - split: test path: qnli/test-* - split: valid path: qnli/valid-* - config_name: qqp data_files: - split: train path: qqp/train-* - split: test path: qqp/test-* - split: valid path: qqp/valid-* - config_name: sst2 data_files: - split: train path: sst2/train-* - split: test path: sst2/test-* - split: valid path: sst2/valid-* ---
提供机构:
JsSparkYyx
原始信息汇总

数据集概述

数据集配置

1. MNLI

  • 特征:
    • source: string
    • target: string
  • 分割:
    • train:
      • 字节数: 109962823
      • 样本数: 392702
    • test:
      • 字节数: 5527941
      • 样本数: 19643
    • valid:
      • 字节数: 5548772
      • 样本数: 19647
  • 下载大小: 53460884
  • 数据集大小: 121039536

2. QNLI

  • 特征:
    • source: string
    • target: string
  • 分割:
    • train:
      • 字节数: 39384063
      • 样本数: 104743
    • test:
      • 字节数: 2088746
      • 样本数: 5463
    • valid:
      • 字节数: 2086659
      • 样本数: 5463
  • 下载大小: 19044246
  • 数据集大小: 43559468

3. QQP

  • 特征:
    • source: string
    • target: string
  • 分割:
    • train:
      • 字节数: 65589038
      • 样本数: 363846
    • test:
      • 字节数: 71200676
      • 样本数: 390965
    • valid:
      • 字节数: 7285839
      • 样本数: 40430
  • 下载大小: 67404067
  • 数据集大小: 144075553

4. SST2

  • 特征:
    • source: string
    • target: string
  • 分割:
    • train:
      • 字节数: 8730332
      • 样本数: 67349
    • test:
      • 字节数: 327721
      • 样本数: 1821
    • valid:
      • 字节数: 158588
      • 样本数: 872
  • 下载大小: 3370766
  • 数据集大小: 9216641

数据文件路径

1. MNLI

  • train: mnli/train-*
  • test: mnli/test-*
  • valid: mnli/valid-*

2. QNLI

  • train: qnli/train-*
  • test: qnli/test-*
  • valid: qnli/valid-*

3. QQP

  • train: qqp/train-*
  • test: qqp/test-*
  • valid: qqp/valid-*

4. SST2

  • train: sst2/train-*
  • test: sst2/test-*
  • valid: sst2/valid-*
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作