liuyanchen1015/MULTI_VALUE_mnli_simple_past_for_present_perfect

Name: liuyanchen1015/MULTI_VALUE_mnli_simple_past_for_present_perfect
Creator: liuyanchen1015
Published: 2022-12-12 01:41:04
License: 暂无描述

Hugging Face2022-12-12 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/liuyanchen1015/MULTI_VALUE_mnli_simple_past_for_present_perfect

下载链接

链接失效反馈

官方服务：

资源简介：

--- dataset_info: features: - name: premise dtype: string - name: hypothesis dtype: string - name: label dtype: int64 - name: idx dtype: int64 - name: score dtype: int64 splits: - name: dev_matched num_bytes: 182245 num_examples: 793 - name: dev_mismatched num_bytes: 195941 num_examples: 788 - name: test_matched num_bytes: 215490 num_examples: 875 - name: test_mismatched num_bytes: 192851 num_examples: 826 - name: train num_bytes: 7833094 num_examples: 32860 download_size: 5311259 dataset_size: 8619621 --- # Dataset Card for "MULTI_VALUE_mnli_simple_past_for_present_perfect" [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)

提供机构：

liuyanchen1015

原始信息汇总

数据集概述

数据集名称

名称: MULTI_VALUE_mnli_simple_past_for_present_perfect

数据集特征

特征列表:
- premise: 数据类型 - string
- hypothesis: 数据类型 - string
- label: 数据类型 - int64
- idx: 数据类型 - int64
- score: 数据类型 - int64

数据集分割

分割详情:
- dev_matched: 示例数 - 793, 字节数 - 182245
- dev_mismatched: 示例数 - 788, 字节数 - 195941
- test_matched: 示例数 - 875, 字节数 - 215490
- test_mismatched: 示例数 - 826, 字节数 - 192851
- train: 示例数 - 32860, 字节数 - 7833094

数据集大小

下载大小: 5311259 字节
数据集大小: 8619621 字节

搜集汇总

数据集介绍

背景与挑战

背景概述

该数据集是一个基于MNLI（多类型自然语言推理）任务的文本分类数据集，包含约3.6万行数据，每行由前提（premise）和假设（hypothesis）文本对组成，并标注了类别标签（label，取值0、1或2）和分数（score）。数据集的特点是可能涉及时态语法转换（如简单过去时替换现在完成时），用于训练和评估模型在自然语言推理任务中的性能，并提供了标准的数据分割（训练集、开发集和测试集）。

以上内容由遇见数据集搜集并总结生成

5,000+

优质数据集

54 个

任务类型

进入经典数据集