liuyanchen1015/MULTI_VALUE_mnli_regularized_past_tense

Name: liuyanchen1015/MULTI_VALUE_mnli_regularized_past_tense
Creator: liuyanchen1015
Published: 2022-12-12 01:42:25
License: 暂无描述

Hugging Face2022-12-12 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/liuyanchen1015/MULTI_VALUE_mnli_regularized_past_tense

下载链接

链接失效反馈

官方服务：

资源简介：

--- dataset_info: features: - name: premise dtype: string - name: hypothesis dtype: string - name: label dtype: int64 - name: idx dtype: int64 - name: score dtype: int64 splits: - name: dev_matched num_bytes: 333958 num_examples: 1578 - name: dev_mismatched num_bytes: 386493 num_examples: 1671 - name: test_matched num_bytes: 335720 num_examples: 1504 - name: test_mismatched num_bytes: 367789 num_examples: 1625 - name: train num_bytes: 13723643 num_examples: 62518 download_size: 9637787 dataset_size: 15147603 --- # Dataset Card for "MULTI_VALUE_mnli_regularized_past_tense" [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)

数据集信息：特征： - 名称：前提（premise），数据类型：字符串（string） - 名称：假设（hypothesis），数据类型：字符串（string） - 名称：标签（label），数据类型：64位整数（int64） - 名称：索引（idx），数据类型：64位整数（int64） - 名称：得分（score），数据类型：64位整数（int64）数据集划分： - 名称：匹配开发集（dev_matched），字节数：333958，样本数量：1578 - 名称：不匹配开发集（dev_mismatched），字节数：386493，样本数量：1671 - 名称：匹配测试集（test_matched），字节数：335720，样本数量：1504 - 名称：不匹配测试集（test_mismatched），字节数：367789，样本数量：1625 - 名称：训练集（train），字节数：13723643，样本数量：62518 下载大小：9637787 字节数据集总大小：15147603 字节 # "MULTI_VALUE_mnli_regularized_past_tense" 数据集卡片 [更多信息待补充](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)

提供机构：

liuyanchen1015

原始信息汇总

数据集概述

数据集名称

MULTI_VALUE_mnli_regularized_past_tense

数据集特征

premise: 数据类型 - string
hypothesis: 数据类型 - string
label: 数据类型 - int64
idx: 数据类型 - int64
score: 数据类型 - int64

数据集分割

dev_matched:
- 数据大小: 333958 bytes
- 示例数量: 1578
dev_mismatched:
- 数据大小: 386493 bytes
- 示例数量: 1671
test_matched:
- 数据大小: 335720 bytes
- 示例数量: 1504
test_mismatched:
- 数据大小: 367789 bytes
- 示例数量: 1625
train:
- 数据大小: 13723643 bytes
- 示例数量: 62518

数据集大小

下载大小: 9637787 bytes
数据集总大小: 15147603 bytes

搜集汇总

数据集介绍

背景与挑战

背景概述

该数据集是一个基于MNLI（多体裁自然语言推理）任务的变体，专注于过去时态的规则化处理，包含约6.9万行数据，涵盖前提、假设、标签和分数等字段。数据示例显示大量不规则过去时态动词的使用，旨在支持自然语言理解模型在时态变化下的推理能力评估。

以上内容由遇见数据集搜集并总结生成

5,000+

优质数据集

54 个

任务类型

进入经典数据集