s-nlp/Mintaka_Sequences_T5-large-ssm

Name: s-nlp/Mintaka_Sequences_T5-large-ssm
Creator: s-nlp
Published: 2024-04-05 10:12:57
License: 暂无描述

Hugging Face2024-04-05 更新2024-06-11 收录

下载链接：

https://hf-mirror.com/datasets/s-nlp/Mintaka_Sequences_T5-large-ssm

下载链接

链接失效反馈

官方服务：

资源简介：

--- dataset_info: features: - name: id dtype: string - name: question dtype: string - name: answerEntity dtype: string - name: questionEntity dtype: string - name: groundTruthAnswerEntity dtype: string - name: complexityType dtype: string - name: graph dtype: string - name: correct dtype: bool - name: g2t_sequence dtype: string - name: gap_sequence dtype: string - name: highlighted_g2t_sequence dtype: string - name: no_highlighted_g2t_sequence dtype: string - name: highlighted_gap_sequence dtype: string - name: no_highlighted_gap_sequence dtype: string - name: highlighted_determ_sequence dtype: string - name: no_highlighted_determ_sequence dtype: string splits: - name: train num_bytes: 156273506 num_examples: 54179 - name: validation num_bytes: 31978611 num_examples: 10369 - name: test num_bytes: 44824721 num_examples: 15583 download_size: 41480863 dataset_size: 233076838 --- # Dataset Card for "Mintaka_Sequences_T5-large-ssm" [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)

--- 数据集信息：特征字段： - 名称：ID，数据类型：字符串 - 名称：问题，数据类型：字符串 - 名称：答案实体（Answer Entity），数据类型：字符串 - 名称：问题实体（Question Entity），数据类型：字符串 - 名称：真实答案实体（Ground Truth Answer Entity），数据类型：字符串 - 名称：复杂度类型（Complexity Type），数据类型：字符串 - 名称：图（Graph），数据类型：字符串 - 名称：正确，数据类型：布尔值 - 名称：G2T序列（G2T Sequence），数据类型：字符串 - 名称：GAP序列（GAP Sequence），数据类型：字符串 - 名称：高亮G2T序列（Highlighted G2T Sequence），数据类型：字符串 - 名称：非高亮G2T序列（No-highlighted G2T Sequence），数据类型：字符串 - 名称：高亮GAP序列（Highlighted GAP Sequence），数据类型：字符串 - 名称：非高亮GAP序列（No-highlighted GAP Sequence），数据类型：字符串 - 名称：高亮判定序列（Highlighted Determ Sequence），数据类型：字符串 - 名称：非高亮判定序列（No-highlighted Determ Sequence），数据类型：字符串拆分集： - 名称：训练集，字节大小：156273506，样本数量：54179 - 名称：验证集，字节大小：31978611，样本数量：10369 - 名称：测试集，字节大小：44824721，样本数量：15583 下载总大小：41480863 数据集总存储大小：233076838 --- # "Mintaka_Sequences_T5-large-ssm"数据集卡片 [需补充更多信息](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)

提供机构：

s-nlp

原始信息汇总

数据集概述

数据集名称

Mintaka_Sequences_T5-large-ssm

数据集特征

id: 字符串类型
question: 字符串类型
answerEntity: 字符串类型
questionEntity: 字符串类型
groundTruthAnswerEntity: 字符串类型
complexityType: 字符串类型
graph: 字符串类型
correct: 布尔类型
g2t_sequence: 字符串类型
gap_sequence: 字符串类型
highlighted_g2t_sequence: 字符串类型
no_highlighted_g2t_sequence: 字符串类型
highlighted_gap_sequence: 字符串类型
no_highlighted_gap_sequence: 字符串类型
highlighted_determ_sequence: 字符串类型
no_highlighted_determ_sequence: 字符串类型

数据集分割

train: 54179个样本，占用156273506字节
validation: 10369个样本，占用31978611字节
test: 15583个样本，占用44824721字节

数据集大小

下载大小: 41480863字节
数据集总大小: 233076838字节

5,000+

优质数据集

54 个

任务类型

进入经典数据集