s-nlp/Mintaka_Sequences_T5-large-ssm
Source: Hugging Face. Updated 2024-04-05; indexed 2024-06-11.
Download link:
https://hf-mirror.com/datasets/s-nlp/Mintaka_Sequences_T5-large-ssm
Resource description:
---
dataset_info:
  features:
  - name: id
    dtype: string
  - name: question
    dtype: string
  - name: answerEntity
    dtype: string
  - name: questionEntity
    dtype: string
  - name: groundTruthAnswerEntity
    dtype: string
  - name: complexityType
    dtype: string
  - name: graph
    dtype: string
  - name: correct
    dtype: bool
  - name: g2t_sequence
    dtype: string
  - name: gap_sequence
    dtype: string
  - name: highlighted_g2t_sequence
    dtype: string
  - name: no_highlighted_g2t_sequence
    dtype: string
  - name: highlighted_gap_sequence
    dtype: string
  - name: no_highlighted_gap_sequence
    dtype: string
  - name: highlighted_determ_sequence
    dtype: string
  - name: no_highlighted_determ_sequence
    dtype: string
  splits:
  - name: train
    num_bytes: 156273506
    num_examples: 54179
  - name: validation
    num_bytes: 31978611
    num_examples: 10369
  - name: test
    num_bytes: 44824721
    num_examples: 15583
  download_size: 41480863
  dataset_size: 233076838
---
# Dataset Card for "Mintaka_Sequences_T5-large-ssm"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
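
The card gives no usage instructions, so the following is only a minimal sketch of how the dataset might be loaded and checked against the schema in `dataset_info`. It assumes the Hugging Face `datasets` library is installed and that the repository id `s-nlp/Mintaka_Sequences_T5-large-ssm` resolves on the Hub. The helper names `EXPECTED_FEATURES`, `missing_columns`, and `load_split` are hypothetical and do not come from the card.

```python
# Column names and dtypes copied from the card's `dataset_info.features` block.
EXPECTED_FEATURES = {
    "id": "string",
    "question": "string",
    "answerEntity": "string",
    "questionEntity": "string",
    "groundTruthAnswerEntity": "string",
    "complexityType": "string",
    "graph": "string",
    "correct": "bool",
    "g2t_sequence": "string",
    "gap_sequence": "string",
    "highlighted_g2t_sequence": "string",
    "no_highlighted_g2t_sequence": "string",
    "highlighted_gap_sequence": "string",
    "no_highlighted_gap_sequence": "string",
    "highlighted_determ_sequence": "string",
    "no_highlighted_determ_sequence": "string",
}


def missing_columns(column_names):
    """Return the expected columns that are absent from `column_names`, in card order."""
    present = set(column_names)
    return [name for name in EXPECTED_FEATURES if name not in present]


def load_split(split="train"):
    """Load one split from the Hub (assumes `pip install datasets` and network access)."""
    # Imported lazily so the schema helpers above also work offline.
    from datasets import load_dataset

    ds = load_dataset("s-nlp/Mintaka_Sequences_T5-large-ssm", split=split)
    absent = missing_columns(ds.column_names)
    if absent:
        raise ValueError(f"columns missing from the loaded split: {absent}")
    return ds
```

Calling `load_split("validation")` would download the smallest split first, which may be a cheaper way to inspect a few rows before fetching the training data.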
Provider:
s-nlp
Summary of original information
Dataset overview
Dataset name
Mintaka_Sequences_T5-large-ssm
Dataset features
- id: string
- question: string
- answerEntity: string
- questionEntity: string
- groundTruthAnswerEntity: string
- complexityType: string
- graph: string
- correct: bool
- g2t_sequence: string
- gap_sequence: string
- highlighted_g2t_sequence: string
- no_highlighted_g2t_sequence: string
- highlighted_gap_sequence: string
- no_highlighted_gap_sequence: string
- highlighted_determ_sequence: string
- no_highlighted_determ_sequence: string
Dataset splits
- train: 54179 examples, 156273506 bytes
- validation: 10369 examples, 31978611 bytes
- test: 15583 examples, 44824721 bytes
Dataset size
- Download size: 41480863 bytes
- Total dataset size: 233076838 bytes
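
The split and size figures above can be cross-checked with a little arithmetic. The sketch below uses only the numbers stated in the card; the names `SPLITS`, `split_share`, and the derived totals are illustrative and not part of the dataset.

```python
# Figures copied from the card's split and size listings.
SPLITS = {
    "train": {"num_bytes": 156273506, "num_examples": 54179},
    "validation": {"num_bytes": 31978611, "num_examples": 10369},
    "test": {"num_bytes": 44824721, "num_examples": 15583},
}
DOWNLOAD_SIZE = 41480863
DATASET_SIZE = 233076838

# Totals across all three splits.
total_examples = sum(s["num_examples"] for s in SPLITS.values())
total_bytes = sum(s["num_bytes"] for s in SPLITS.values())


def split_share(name):
    """Fraction of all examples that fall in the named split."""
    return SPLITS[name]["num_examples"] / total_examples


if __name__ == "__main__":
    print(f"total examples: {total_examples}")
    print(f"split bytes sum to dataset_size: {total_bytes == DATASET_SIZE}")
    for name in SPLITS:
        print(f"{name}: {split_share(name):.1%} of examples")
    print(f"download / dataset size: {DOWNLOAD_SIZE / DATASET_SIZE:.3f}")
```

The per-split byte counts sum exactly to the stated total dataset size, and the three splits hold 80131 examples in total. The download is under a fifth of the total dataset size.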



