s-nlp/Mintaka_Graph_Features_T5-large-ssm
收藏Hugging Face2024-04-05 更新2024-06-11 收录
下载链接:
https://hf-mirror.com/datasets/s-nlp/Mintaka_Graph_Features_T5-large-ssm
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: question
dtype: string
- name: question_answer
dtype: string
- name: num_nodes
dtype: int64
- name: num_edges
dtype: int64
- name: density
dtype: float64
- name: cycle
dtype: int64
- name: bridge
dtype: int64
- name: katz_centrality
dtype: float64
- name: page_rank
dtype: float64
- name: avg_ssp_length
dtype: float64
- name: determ_sequence
dtype: string
- name: gap_sequence
dtype: string
- name: g2t_sequence
dtype: string
- name: determ_sequence_embedding
dtype: string
- name: gap_sequence_embedding
dtype: string
- name: g2t_sequence_embedding
dtype: string
- name: question_answer_embedding
dtype: string
- name: tfidf_vector
dtype: string
- name: correct
dtype: float64
splits:
- name: train
num_bytes: 8914009001
num_examples: 78828
- name: validation
num_bytes: 545719014
num_examples: 14076
- name: test
num_bytes: 2579614925
num_examples: 22772
download_size: 1943971513
dataset_size: 12039342940
---
# Dataset Card for "Mintaka_Graph_Features_T5-large-ssm"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
s-nlp
原始信息汇总
数据集概述
特征信息
数据集包含以下特征:
- question: 类型为字符串
- question_answer: 类型为字符串
- num_nodes: 类型为整数
- num_edges: 类型为整数
- density: 类型为浮点数
- cycle: 类型为整数
- bridge: 类型为整数
- katz_centrality: 类型为浮点数
- page_rank: 类型为浮点数
- avg_ssp_length: 类型为浮点数
- determ_sequence: 类型为字符串
- gap_sequence: 类型为字符串
- g2t_sequence: 类型为字符串
- determ_sequence_embedding: 类型为字符串
- gap_sequence_embedding: 类型为字符串
- g2t_sequence_embedding: 类型为字符串
- question_answer_embedding: 类型为字符串
- tfidf_vector: 类型为字符串
- correct: 类型为浮点数
数据分割
数据集分为以下几个部分:
- train: 包含78828个样本,大小为8914009001字节
- validation: 包含14076个样本,大小为545719014字节
- test: 包含22772个样本,大小为2579614925字节
数据集大小
- 下载大小: 1943971513字节
- 数据集总大小: 12039342940字节



