s-nlp/MKQASubgraphsRanking
收藏Hugging Face2025-12-05 更新2026-02-07 收录
下载链接:
https://hf-mirror.com/datasets/s-nlp/MKQASubgraphsRanking
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: mkqa_t5large_subgraphs
features:
- name: question
dtype: string
- name: question_answer
dtype: string
- name: num_nodes
dtype: int64
- name: num_edges
dtype: int64
- name: density
dtype: float64
- name: cycle
dtype: int64
- name: bridge
dtype: int64
- name: katz_centrality
dtype: float64
- name: page_rank
dtype: float64
- name: avg_ssp_length
dtype: float64
- name: answerEntity
sequence: string
- name: groundTruthAnswerEntity
sequence: string
- name: questionEntity
sequence: string
- name: graph
dtype: string
- name: correct
dtype: float64
- name: no_highlighted_determ_sequence
dtype: string
- name: no_highlighted_determ_sequence_embedding
dtype: string
- name: highlighted_determ_sequence
dtype: string
- name: highlighted_determ_sequence_embedding
dtype: string
- name: question_answer_embedding
dtype: string
- name: tfidf_vector
sequence: float64
splits:
- name: test
num_bytes: 220110954
num_examples: 7239
download_size: 121123993
dataset_size: 220110954
- config_name: mkqa_t5largessm_outputs
features:
- name: question
dtype: string
- name: target
dtype: string
- name: answer_0
dtype: string
- name: answer_1
dtype: string
- name: answer_2
dtype: string
- name: answer_3
dtype: string
- name: answer_4
dtype: string
- name: answer_5
dtype: string
- name: answer_6
dtype: string
- name: answer_7
dtype: string
- name: answer_8
dtype: string
- name: answer_9
dtype: string
- name: answer_10
dtype: string
- name: answer_11
dtype: string
- name: answer_12
dtype: string
- name: answer_13
dtype: string
- name: answer_14
dtype: string
- name: answer_15
dtype: string
- name: answer_16
dtype: string
- name: answer_17
dtype: string
- name: answer_18
dtype: string
- name: answer_19
dtype: string
- name: answer_20
dtype: string
- name: answer_21
dtype: string
- name: answer_22
dtype: string
- name: answer_23
dtype: string
- name: answer_24
dtype: string
- name: answer_25
dtype: string
- name: answer_26
dtype: string
- name: answer_27
dtype: string
- name: answer_28
dtype: string
- name: answer_29
dtype: string
- name: target_out_of_vocab
dtype: bool
splits:
- name: train
num_bytes: 1681181
num_examples: 2237
- name: test
num_bytes: 193836
num_examples: 281
download_size: 1333228
dataset_size: 1875017
- config_name: mkqa_t5largessm_subgraphs
features:
- name: question
dtype: string
- name: question_answer
dtype: string
- name: num_nodes
dtype: int64
- name: num_edges
dtype: int64
- name: density
dtype: float64
- name: cycle
dtype: int64
- name: bridge
dtype: int64
- name: katz_centrality
dtype: float64
- name: page_rank
dtype: float64
- name: avg_ssp_length
dtype: float64
- name: answerEntity
sequence: string
- name: groundTruthAnswerEntity
sequence: string
- name: questionEntity
sequence: string
- name: graph
dtype: string
- name: correct
dtype: float64
- name: no_highlighted_determ_sequence
dtype: string
- name: no_highlighted_determ_sequence_embedding
dtype: string
- name: highlighted_determ_sequence
dtype: string
- name: highlighted_determ_sequence_embedding
dtype: string
- name: question_answer_embedding
dtype: string
splits:
- name: train
num_bytes: 1300419788
num_examples: 42704
- name: test
num_bytes: 220081998
num_examples: 7239
download_size: 809052508
dataset_size: 1520501786
- config_name: mkqa_t5xlssm_outputs
features:
- name: question
dtype: string
- name: target
dtype: string
- name: answer_0
dtype: string
- name: answer_1
dtype: string
- name: answer_2
dtype: string
- name: answer_3
dtype: string
- name: answer_4
dtype: string
- name: answer_5
dtype: string
- name: answer_6
dtype: string
- name: answer_7
dtype: string
- name: answer_8
dtype: string
- name: answer_9
dtype: string
- name: answer_10
dtype: string
- name: answer_11
dtype: string
- name: answer_12
dtype: string
- name: answer_13
dtype: string
- name: answer_14
dtype: string
- name: answer_15
dtype: string
- name: answer_16
dtype: string
- name: answer_17
dtype: string
- name: answer_18
dtype: string
- name: answer_19
dtype: string
- name: answer_20
dtype: string
- name: answer_21
dtype: string
- name: answer_22
dtype: string
- name: answer_23
dtype: string
- name: answer_24
dtype: string
- name: answer_25
dtype: string
- name: answer_26
dtype: string
- name: answer_27
dtype: string
- name: answer_28
dtype: string
- name: answer_29
dtype: string
- name: target_out_of_vocab
dtype: bool
splits:
- name: train
num_bytes: 1874077
num_examples: 2237
- name: test
num_bytes: 226790
num_examples: 281
download_size: 1556610
dataset_size: 2100867
- config_name: mkqa_t5xlssm_subgraphs
features:
- name: question
dtype: string
- name: question_answer
dtype: string
- name: num_nodes
dtype: int64
- name: num_edges
dtype: int64
- name: density
dtype: float64
- name: cycle
dtype: int64
- name: bridge
dtype: int64
- name: katz_centrality
dtype: float64
- name: page_rank
dtype: float64
- name: avg_ssp_length
dtype: float64
- name: answerEntity
sequence: string
- name: groundTruthAnswerEntity
sequence: string
- name: questionEntity
sequence: string
- name: graph
dtype: string
- name: correct
dtype: float64
- name: no_highlighted_determ_sequence
dtype: string
- name: no_highlighted_determ_sequence_embedding
dtype: string
- name: highlighted_determ_sequence
dtype: string
- name: highlighted_determ_sequence_embedding
dtype: string
- name: question_answer_embedding
dtype: string
splits:
- name: train
num_bytes: 976233063
num_examples: 32173
- name: test
num_bytes: 125665572
num_examples: 4138
download_size: 607719690
dataset_size: 1101898635
configs:
- config_name: mkqa_t5large_subgraphs
data_files:
- split: test
path: mkqa_t5large_subgraphs/test-*
- config_name: mkqa_t5largessm_outputs
data_files:
- split: train
path: mkqa_t5largessm_outputs/train-*
- split: test
path: mkqa_t5largessm_outputs/test-*
- config_name: mkqa_t5largessm_subgraphs
data_files:
- split: train
path: mkqa_t5largessm_subgraphs/train-*
- split: test
path: mkqa_t5largessm_subgraphs/test-*
- config_name: mkqa_t5xlssm_outputs
data_files:
- split: train
path: mkqa_t5xlssm_outputs/train-*
- split: test
path: mkqa_t5xlssm_outputs/test-*
- config_name: mkqa_t5xlssm_subgraphs
data_files:
- split: train
path: mkqa_t5xlssm_subgraphs/train-*
- split: test
path: mkqa_t5xlssm_subgraphs/test-*
---
---
数据集信息:
- 配置名称:mkqa_t5large_subgraphs
特征列表:
- 特征名称:question,数据类型:字符串
- 特征名称:question_answer,数据类型:字符串
- 特征名称:num_nodes,数据类型:64位整数
- 特征名称:num_edges,数据类型:64位整数
- 特征名称:density,数据类型:64位浮点数
- 特征名称:cycle,数据类型:64位整数
- 特征名称:bridge,数据类型:64位整数
- 特征名称:katz_centrality,数据类型:64位浮点数(卡茨中心性,Katz Centrality)
- 特征名称:page_rank,数据类型:64位浮点数(PageRank)
- 特征名称:avg_ssp_length,数据类型:64位浮点数(平均最短路径长度,Average Shortest Path Length)
- 特征名称:answerEntity,数据类型:字符串序列
- 特征名称:groundTruthAnswerEntity,数据类型:字符串序列
- 特征名称:questionEntity,数据类型:字符串序列
- 特征名称:graph,数据类型:字符串
- 特征名称:correct,数据类型:64位浮点数
- 特征名称:no_highlighted_determ_sequence,数据类型:字符串
- 特征名称:no_highlighted_determ_sequence_embedding,数据类型:字符串
- 特征名称:highlighted_determ_sequence,数据类型:字符串
- 特征名称:highlighted_determ_sequence_embedding,数据类型:字符串
- 特征名称:question_answer_embedding,数据类型:字符串
- 特征名称:tfidf_vector,数据类型:64位浮点数序列
数据拆分:
- 拆分名称:test,字节数:220110954,样本数量:7239
下载大小:121123993
数据集总大小:220110954
- 配置名称:mkqa_t5largessm_outputs
特征列表:
- 特征名称:question,数据类型:字符串
- 特征名称:target,数据类型:字符串
- 特征名称:answer_0,数据类型:字符串
- 特征名称:answer_1,数据类型:字符串
- 特征名称:answer_2,数据类型:字符串
- 特征名称:answer_3,数据类型:字符串
- 特征名称:answer_4,数据类型:字符串
- 特征名称:answer_5,数据类型:字符串
- 特征名称:answer_6,数据类型:字符串
- 特征名称:answer_7,数据类型:字符串
- 特征名称:answer_8,数据类型:字符串
- 特征名称:answer_9,数据类型:字符串
- 特征名称:answer_10,数据类型:字符串
- 特征名称:answer_11,数据类型:字符串
- 特征名称:answer_12,数据类型:字符串
- 特征名称:answer_13,数据类型:字符串
- 特征名称:answer_14,数据类型:字符串
- 特征名称:answer_15,数据类型:字符串
- 特征名称:answer_16,数据类型:字符串
- 特征名称:answer_17,数据类型:字符串
- 特征名称:answer_18,数据类型:字符串
- 特征名称:answer_19,数据类型:字符串
- 特征名称:answer_20,数据类型:字符串
- 特征名称:answer_21,数据类型:字符串
- 特征名称:answer_22,数据类型:字符串
- 特征名称:answer_23,数据类型:字符串
- 特征名称:answer_24,数据类型:字符串
- 特征名称:answer_25,数据类型:字符串
- 特征名称:answer_26,数据类型:字符串
- 特征名称:answer_27,数据类型:字符串
- 特征名称:answer_28,数据类型:字符串
- 特征名称:answer_29,数据类型:字符串
- 特征名称:target_out_of_vocab,数据类型:布尔值
数据拆分:
- 拆分名称:train,字节数:1681181,样本数量:2237
- 拆分名称:test,字节数:193836,样本数量:281
下载大小:1333228
数据集总大小:1875017
- 配置名称:mkqa_t5largessm_subgraphs
特征列表:
- 特征名称:question,数据类型:字符串
- 特征名称:question_answer,数据类型:字符串
- 特征名称:num_nodes,数据类型:64位整数
- 特征名称:num_edges,数据类型:64位整数
- 特征名称:density,数据类型:64位浮点数
- 特征名称:cycle,数据类型:64位整数
- 特征名称:bridge,数据类型:64位整数
- 特征名称:katz_centrality,数据类型:64位浮点数(卡茨中心性,Katz Centrality)
- 特征名称:page_rank,数据类型:64位浮点数(PageRank)
- 特征名称:avg_ssp_length,数据类型:64位浮点数(平均最短路径长度,Average Shortest Path Length)
- 特征名称:answerEntity,数据类型:字符串序列
- 特征名称:groundTruthAnswerEntity,数据类型:字符串序列
- 特征名称:questionEntity,数据类型:字符串序列
- 特征名称:graph,数据类型:字符串
- 特征名称:correct,数据类型:64位浮点数
- 特征名称:no_highlighted_determ_sequence,数据类型:字符串
- 特征名称:no_highlighted_determ_sequence_embedding,数据类型:字符串
- 特征名称:highlighted_determ_sequence,数据类型:字符串
- 特征名称:highlighted_determ_sequence_embedding,数据类型:字符串
- 特征名称:question_answer_embedding,数据类型:字符串
数据拆分:
- 拆分名称:train,字节数:1300419788,样本数量:42704
- 拆分名称:test,字节数:220081998,样本数量:7239
下载大小:809052508
数据集总大小:1520501786
- 配置名称:mkqa_t5xlssm_outputs
特征列表:
- 特征名称:question,数据类型:字符串
- 特征名称:target,数据类型:字符串
- 特征名称:answer_0,数据类型:字符串
- 特征名称:answer_1,数据类型:字符串
- 特征名称:answer_2,数据类型:字符串
- 特征名称:answer_3,数据类型:字符串
- 特征名称:answer_4,数据类型:字符串
- 特征名称:answer_5,数据类型:字符串
- 特征名称:answer_6,数据类型:字符串
- 特征名称:answer_7,数据类型:字符串
- 特征名称:answer_8,数据类型:字符串
- 特征名称:answer_9,数据类型:字符串
- 特征名称:answer_10,数据类型:字符串
- 特征名称:answer_11,数据类型:字符串
- 特征名称:answer_12,数据类型:字符串
- 特征名称:answer_13,数据类型:字符串
- 特征名称:answer_14,数据类型:字符串
- 特征名称:answer_15,数据类型:字符串
- 特征名称:answer_16,数据类型:字符串
- 特征名称:answer_17,数据类型:字符串
- 特征名称:answer_18,数据类型:字符串
- 特征名称:answer_19,数据类型:字符串
- 特征名称:answer_20,数据类型:字符串
- 特征名称:answer_21,数据类型:字符串
- 特征名称:answer_22,数据类型:字符串
- 特征名称:answer_23,数据类型:字符串
- 特征名称:answer_24,数据类型:字符串
- 特征名称:answer_25,数据类型:字符串
- 特征名称:answer_26,数据类型:字符串
- 特征名称:answer_27,数据类型:字符串
- 特征名称:answer_28,数据类型:字符串
- 特征名称:answer_29,数据类型:字符串
- 特征名称:target_out_of_vocab,数据类型:布尔值
数据拆分:
- 拆分名称:train,字节数:1874077,样本数量:2237
- 拆分名称:test,字节数:226790,样本数量:281
下载大小:1556610
数据集总大小:2100867
- 配置名称:mkqa_t5xlssm_subgraphs
特征列表:
- 特征名称:question,数据类型:字符串
- 特征名称:question_answer,数据类型:字符串
- 特征名称:num_nodes,数据类型:64位整数
- 特征名称:num_edges,数据类型:64位整数
- 特征名称:density,数据类型:64位浮点数
- 特征名称:cycle,数据类型:64位整数
- 特征名称:bridge,数据类型:64位整数
- 特征名称:katz_centrality,数据类型:64位浮点数(卡茨中心性,Katz Centrality)
- 特征名称:page_rank,数据类型:64位浮点数(PageRank)
- 特征名称:avg_ssp_length,数据类型:64位浮点数(平均最短路径长度,Average Shortest Path Length)
- 特征名称:answerEntity,数据类型:字符串序列
- 特征名称:groundTruthAnswerEntity,数据类型:字符串序列
- 特征名称:questionEntity,数据类型:字符串序列
- 特征名称:graph,数据类型:字符串
- 特征名称:correct,数据类型:64位浮点数
- 特征名称:no_highlighted_determ_sequence,数据类型:字符串
- 特征名称:no_highlighted_determ_sequence_embedding,数据类型:字符串
- 特征名称:highlighted_determ_sequence,数据类型:字符串
- 特征名称:highlighted_determ_sequence_embedding,数据类型:字符串
- 特征名称:question_answer_embedding,数据类型:字符串
数据拆分:
- 拆分名称:train,字节数:976233063,样本数量:32173
- 拆分名称:test,字节数:125665572,样本数量:4138
下载大小:607719690
数据集总大小:1101898635
配置项:
- 配置名称:mkqa_t5large_subgraphs
数据文件:
- 拆分:test,路径:mkqa_t5large_subgraphs/test-*
- 配置名称:mkqa_t5largessm_outputs
数据文件:
- 拆分:train,路径:mkqa_t5largessm_outputs/train-*
- 拆分:test,路径:mkqa_t5largessm_outputs/test-*
- 配置名称:mkqa_t5largessm_subgraphs
数据文件:
- 拆分:train,路径:mkqa_t5largessm_subgraphs/train-*
- 拆分:test,路径:mkqa_t5largessm_subgraphs/test-*
- 配置名称:mkqa_t5xlssm_outputs
数据文件:
- 拆分:train,路径:mkqa_t5xlssm_outputs/train-*
- 拆分:test,路径:mkqa_t5xlssm_outputs/test-*
- 配置名称:mkqa_t5xlssm_subgraphs
数据文件:
- 拆分:train,路径:mkqa_t5xlssm_subgraphs/train-*
- 拆分:test,路径:mkqa_t5xlssm_subgraphs/test-*
---
提供机构:
s-nlp



