five

s-nlp/MKQASubgraphsRanking

收藏
Hugging Face2025-12-05 更新2026-02-07 收录
下载链接:
https://hf-mirror.com/datasets/s-nlp/MKQASubgraphsRanking
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: - config_name: mkqa_t5large_subgraphs features: - name: question dtype: string - name: question_answer dtype: string - name: num_nodes dtype: int64 - name: num_edges dtype: int64 - name: density dtype: float64 - name: cycle dtype: int64 - name: bridge dtype: int64 - name: katz_centrality dtype: float64 - name: page_rank dtype: float64 - name: avg_ssp_length dtype: float64 - name: answerEntity sequence: string - name: groundTruthAnswerEntity sequence: string - name: questionEntity sequence: string - name: graph dtype: string - name: correct dtype: float64 - name: no_highlighted_determ_sequence dtype: string - name: no_highlighted_determ_sequence_embedding dtype: string - name: highlighted_determ_sequence dtype: string - name: highlighted_determ_sequence_embedding dtype: string - name: question_answer_embedding dtype: string - name: tfidf_vector sequence: float64 splits: - name: test num_bytes: 220110954 num_examples: 7239 download_size: 121123993 dataset_size: 220110954 - config_name: mkqa_t5largessm_outputs features: - name: question dtype: string - name: target dtype: string - name: answer_0 dtype: string - name: answer_1 dtype: string - name: answer_2 dtype: string - name: answer_3 dtype: string - name: answer_4 dtype: string - name: answer_5 dtype: string - name: answer_6 dtype: string - name: answer_7 dtype: string - name: answer_8 dtype: string - name: answer_9 dtype: string - name: answer_10 dtype: string - name: answer_11 dtype: string - name: answer_12 dtype: string - name: answer_13 dtype: string - name: answer_14 dtype: string - name: answer_15 dtype: string - name: answer_16 dtype: string - name: answer_17 dtype: string - name: answer_18 dtype: string - name: answer_19 dtype: string - name: answer_20 dtype: string - name: answer_21 dtype: string - name: answer_22 dtype: string - name: answer_23 dtype: string - name: answer_24 dtype: string - name: answer_25 dtype: string - name: answer_26 dtype: string - name: answer_27 dtype: string - name: answer_28 dtype: string - name: answer_29 dtype: string - name: target_out_of_vocab dtype: bool splits: - name: train num_bytes: 1681181 num_examples: 2237 - name: test num_bytes: 193836 num_examples: 281 download_size: 1333228 dataset_size: 1875017 - config_name: mkqa_t5largessm_subgraphs features: - name: question dtype: string - name: question_answer dtype: string - name: num_nodes dtype: int64 - name: num_edges dtype: int64 - name: density dtype: float64 - name: cycle dtype: int64 - name: bridge dtype: int64 - name: katz_centrality dtype: float64 - name: page_rank dtype: float64 - name: avg_ssp_length dtype: float64 - name: answerEntity sequence: string - name: groundTruthAnswerEntity sequence: string - name: questionEntity sequence: string - name: graph dtype: string - name: correct dtype: float64 - name: no_highlighted_determ_sequence dtype: string - name: no_highlighted_determ_sequence_embedding dtype: string - name: highlighted_determ_sequence dtype: string - name: highlighted_determ_sequence_embedding dtype: string - name: question_answer_embedding dtype: string splits: - name: train num_bytes: 1300419788 num_examples: 42704 - name: test num_bytes: 220081998 num_examples: 7239 download_size: 809052508 dataset_size: 1520501786 - config_name: mkqa_t5xlssm_outputs features: - name: question dtype: string - name: target dtype: string - name: answer_0 dtype: string - name: answer_1 dtype: string - name: answer_2 dtype: string - name: answer_3 dtype: string - name: answer_4 dtype: string - name: answer_5 dtype: string - name: answer_6 dtype: string - name: answer_7 dtype: string - name: answer_8 dtype: string - name: answer_9 dtype: string - name: answer_10 dtype: string - name: answer_11 dtype: string - name: answer_12 dtype: string - name: answer_13 dtype: string - name: answer_14 dtype: string - name: answer_15 dtype: string - name: answer_16 dtype: string - name: answer_17 dtype: string - name: answer_18 dtype: string - name: answer_19 dtype: string - name: answer_20 dtype: string - name: answer_21 dtype: string - name: answer_22 dtype: string - name: answer_23 dtype: string - name: answer_24 dtype: string - name: answer_25 dtype: string - name: answer_26 dtype: string - name: answer_27 dtype: string - name: answer_28 dtype: string - name: answer_29 dtype: string - name: target_out_of_vocab dtype: bool splits: - name: train num_bytes: 1874077 num_examples: 2237 - name: test num_bytes: 226790 num_examples: 281 download_size: 1556610 dataset_size: 2100867 - config_name: mkqa_t5xlssm_subgraphs features: - name: question dtype: string - name: question_answer dtype: string - name: num_nodes dtype: int64 - name: num_edges dtype: int64 - name: density dtype: float64 - name: cycle dtype: int64 - name: bridge dtype: int64 - name: katz_centrality dtype: float64 - name: page_rank dtype: float64 - name: avg_ssp_length dtype: float64 - name: answerEntity sequence: string - name: groundTruthAnswerEntity sequence: string - name: questionEntity sequence: string - name: graph dtype: string - name: correct dtype: float64 - name: no_highlighted_determ_sequence dtype: string - name: no_highlighted_determ_sequence_embedding dtype: string - name: highlighted_determ_sequence dtype: string - name: highlighted_determ_sequence_embedding dtype: string - name: question_answer_embedding dtype: string splits: - name: train num_bytes: 976233063 num_examples: 32173 - name: test num_bytes: 125665572 num_examples: 4138 download_size: 607719690 dataset_size: 1101898635 configs: - config_name: mkqa_t5large_subgraphs data_files: - split: test path: mkqa_t5large_subgraphs/test-* - config_name: mkqa_t5largessm_outputs data_files: - split: train path: mkqa_t5largessm_outputs/train-* - split: test path: mkqa_t5largessm_outputs/test-* - config_name: mkqa_t5largessm_subgraphs data_files: - split: train path: mkqa_t5largessm_subgraphs/train-* - split: test path: mkqa_t5largessm_subgraphs/test-* - config_name: mkqa_t5xlssm_outputs data_files: - split: train path: mkqa_t5xlssm_outputs/train-* - split: test path: mkqa_t5xlssm_outputs/test-* - config_name: mkqa_t5xlssm_subgraphs data_files: - split: train path: mkqa_t5xlssm_subgraphs/train-* - split: test path: mkqa_t5xlssm_subgraphs/test-* ---

--- 数据集信息: - 配置名称:mkqa_t5large_subgraphs 特征列表: - 特征名称:question,数据类型:字符串 - 特征名称:question_answer,数据类型:字符串 - 特征名称:num_nodes,数据类型:64位整数 - 特征名称:num_edges,数据类型:64位整数 - 特征名称:density,数据类型:64位浮点数 - 特征名称:cycle,数据类型:64位整数 - 特征名称:bridge,数据类型:64位整数 - 特征名称:katz_centrality,数据类型:64位浮点数(卡茨中心性,Katz Centrality) - 特征名称:page_rank,数据类型:64位浮点数(PageRank) - 特征名称:avg_ssp_length,数据类型:64位浮点数(平均最短路径长度,Average Shortest Path Length) - 特征名称:answerEntity,数据类型:字符串序列 - 特征名称:groundTruthAnswerEntity,数据类型:字符串序列 - 特征名称:questionEntity,数据类型:字符串序列 - 特征名称:graph,数据类型:字符串 - 特征名称:correct,数据类型:64位浮点数 - 特征名称:no_highlighted_determ_sequence,数据类型:字符串 - 特征名称:no_highlighted_determ_sequence_embedding,数据类型:字符串 - 特征名称:highlighted_determ_sequence,数据类型:字符串 - 特征名称:highlighted_determ_sequence_embedding,数据类型:字符串 - 特征名称:question_answer_embedding,数据类型:字符串 - 特征名称:tfidf_vector,数据类型:64位浮点数序列 数据拆分: - 拆分名称:test,字节数:220110954,样本数量:7239 下载大小:121123993 数据集总大小:220110954 - 配置名称:mkqa_t5largessm_outputs 特征列表: - 特征名称:question,数据类型:字符串 - 特征名称:target,数据类型:字符串 - 特征名称:answer_0,数据类型:字符串 - 特征名称:answer_1,数据类型:字符串 - 特征名称:answer_2,数据类型:字符串 - 特征名称:answer_3,数据类型:字符串 - 特征名称:answer_4,数据类型:字符串 - 特征名称:answer_5,数据类型:字符串 - 特征名称:answer_6,数据类型:字符串 - 特征名称:answer_7,数据类型:字符串 - 特征名称:answer_8,数据类型:字符串 - 特征名称:answer_9,数据类型:字符串 - 特征名称:answer_10,数据类型:字符串 - 特征名称:answer_11,数据类型:字符串 - 特征名称:answer_12,数据类型:字符串 - 特征名称:answer_13,数据类型:字符串 - 特征名称:answer_14,数据类型:字符串 - 特征名称:answer_15,数据类型:字符串 - 特征名称:answer_16,数据类型:字符串 - 特征名称:answer_17,数据类型:字符串 - 特征名称:answer_18,数据类型:字符串 - 特征名称:answer_19,数据类型:字符串 - 特征名称:answer_20,数据类型:字符串 - 特征名称:answer_21,数据类型:字符串 - 特征名称:answer_22,数据类型:字符串 - 特征名称:answer_23,数据类型:字符串 - 特征名称:answer_24,数据类型:字符串 - 特征名称:answer_25,数据类型:字符串 - 特征名称:answer_26,数据类型:字符串 - 特征名称:answer_27,数据类型:字符串 - 特征名称:answer_28,数据类型:字符串 - 特征名称:answer_29,数据类型:字符串 - 特征名称:target_out_of_vocab,数据类型:布尔值 数据拆分: - 拆分名称:train,字节数:1681181,样本数量:2237 - 拆分名称:test,字节数:193836,样本数量:281 下载大小:1333228 数据集总大小:1875017 - 配置名称:mkqa_t5largessm_subgraphs 特征列表: - 特征名称:question,数据类型:字符串 - 特征名称:question_answer,数据类型:字符串 - 特征名称:num_nodes,数据类型:64位整数 - 特征名称:num_edges,数据类型:64位整数 - 特征名称:density,数据类型:64位浮点数 - 特征名称:cycle,数据类型:64位整数 - 特征名称:bridge,数据类型:64位整数 - 特征名称:katz_centrality,数据类型:64位浮点数(卡茨中心性,Katz Centrality) - 特征名称:page_rank,数据类型:64位浮点数(PageRank) - 特征名称:avg_ssp_length,数据类型:64位浮点数(平均最短路径长度,Average Shortest Path Length) - 特征名称:answerEntity,数据类型:字符串序列 - 特征名称:groundTruthAnswerEntity,数据类型:字符串序列 - 特征名称:questionEntity,数据类型:字符串序列 - 特征名称:graph,数据类型:字符串 - 特征名称:correct,数据类型:64位浮点数 - 特征名称:no_highlighted_determ_sequence,数据类型:字符串 - 特征名称:no_highlighted_determ_sequence_embedding,数据类型:字符串 - 特征名称:highlighted_determ_sequence,数据类型:字符串 - 特征名称:highlighted_determ_sequence_embedding,数据类型:字符串 - 特征名称:question_answer_embedding,数据类型:字符串 数据拆分: - 拆分名称:train,字节数:1300419788,样本数量:42704 - 拆分名称:test,字节数:220081998,样本数量:7239 下载大小:809052508 数据集总大小:1520501786 - 配置名称:mkqa_t5xlssm_outputs 特征列表: - 特征名称:question,数据类型:字符串 - 特征名称:target,数据类型:字符串 - 特征名称:answer_0,数据类型:字符串 - 特征名称:answer_1,数据类型:字符串 - 特征名称:answer_2,数据类型:字符串 - 特征名称:answer_3,数据类型:字符串 - 特征名称:answer_4,数据类型:字符串 - 特征名称:answer_5,数据类型:字符串 - 特征名称:answer_6,数据类型:字符串 - 特征名称:answer_7,数据类型:字符串 - 特征名称:answer_8,数据类型:字符串 - 特征名称:answer_9,数据类型:字符串 - 特征名称:answer_10,数据类型:字符串 - 特征名称:answer_11,数据类型:字符串 - 特征名称:answer_12,数据类型:字符串 - 特征名称:answer_13,数据类型:字符串 - 特征名称:answer_14,数据类型:字符串 - 特征名称:answer_15,数据类型:字符串 - 特征名称:answer_16,数据类型:字符串 - 特征名称:answer_17,数据类型:字符串 - 特征名称:answer_18,数据类型:字符串 - 特征名称:answer_19,数据类型:字符串 - 特征名称:answer_20,数据类型:字符串 - 特征名称:answer_21,数据类型:字符串 - 特征名称:answer_22,数据类型:字符串 - 特征名称:answer_23,数据类型:字符串 - 特征名称:answer_24,数据类型:字符串 - 特征名称:answer_25,数据类型:字符串 - 特征名称:answer_26,数据类型:字符串 - 特征名称:answer_27,数据类型:字符串 - 特征名称:answer_28,数据类型:字符串 - 特征名称:answer_29,数据类型:字符串 - 特征名称:target_out_of_vocab,数据类型:布尔值 数据拆分: - 拆分名称:train,字节数:1874077,样本数量:2237 - 拆分名称:test,字节数:226790,样本数量:281 下载大小:1556610 数据集总大小:2100867 - 配置名称:mkqa_t5xlssm_subgraphs 特征列表: - 特征名称:question,数据类型:字符串 - 特征名称:question_answer,数据类型:字符串 - 特征名称:num_nodes,数据类型:64位整数 - 特征名称:num_edges,数据类型:64位整数 - 特征名称:density,数据类型:64位浮点数 - 特征名称:cycle,数据类型:64位整数 - 特征名称:bridge,数据类型:64位整数 - 特征名称:katz_centrality,数据类型:64位浮点数(卡茨中心性,Katz Centrality) - 特征名称:page_rank,数据类型:64位浮点数(PageRank) - 特征名称:avg_ssp_length,数据类型:64位浮点数(平均最短路径长度,Average Shortest Path Length) - 特征名称:answerEntity,数据类型:字符串序列 - 特征名称:groundTruthAnswerEntity,数据类型:字符串序列 - 特征名称:questionEntity,数据类型:字符串序列 - 特征名称:graph,数据类型:字符串 - 特征名称:correct,数据类型:64位浮点数 - 特征名称:no_highlighted_determ_sequence,数据类型:字符串 - 特征名称:no_highlighted_determ_sequence_embedding,数据类型:字符串 - 特征名称:highlighted_determ_sequence,数据类型:字符串 - 特征名称:highlighted_determ_sequence_embedding,数据类型:字符串 - 特征名称:question_answer_embedding,数据类型:字符串 数据拆分: - 拆分名称:train,字节数:976233063,样本数量:32173 - 拆分名称:test,字节数:125665572,样本数量:4138 下载大小:607719690 数据集总大小:1101898635 配置项: - 配置名称:mkqa_t5large_subgraphs 数据文件: - 拆分:test,路径:mkqa_t5large_subgraphs/test-* - 配置名称:mkqa_t5largessm_outputs 数据文件: - 拆分:train,路径:mkqa_t5largessm_outputs/train-* - 拆分:test,路径:mkqa_t5largessm_outputs/test-* - 配置名称:mkqa_t5largessm_subgraphs 数据文件: - 拆分:train,路径:mkqa_t5largessm_subgraphs/train-* - 拆分:test,路径:mkqa_t5largessm_subgraphs/test-* - 配置名称:mkqa_t5xlssm_outputs 数据文件: - 拆分:train,路径:mkqa_t5xlssm_outputs/train-* - 拆分:test,路径:mkqa_t5xlssm_outputs/test-* - 配置名称:mkqa_t5xlssm_subgraphs 数据文件: - 拆分:train,路径:mkqa_t5xlssm_subgraphs/train-* - 拆分:test,路径:mkqa_t5xlssm_subgraphs/test-* ---
提供机构:
s-nlp
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作