hle2000/KGQA_Mistral
收藏Hugging Face2024-04-23 更新2024-06-22 收录
下载链接:
https://hf-mirror.com/datasets/hle2000/KGQA_Mistral
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: index
dtype: int64
- name: id
dtype: string
- name: question
dtype: string
- name: answerEntity
dtype: string
- name: groundTruthAnswerEntity
dtype: string
- name: questionEntity
dtype: string
- name: complexityType
dtype: string
- name: graph
dtype: string
- name: correct
dtype: bool
- name: t5_sequence
dtype: string
- name: gap_sequence
dtype: string
- name: highlighted_t5_sequence
dtype: string
- name: no_highlighted_t5_sequence
dtype: string
- name: highlighted_gap_sequence
dtype: string
- name: no_highlighted_gap_sequence
dtype: string
- name: highlighted_determ_sequence
dtype: string
- name: no_highlighted_determ_sequence
dtype: string
- name: question_answer
dtype: string
- name: num_nodes
dtype: int64
- name: num_edges
dtype: int64
- name: density
dtype: float64
- name: cycle
dtype: int64
- name: bridge
dtype: int64
- name: katz_centrality
dtype: float64
- name: page_rank
dtype: float64
- name: avg_ssp_length
dtype: float64
- name: determ_sequence
dtype: string
- name: determ_sequence_embedding
dtype: string
- name: gap_sequence_embedding
dtype: string
- name: t5_sequence_embedding
dtype: string
- name: question_answer_embedding
dtype: string
splits:
- name: train
num_bytes: 1369797466
num_examples: 32757
- name: validation
num_bytes: 1369797466
num_examples: 32757
- name: test
num_bytes: 407425784
num_examples: 9749
download_size: 1809812397
dataset_size: 3147020716
---
# Dataset Card for "KGQA_Mistral"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
hle2000
原始信息汇总
数据集概述
数据集特征
- index: 数据类型为
int64 - id: 数据类型为
string - question: 数据类型为
string - answerEntity: 数据类型为
string - groundTruthAnswerEntity: 数据类型为
string - questionEntity: 数据类型为
string - complexityType: 数据类型为
string - graph: 数据类型为
string - correct: 数据类型为
bool - t5_sequence: 数据类型为
string - gap_sequence: 数据类型为
string - highlighted_t5_sequence: 数据类型为
string - no_highlighted_t5_sequence: 数据类型为
string - highlighted_gap_sequence: 数据类型为
string - no_highlighted_gap_sequence: 数据类型为
string - highlighted_determ_sequence: 数据类型为
string - no_highlighted_determ_sequence: 数据类型为
string - question_answer: 数据类型为
string - num_nodes: 数据类型为
int64 - num_edges: 数据类型为
int64 - density: 数据类型为
float64 - cycle: 数据类型为
int64 - bridge: 数据类型为
int64 - katz_centrality: 数据类型为
float64 - page_rank: 数据类型为
float64 - avg_ssp_length: 数据类型为
float64 - determ_sequence: 数据类型为
string - determ_sequence_embedding: 数据类型为
string - gap_sequence_embedding: 数据类型为
string - t5_sequence_embedding: 数据类型为
string - question_answer_embedding: 数据类型为
string
数据集分割
- train: 包含 32757 个样本,总字节数为 1369797466
- validation: 包含 32757 个样本,总字节数为 1369797466
- test: 包含 9749 个样本,总字节数为 407425784
数据集大小
- 下载大小: 1809812397 字节
- 数据集大小: 3147020716 字节



