s-nlp/KGQASubgraphsRanking
Source: Hugging Face | Updated: 2025-02-23 | Indexed: 2024-06-12
Download link:
https://hf-mirror.com/datasets/s-nlp/KGQASubgraphsRanking
Description:
The dataset comprises multiple configurations, each with its own features and splits. The main features include questions, answers, entities, graph structures, sequences, and embedding vectors. Each configuration is split into training, validation, and test sets for training and evaluating machine learning models.
Provider:
s-nlp
Summary of original information
Dataset overview
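Dataset configuration: mistral_outputs
- Features:
  - id: string
  - question: string
  - target: string
  - answer_0 through answer_49: string
- Splits:
  - train: 14000 examples, 119187546 bytes
  - test: 4000 examples, 12836757 bytes
  - validation: 2000 examples, 14022974 bytes
- Download size: 74850978 bytes
- Dataset size: 146047277 bytes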
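The candidate-answer columns of this configuration follow a numbered naming pattern, so they can be generated rather than typed out. The sketch below is a minimal illustration, not the dataset authors' code: the helper names `answer_columns`, `candidates`, and `load_outputs` are hypothetical, and `load_outputs` assumes the third-party `datasets` package is installed and that the repository and configuration names on the Hub match the ones listed on this page.

```python
def answer_columns(n_candidates: int = 50) -> list[str]:
    """Return the column names answer_0 .. answer_{n_candidates - 1}."""
    return [f"answer_{i}" for i in range(n_candidates)]


def candidates(row: dict, n_candidates: int = 50) -> list[str]:
    """Collect the candidate answers of one row, in column order."""
    return [row[name] for name in answer_columns(n_candidates)]


def load_outputs(config: str = "mistral_outputs", split: str = "test"):
    """Load one split of one configuration from the Hugging Face Hub.

    Assumption: the `datasets` package is installed and network access
    is available. The import is lazy so the helpers above work offline.
    """
    from datasets import load_dataset

    return load_dataset("s-nlp/KGQASubgraphsRanking", config, split=split)
```

For the configurations that list a different number of candidates (for example answer_0 through answer_199), pass the matching `n_candidates` value.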
Dataset configuration: mistral_subgraphs
- Features:
  - index: int64
  - id: string
  - question: string
  - answerEntity: string
  - groundTruthAnswerEntity: string
  - questionEntity: string
  - complexityType: string
  - graph: string
  - correct: boolean
  - t5_sequence through question_answer_embedding: string or sequence of floats
- Splits:
  - train: 32757 examples, 1369797466 bytes
  - validation: 32757 examples, 1369797466 bytes
  - test: 9749 examples, 407425784 bytes
- Download size: 1809811773 bytes
- Dataset size: 3147020716 bytes
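As a rough illustration of how the `id`, `answerEntity`, and `correct` fields might be used together, the sketch below groups rows by question and keeps the candidates flagged as correct. It rests on two assumptions that this page does not confirm: that each row describes one (question, candidate answer) pair, and that `correct` marks whether that candidate is right. The function name and the toy rows are invented for illustration.

```python
from collections import defaultdict


def correct_answers_by_question(rows):
    """Map each question id to the answerEntity values whose `correct` flag is True.

    Questions without any correct candidate do not appear in the result.
    """
    grouped = defaultdict(list)
    for row in rows:
        if row["correct"]:
            grouped[row["id"]].append(row["answerEntity"])
    return dict(grouped)


# Toy rows: field names come from the feature list, values are made up.
toy_rows = [
    {"id": "q1", "answerEntity": "entity_a", "correct": False},
    {"id": "q1", "answerEntity": "entity_b", "correct": True},
    {"id": "q2", "answerEntity": "entity_c", "correct": False},
]

print(correct_answers_by_question(toy_rows))  # prints {'q1': ['entity_b']}
```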
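Dataset configuration: mixtral_outputs
- Features:
  - id: string
  - question: string
  - target: string
  - answer_0 through answer_49: string
- Splits:
  - train: 14000 examples, 119187546 bytes
  - test: 4000 examples, 34086529 bytes
  - validation: 2000 examples, 17181713 bytes
- Download size: 88750560 bytes
- Dataset size: 170455788 bytes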
Dataset configuration: mixtral_subgraphs
- Features:
  - index: int64
  - id: string
  - question: string
  - answerEntity: string
  - groundTruthAnswerEntity: string
  - questionEntity: string
  - complexityType: string
  - graph: string
  - correct: boolean
  - t5_sequence through question_answer_embedding: string or sequence of floats
- Splits:
  - train: 32757 examples, 1369797466 bytes
  - validation: 32757 examples, 1369797466 bytes
  - test: 9749 examples, 407425784 bytes
- Download size: 1809811773 bytes
- Dataset size: 3147020716 bytes
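Dataset configuration: t5largessm_outputs
- Features:
  - question: string
  - target: string
  - answer_0 through answer_199: string
  - target_out_of_vocab: boolean
- Splits:
  - train: 16000 examples, 55937176 bytes
  - validation: 2000 examples, 7843856 bytes
  - test: 4000 examples, 13934904 bytes
- Download size: 52544514 bytes
- Dataset size: 77715936 bytes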
Dataset configuration: t5largessm_subgraphs
- Features:
  - id: string
  - question: string
  - answerEntity: string
  - questionEntity: string
  - groundTruthAnswerEntity: string
  - complexityType: string
  - graph: string
  - correct: boolean
  - t5_sequence through question_answer_embedding: string or sequence of floats
- Splits:
  - train: 65402 examples, 1776110070 bytes
  - validation: 65402 examples, 1776110070 bytes
  - test: 16567 examples, 449834999 bytes
- Download size: 3873022126 bytes
- Dataset size: 4002055139 bytes
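Dataset configuration: t5xlssm_outputs
- Features:
  - question: string
  - target: string
  - answer_0 through answer_56: string
- Splits:
  - train: 16000 examples, 55937176 bytes
  - validation: 2000 examples, 7843856 bytes
  - test: 4000 examples, 13934904 bytes
- Download size: 52544514 bytes
- Dataset size: 77715936 bytes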
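A quick arithmetic check on the figures listed for each configuration: the per-split byte counts should add up to the reported dataset size. The sketch below copies the numbers from this page and reports any configuration where they disagree. The names `CONFIGS` and `size_mismatches` are hypothetical helpers, not part of the dataset.

```python
# (train, validation, test) split sizes and reported dataset size, in bytes,
# copied from the listing on this page.
CONFIGS = {
    "mistral_outputs": ((119187546, 14022974, 12836757), 146047277),
    "mistral_subgraphs": ((1369797466, 1369797466, 407425784), 3147020716),
    "mixtral_outputs": ((119187546, 17181713, 34086529), 170455788),
    "mixtral_subgraphs": ((1369797466, 1369797466, 407425784), 3147020716),
    "t5largessm_outputs": ((55937176, 7843856, 13934904), 77715936),
    "t5largessm_subgraphs": ((1776110070, 1776110070, 449834999), 4002055139),
    "t5xlssm_outputs": ((55937176, 7843856, 13934904), 77715936),
}


def size_mismatches(configs):
    """Return {config: (sum_of_splits, reported_total)} where the two disagree."""
    return {
        name: (sum(splits), total)
        for name, (splits, total) in configs.items()
        if sum(splits) != total
    }


print(size_mismatches(CONFIGS))  # prints {}
```

The empty result means the listed split sizes are consistent with the listed dataset sizes for all seven configurations. The download sizes are separate figures and are not expected to match these sums.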



