Sigurdur/is-trivia-questions
收藏Hugging Face2024-07-14 更新2024-07-22 收录
下载链接:
https://hf-mirror.com/datasets/Sigurdur/is-trivia-questions
下载链接
链接失效反馈官方服务:
资源简介:
该数据集名为冰岛语问答数据集,包含冰岛语的问答数据,主要用于问答、文本生成和文本分类任务。数据集包含训练集和测试集,分别有9288和2322个样本。特征包括类别ID、子类别、难度、质量、问题和答案。数据集的许可证为cc,语言为冰岛语,数据集大小在10K到100K之间。
The Icelandic trivia questions dataset includes multiple features such as class ID, subclass, difficulty, quality, question, and answer, all of which are either string or numeric types. The dataset is divided into a training set and a test set, containing 9288 and 2322 samples respectively. It is primarily used for question-answering, text generation, and text classification tasks, with the language being Icelandic. The dataset is named Icelandic trivia questions, created by Sveinn Steinarsson and others.
提供机构:
Sigurdur
原始信息汇总
数据集概述
数据集名称
Icelandic trivia questions
数据集描述
该数据集包含冰岛语的问答题目,由Sveinn Steinarsson, Valur Freyr Steinarsson, 和 Svavar Kjarrval编译和创建。
数据集特征
- class_id: 类别ID,数据类型为int64
- subclass: 子类别,数据类型为string
- difficulty: 难度级别,数据类型为int64,取值范围为1(简单)到3(困难)
- quality: 质量级别,数据类型为float64,取值范围为1(低)到3(高)
- question: 问题,数据类型为string
- answer: 答案,数据类型为string
数据集分割
- train: 训练集,包含9288个样本,大小为1018849.6字节
- test: 测试集,包含2322个样本,大小为254712.4字节
数据集大小
- 下载大小: 625021字节
- 数据集总大小: 1273562字节
数据集配置
- default: 默认配置,包含训练集和测试集的数据文件路径
许可证
cc
任务类别
- 问答
- 文本生成
- 文本分类
语言
冰岛语
数据集规模
10K<n<100K



