hynky/klokan-qa
Source: Hugging Face. Updated 2024-05-06, indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/hynky/klokan-qa
Resource description:
---
language:
- cs
license: cc
size_categories:
- n<1K
task_categories:
- question-answering
pretty_name: KLOKAN - Czech mathematical dataset
dataset_info:
- config_name: balanced
  features:
  - name: question
    dtype: string
  - name: correct_answer
    dtype: string
  - name: category
    dtype: int64
  - name: year
    dtype: int64
  - name: A
    dtype: string
  - name: B
    dtype: string
  - name: C
    dtype: string
  - name: D
    dtype: string
  - name: E
    dtype: string
  splits:
  - name: test
    num_bytes: 206683
    num_examples: 807
  - name: train
    num_bytes: 208050
    num_examples: 813
  download_size: 256943
  dataset_size: 414733
- config_name: default
  features:
  - name: question
    dtype: string
  - name: correct_answer
    dtype: string
  - name: category
    dtype: int64
  - name: year
    dtype: int64
  - name: A
    dtype: string
  - name: B
    dtype: string
  - name: C
    dtype: string
  - name: D
    dtype: string
  - name: E
    dtype: string
  splits:
  - name: train
    num_bytes: 208050
    num_examples: 813
  download_size: 125899
  dataset_size: 208050
- config_name: unbalanced
  features:
  - name: question
    dtype: string
  - name: correct_answer
    dtype: string
  - name: category
    dtype: int64
  - name: year
    dtype: int64
  - name: A
    dtype: string
  - name: B
    dtype: string
  - name: C
    dtype: string
  - name: D
    dtype: string
  - name: E
    dtype: string
  splits:
  - name: test
    num_bytes: 206683
    num_examples: 807
  - name: train
    num_bytes: 208050
    num_examples: 813
  download_size: 255194
  dataset_size: 414733
configs:
- config_name: balanced
  data_files:
  - split: train
    path: balanced/train-*
  - split: test
    path: balanced/test-*
- config_name: default
  data_files:
  - split: train
    path: data/train-*
- config_name: unbalanced
  data_files:
  - split: train
    path: unbalanced/train-*
  - split: test
    path: unbalanced/test-*
---
- The dataset was gathered from assignments of the [Klokánek](https://matematickyklokan.net/) competition from 2004 to 2022.
- I applied rule-based filtering to remove picture-related assignments.
- The category denotes the difficulty of the task, ranging from elementary school to high school. Check an example test from Klokánek for more information.
- If you find an error in a solution or find that an assignment is unsolvable, please contact me.
- If you have any questions, please contact me at kydlicek.hynek@gmail.com
- The dataset is released under the non-commercial license CC BY-NC-SA.
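
A row can be turned into a multiple-choice prompt directly from the fields listed in the metadata (`question`, options `A` to `E`, `correct_answer`). The following is a minimal sketch, not the author's evaluation code. It assumes that `correct_answer` stores the letter of the correct option, which the card does not state, and the example row is invented for illustration (in English rather than Czech).

```python
from typing import Dict

# Option columns as named in the dataset features.
OPTION_KEYS = ("A", "B", "C", "D", "E")


def format_prompt(row: Dict[str, object]) -> str:
    """Render one row as a plain-text multiple-choice prompt."""
    lines = [str(row["question"]).strip()]
    for key in OPTION_KEYS:
        lines.append(f"{key}) {row[key]}")
    lines.append("Answer:")
    return "\n".join(lines)


def is_correct(row: Dict[str, object], predicted_letter: str) -> bool:
    """Compare a predicted option letter with the stored answer.

    ASSUMPTION: `correct_answer` holds an option letter (A-E). If it holds
    the answer text instead, compare against row[predicted_letter].
    """
    expected = str(row["correct_answer"]).strip().upper()
    return predicted_letter.strip().upper() == expected


# Hypothetical row, made up for this sketch; not taken from the dataset.
example_row = {
    "question": "What is 2 + 3?",
    "correct_answer": "C",
    "category": 1,
    "year": 2010,
    "A": "3",
    "B": "4",
    "C": "5",
    "D": "6",
    "E": "7",
}

prompt = format_prompt(example_row)
print(prompt)
```

Keeping the option letters in a fixed tuple makes the prompt order deterministic, which matters when comparing model outputs across runs.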
Cite:
```
@misc{klokanek-dataset,
author = {Hynek Kydlíček and David Nocar and others},
title = {Klokánek dataset},
year = {2023},
publisher = {Hynek Kydlíček},
doi = {10.57967/hf/1608},
url = {https://matematickyklokan.net/},
howpublished = "\url{https://huggingface.co/datasets/hynky/klokan-qa}"
}
```
Provider:
hynky
Original information summary
KLOKAN - Czech Mathematical Dataset
Basic information
- Language: Czech
- License: CC
- Dataset size: n<1K
- Task category: question answering
- Dataset name: KLOKAN - Czech mathematical dataset
Configuration details
Configuration: balanced
- Features:
  - question: string
  - correct_answer: string
  - category: int64
  - year: int64
  - A, B, C, D, E: string
- Splits:
  - test: 206683 bytes, 807 examples
  - train: 208050 bytes, 813 examples
- Download size: 256943 bytes
- Dataset size: 414733 bytes
Configuration: default
- Features:
  - question: string
  - correct_answer: string
  - category: int64
  - year: int64
  - A, B, C, D, E: string
- Splits:
  - train: 208050 bytes, 813 examples
- Download size: 125899 bytes
- Dataset size: 208050 bytes
Configuration: unbalanced
- Features:
  - question: string
  - correct_answer: string
  - category: int64
  - year: int64
  - A, B, C, D, E: string
- Splits:
  - test: 206683 bytes, 807 examples
  - train: 208050 bytes, 813 examples
- Download size: 255194 bytes
- Dataset size: 414733 bytes
Data file configuration
Configuration: balanced
- Data files:
  - train: balanced/train-*
  - test: balanced/test-*
Configuration: default
- Data files:
  - train: data/train-*
Configuration: unbalanced
- Data files:
  - train: unbalanced/train-*
  - test: unbalanced/test-*
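
The configurations and splits listed above can be loaded with the Hugging Face `datasets` library. The sketch below assumes that `datasets` is installed and that the repository `hynky/klokan-qa` is reachable. The helper names `check_config_split` and `load_klokan` are hypothetical, introduced only for this example.

```python
from typing import Dict, Tuple

REPO_ID = "hynky/klokan-qa"

# Splits available per configuration, copied from the card metadata.
CONFIG_SPLITS: Dict[str, Tuple[str, ...]] = {
    "balanced": ("train", "test"),
    "default": ("train",),
    "unbalanced": ("train", "test"),
}


def check_config_split(config: str, split: str) -> None:
    """Raise ValueError if the card does not list this config/split pair."""
    if config not in CONFIG_SPLITS:
        raise ValueError(f"unknown config {config!r}; expected one of {sorted(CONFIG_SPLITS)}")
    if split not in CONFIG_SPLITS[config]:
        raise ValueError(f"config {config!r} has no split {split!r}")


def load_klokan(config: str = "default", split: str = "train"):
    """Load one split of one configuration (hypothetical helper).

    Requires network access and the `datasets` package; the import is
    deferred so the validation above works without either.
    """
    check_config_split(config, split)
    from datasets import load_dataset

    return load_dataset(REPO_ID, config, split=split)


# Example usage (downloads data, so it is left commented out):
# test_set = load_klokan("balanced", "test")
# print(test_set[0]["question"])
```

Note that the `default` configuration only provides a `train` split, so evaluation code that expects a `test` split should request `balanced` or `unbalanced` explicitly.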



