
hynky/klokan-qa

Source: Hugging Face. Updated 2024-05-06. Indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/hynky/klokan-qa
Resource description:
---
language:
- cs
license: cc
size_categories:
- n<1K
task_categories:
- question-answering
pretty_name: KLOKAN - Czech mathematical dataset
dataset_info:
- config_name: balanced
  features:
  - name: question
    dtype: string
  - name: correct_answer
    dtype: string
  - name: category
    dtype: int64
  - name: year
    dtype: int64
  - name: A
    dtype: string
  - name: B
    dtype: string
  - name: C
    dtype: string
  - name: D
    dtype: string
  - name: E
    dtype: string
  splits:
  - name: test
    num_bytes: 206683
    num_examples: 807
  - name: train
    num_bytes: 208050
    num_examples: 813
  download_size: 256943
  dataset_size: 414733
- config_name: default
  features:
  - name: question
    dtype: string
  - name: correct_answer
    dtype: string
  - name: category
    dtype: int64
  - name: year
    dtype: int64
  - name: A
    dtype: string
  - name: B
    dtype: string
  - name: C
    dtype: string
  - name: D
    dtype: string
  - name: E
    dtype: string
  splits:
  - name: train
    num_bytes: 208050
    num_examples: 813
  download_size: 125899
  dataset_size: 208050
- config_name: unbalanced
  features:
  - name: question
    dtype: string
  - name: correct_answer
    dtype: string
  - name: category
    dtype: int64
  - name: year
    dtype: int64
  - name: A
    dtype: string
  - name: B
    dtype: string
  - name: C
    dtype: string
  - name: D
    dtype: string
  - name: E
    dtype: string
  splits:
  - name: test
    num_bytes: 206683
    num_examples: 807
  - name: train
    num_bytes: 208050
    num_examples: 813
  download_size: 255194
  dataset_size: 414733
configs:
- config_name: balanced
  data_files:
  - split: train
    path: balanced/train-*
  - split: test
    path: balanced/test-*
- config_name: default
  data_files:
  - split: train
    path: data/train-*
- config_name: unbalanced
  data_files:
  - split: train
    path: unbalanced/train-*
  - split: test
    path: unbalanced/test-*
---

- The dataset was gathered from assignments of the [Klokánek](https://matematickyklokan.net/) competition from 2004 to 2022.
- I applied rule-based filtering to remove picture-related assignments.
- The category denotes the difficulty of the task, ranging from elementary school to high school. Check an example test from Klokánek for more information.
- If you find an error in a solution, or find that an assignment is unsolvable, please contact me.
- If you have any questions, please contact me at kydlicek.hynek@gmail.com.
- The dataset is released under the non-commercial licence CC BY-NC-SA.

Cite:

```
@misc{klokanek-dataset,
  author = {Hynek Kydlíček and David Nocar and others},
  title = {Klokánek dataset},
  year = {2023},
  publisher = {Hynek Kydlíček},
  doi = {10.57967/hf/1608},
  url = {https://matematickyklokan.net/},
  howpublished = "\url{https://huggingface.co/datasets/hynky/klokan-qa}"
}
```
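Each record carries a `question`, five options `A` to `E`, and a `correct_answer`. The following is a minimal sketch of turning one record into a multiple-choice prompt. The sample row is an invented placeholder that only mirrors the schema and is not taken from the dataset, and the names `format_question` and `is_correct` are hypothetical. The card does not say whether `correct_answer` stores an option letter or the answer text, so the checker assumes it may be either.

```python
OPTION_KEYS = ("A", "B", "C", "D", "E")


def format_question(row):
    """Render one record as a multiple-choice prompt, one option per line."""
    lines = [row["question"]]
    for key in OPTION_KEYS:
        lines.append(f"{key}) {row[key]}")
    return "\n".join(lines)


def is_correct(row, predicted_letter):
    """Compare a predicted option letter with the stored answer.

    Assumption: correct_answer is either an option letter (A-E)
    or the text of the correct option; the card does not specify.
    """
    answer = row["correct_answer"].strip()
    letter = predicted_letter.strip().upper()
    if letter not in OPTION_KEYS:
        return False
    return answer.upper() == letter or answer == row[letter].strip()


# Invented placeholder row; values are NOT from the dataset.
sample_row = {
    "question": "Kolik je 2 + 3?",
    "correct_answer": "C",
    "category": 0,
    "year": 2004,
    "A": "3", "B": "4", "C": "5", "D": "6", "E": "7",
}

print(format_question(sample_row))
print(is_correct(sample_row, "c"))
```

With the `datasets` library installed, real rows could presumably be obtained with `load_dataset("hynky/klokan-qa", "balanced", split="test")` and passed to `format_question` one at a time.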
Provider:
hynky
Summary of Original Information

KLOKAN - Czech Mathematical Dataset

Basic Information

  • Language: Czech
  • License: CC
  • Size category: n<1K
  • Task category: question answering
  • Dataset name: KLOKAN - Czech mathematical dataset

Configuration Information

Config: balanced

  • Features:
    • question: string
    • correct_answer: string
    • category: int64
    • year: int64
    • A: string
    • B: string
    • C: string
    • D: string
    • E: string
  • Splits:
    • test: 206683 bytes, 807 examples
    • train: 208050 bytes, 813 examples
  • Download size: 256943 bytes
  • Dataset size: 414733 bytes

Config: default

  • Features:
    • question: string
    • correct_answer: string
    • category: int64
    • year: int64
    • A: string
    • B: string
    • C: string
    • D: string
    • E: string
  • Splits:
    • train: 208050 bytes, 813 examples
  • Download size: 125899 bytes
  • Dataset size: 208050 bytes

Config: unbalanced

  • Features:
    • question: string
    • correct_answer: string
    • category: int64
    • year: int64
    • A: string
    • B: string
    • C: string
    • D: string
    • E: string
  • Splits:
    • test: 206683 bytes, 807 examples
    • train: 208050 bytes, 813 examples
  • Download size: 255194 bytes
  • Dataset size: 414733 bytes
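The reported dataset size of each configuration should equal the sum of its split sizes. The sketch below checks that arithmetic using only the byte counts listed above; the dictionary layout and the function name `size_mismatches` are illustrative choices, not part of the dataset's tooling.

```python
# Split sizes and dataset sizes in bytes, copied from the configuration listing.
REPORTED = {
    "balanced": {
        "splits": {"test": 206683, "train": 208050},
        "dataset_size": 414733,
    },
    "default": {
        "splits": {"train": 208050},
        "dataset_size": 208050,
    },
    "unbalanced": {
        "splits": {"test": 206683, "train": 208050},
        "dataset_size": 414733,
    },
}


def size_mismatches(reported):
    """Return the config names whose split bytes do not sum to dataset_size."""
    return [
        name
        for name, info in reported.items()
        if sum(info["splits"].values()) != info["dataset_size"]
    ]


print(size_mismatches(REPORTED))  # prints [] when every config is consistent
```

An empty list means the three configurations are internally consistent: 206683 + 208050 = 414733 for `balanced` and `unbalanced`, and the single `train` split accounts for all of `default`.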

Data File Configuration

Config: balanced

  • Data files:
    • train: balanced/train-*
    • test: balanced/test-*

Config: default

  • Data files:
    • train: data/train-*

Config: unbalanced

  • Data files:
    • train: unbalanced/train-*
    • test: unbalanced/test-*
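The data file entries are glob patterns that map files in the repository to a configuration and split. As a rough illustration, the sketch below resolves a file path against those patterns with Python's standard `fnmatch` module. This is a stand-in, since the Hub's own pattern resolution may behave differently, and the shard file name used in the example is hypothetical.

```python
from fnmatch import fnmatchcase

# Glob patterns per config and split, copied from the data file configuration.
DATA_FILES = {
    "balanced": {"train": "balanced/train-*", "test": "balanced/test-*"},
    "default": {"train": "data/train-*"},
    "unbalanced": {"train": "unbalanced/train-*", "test": "unbalanced/test-*"},
}


def resolve(path):
    """Return the (config, split) pairs whose pattern matches the given path."""
    return [
        (config, split)
        for config, splits in DATA_FILES.items()
        for split, pattern in splits.items()
        if fnmatchcase(path, pattern)
    ]


# Hypothetical shard name; the real file names in the repository may differ.
print(resolve("balanced/train-00000-of-00001.parquet"))
```

Because the whole path must match the pattern, a file under `unbalanced/` does not match the `balanced/` patterns even though one name is a suffix of the other.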