azrilhafizi/tatabahasa
收藏Hugging Face2024-05-18 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/azrilhafizi/tatabahasa
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
task_categories:
- question-answering
language:
- ms
size_categories:
- 1K<n<10K
---
## Overview
This dataset contains over 1,000 questions and answers focused on Malay language grammar (tatabahasa), tailored for primary school students aged 7 to 12. The dataset is designed to aid in educational research, language learning applications, and to benchmark the performance of Large Language Models (LLMs) on Malay language understanding and processing.
## Dataset Description
The dataset is organized in JSON Lines (JSONL) format, where each line is a JSON object representing a single question. Each JSON object contains the following fields:
- instruction: Instructions for the question.
- question: The question.
- choices: A dictionary of answer choices, where each key is an option identifier (e.g., "A", "B") and the value is a dictionary containing:
- text: The text of the answer choice.
- answer: A boolean indicating whether the choice is the correct answer.
提供机构:
azrilhafizi
原始信息汇总
数据集概述
- 语言:马来语(ms)
- 任务类别:问答
- 大小类别:1K<n<10K
- 内容:包含超过1,000个针对7至12岁小学生的马来语语法(tatabahasa)问题和答案。
- 用途:支持教育研究、语言学习应用及评估大型语言模型(LLMs)在马来语理解和处理上的性能。
数据集描述
- 格式:JSON Lines(JSONL)
- 结构:每行代表一个单独的问题,包含以下字段:
- instruction:问题说明。
- question:问题本身。
- choices:答案选项,每个键为选项标识(如"A", "B"),值为包含以下内容的字典:
- text:答案选项文本。
- answer:布尔值,指示该选项是否为正确答案。



