five

Dataset for Generation of multiple true false questions

收藏
Zenodo2022-11-08 更新2026-05-25 收录
下载链接:
https://zenodo.org/record/7303300
下载链接
链接失效反馈
官方服务:
资源简介:
<strong>Generation of multiple true-false questions</strong> This project provides a Natural Language Pipeline for processing German Textbook sections as an input generating Multiple True-False Questions using GPT2. Assessments are an important part of the learning cycle and enable the development and promotion of competencies. However, the manual creation of assessments is very time-consuming. Therefore, the number of tasks in learning systems is often limited. In this repository, we provide an algorithm that can automatically generate an arbitrary number of German True False statements from a textbook using the GPT-2 model. The algorithm was evaluated with a selection of textbook chapters from four academic disciplines (see `data` folder) and rated by individual domain experts. One-third of the generated MTF Questions are suitable for learning. The algorithm provides instructors with an easier way to create assessments on chapters of textbooks to test factual knowledge. As a type of Multiple-Choice question, Multiple True False (MTF) Questions are, among other question types, a simple and efficient way to objectively test factual knowledge. The learner is challenged to distinguish between true and false statements. MTF questions can be presented differently, e.g. by locating a true statement from a series of false statements, identifying false statements among a list of true statements, or separately evaluating each statement as either true or false. Learners must evaluate each statement individually because a question stem can contain both incorrect and correct statements. Thus, MTF Questions as a machine-gradable format have the potential to identify learners’ misconceptions and knowledge gaps. Example MTF question: Check the correct statements: [ ] All trees have green leafs. [ ] Trees grow towards the sky. [ ] Leafes can fall from a tree. <strong>Features</strong> - generation of false statements - automatic selection of true statements - selection of an arbitrary similarity for true and false statements as well as the number of false statements - generating false statements by adding or deleting negations as well as using a german gpt2 <strong>Setup</strong> <strong>Installation</strong> 1. Create a new environment: `conda create -n mtfenv python=3.9` 2. Activate the environment: `conda activate mtfenv` 3. Install dependencies using anaconda: ``` conda install -y -c conda-forge pdfplumber conda install -y -c conda-forge nltk conda install -y -c conda-forge pypdf2 conda install -y -c conda-forge pylatexenc conda install -y -c conda-forge packaging conda install -y -c conda-forge transformers conda install -y -c conda-forge essential_generators conda install -y -c conda-forge xlsxwriter ``` 3. Download spacy: `python3.9 -m spacy download de_core_news_lg` <strong>Getting started</strong> After installation, you can execute the bash script `bash run.sh` in the terminal to compile MTF questions for the provided textbook chapters. To create MTF questions for your own texts use the following command: `python3 main.py --answers 1 --similarity 0.66 --input ./&lt;path&gt;/&lt;to&gt;/&lt;your&gt;/&lt;textbook&gt;.txt` The parameter `answers` indicates how many false answers should be generated. By configuring the parameter `similarity` you can determine what portion of a sentence should remain the same. The remaining portion will be extracted and used to generate a false part of the sentence. <strong>## History and roadmap </strong> * Outlook third iteration: Automatic augmentation of text chapters with generated questions * Second iteration: Generation of multiple true-false questions with improved text summarizer and German GPT2 sentence generator * First iteration: Generation of multiple true false questions in the Bachelor thesis of Mirjam Wiemeler <strong>Publications, citations, license</strong> <strong>Publications</strong> Kasakowskij, R., Kasakowskij, T. &amp; Seidel, N., (2022). Generation of Multiple True False Questions. In: Henning, P. A., Striewe, M. &amp; Wölfel, M. (Hrsg.), 20. Fachtagung Bildungstechnologien (DELFI). Bonn: Gesellschaft für Informatik e.V.. (S. 147-152). DOI: [10.18420/delfi2022-026](https://dl.gi.de/handle/20.500.12116/38826) <strong>Citation of the Dataset</strong> Kasakowskij, R., Kasakowskij, T., &amp; Seidel, N. (2022). <em>Dataset for Generation of multiple true false questions</em>. Zenodo. https://doi.org/10.5281/zenodo.7303300 The source code and data are maintained at GitHub: https://github.com/D2L2/multiple-true-false-question-generation <strong>Contact</strong> Regina Kasakowskij (M.A.) - regina.kasakowskij@fernuni-hagen.de Dr. Niels Seidel - niels.seidel@fernuni-hagen.de <strong>License</strong> Distributed under the MIT License. See [LICENSE.txt](https://gitlab.pi6.fernuni-hagen.de/la-diva/adaptive-assessment/generationofmultipletruefalsequestions/-/blob/master/LICENSE.txt) for more information. <strong>Acknowledgments</strong> This research was supported by CATALPA - Center of Advanced Technology for Assisted Learning and Predictive Analytics of the FernUniversität in Hagen, Germany. This project was carried out as part of research in the CATALPA project [LA DIVA](https://www.fernuni-hagen.de/forschung/schwerpunkte/catalpa/forschung/projekte/la-diva.shtml)
提供机构:
Zenodo
创建时间:
2022-11-08
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作