A Dataset of Prolog Submissions for Feedback Research: 7201 Programs and 200 Manual Annotations
收藏Figshare2025-08-13 更新2026-04-08 收录
下载链接:
https://figshare.com/articles/dataset/A_Dataset_of_Prolog_Submissions_for_Feedback_Research_7201_Programs_and_200_Manual_Annotations/29899583/1
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains versioned Prolog program submissions from students, annotated with debugging and test result metadata. It is intended for research on automated feedback, bug fixing, program repair, and learning analytics in logic programming education.Files Descriptionprograms.jsonl.gz contains 7201 Prolog submissionsprograms_sample.json contains a sample of 10 entries from programs.jsonl.gz in human readable formatprograms_annotated.jsonl contains 200 programs from programs.jsonl.gz with additional manually created annotationsprograms_annotated_sample.json contains a sample of 10 entries from programs_annotated.jsonl in human readable formatDataset StructureEach dataset entry in programs.jsonl.gz is a JSON object with the following fields:<b>student_id</b>: Anonymized integer identifier for the student.<b>sequence_id</b>: Integer indicating the submission order for a given assignment.<b>time</b>: UNIX timestamp of the submission.<b>assignment</b>: Path or identifier of the assignment.<b>assignment_group</b>: Higher-level grouping of assignments (e.g. "labs").<b>program</b>: The student's submitted Prolog program.<b>previous_submission</b>: The student's immediately prior submission for the same assignment.<b>previous_tests_passed</b>: List of test cases passed in the previous submission.<b>passed_tests</b>: List of test cases passed by the current submission.<b>failed_tests</b>: List of test cases that failed in the current submission.<b>tests_passed_count</b>: Count of passed tests.<b>total_tests_count</b>: Total number of tests executed.<b>correct</b>: Boolean indicating whether the submission passes all tests.<b>category</b>: Label describing the nature of the change (e.g. <code>"BUGFIX_CORRECT"</code>).<b>diff</b>: A diff string showing changes between the previous and current submission.<b>parsed</b>: Normalized version of the current program.<b>syntax_error</b>: Syntax error message if present (else <code>null</code>).<b>interpreter_syntax_error</b>: Syntax errors raised by the Prolog interpreter (if any).<b>is_syntax_fix</b>: Boolean indicating if the fix resolved a syntax issue.<b>predicates</b>: List of predicates defined in the current program.<b>modified_predicates</b>: Predicates modified since the previous submission.<b>modified_predicates_count</b>: Number of predicates modified.<b>modified_predicates_fraction</b>: Fraction of modified predicates relative to total.<b>clauses</b>: Total number of clauses in the program.<b>modified_clauses</b>: Number of modified clauses.<b>modified_clauses_fraction</b>: Fraction of modified clauses.Additionally, the entries in programs_annotated.jsonl also have the following two fields:<b>bugfix_labels</b>: List of labeled bug categories identified in the submission (e.g., <code>"CUT_ISSUE:MISSING"</code>).<b>minimal_change</b>: Boolean indicating whether the fix involved minimal edits.<br>
提供机构:
Brancas, Ricardo; Da Cunha Carreira, Carolina; Orvalho, Pedro; Goncalves Martins, Ruben Carlos; Manquinho, Vasco
创建时间:
2025-08-13



