five

Dataset for: Investigating the Perception and Effects of Misunderstandings in Java Code

收藏
NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://zenodo.org/record/3822522
下载链接
链接失效反馈
官方服务:
资源简介:
Dataset and scripts used for thesis: Investigating the Perception and Effects of Misunderstandings in Java Code. # Contents of this archive The data and scripts used for the Master thesis `Investigating the Perception and Effects of Misunderstandings in Java Code`, by Chris Langhout. Provided for reproducibility and openness of data. In this readme, I try to specify what files are used for what, and what files include what results.   ## folders   ### /atoms-java Includes all .java files for the code examples used in the online experiment.   ### /atoms_screenshots-java-normal Includes screenshots of all .java files. Generated by `autoScreenshot.py`.   ### /data-analysis   #### /data-analysis/atom-data - `java-answers.csv` set of computed results of snippets. Created by `javaAnswers.py`. - `new-atom-java.csv` contains atom names, in combination with their hashed counterparts. Used for combining answers back to their original questions, and generated by `autoScreenshot.py`.   #### /data-analysis/survey-data   Contains the export of the survey tool. Only in scanned `.ods` format.   #### /data-analysis/grader.ts   Used to grade the responses, and generate new datasets that are used for data analysis. Made use of the raw export of the survey response (not included for participant data protection, contact chris.langhout *at* gmail *dot* com for any other info).   ### /result-data Output folder of `grader.ts` script. - `perceptAnswers.html` is used to find out what explanations were added by participants on the perception part of the experiment. To give context, the screenshots are used. Below each atom, the set of explanations is shown, based on what answer people gave. For better readability, the custom ordering this participant saw, is also included. e.g. (2 is ob) means that the right side screenshot the participant saw was the obfuscated variant. - `atomTimes.ods` includes a spreadsheet tab `RAWresultTable` which combines the counts of occurances, correct answers, incorrect answers, answers that were corrected by the grader, and total wrong answers. For every wrong answer, the given answer is displayed. Futhermore, the counts on how much participans were certain of their answer are counted and shown, split by correct/incorrect answer. An additional column with the comments placed here is added. The textual columns split different answers by a '|' character. - `effectOddsCI.ods` contains the same RAWresultTable, and also includes an `oddsRations` worksheet. Here the calculations for the odds ratios, including confidence intervals are shown. The `sheet3` highlights interesting atoms where the indicated confidence on answers is standing out. (this one does not work in excel, due to regular expression support. I used LibreOffice for this) - `perceptResultCounts.ods` shows the resulting counts of the perception side of the experiment. The sum of people that voted for the respective one of 4 options is displayed per atom.   Additionally several data files, that are used to manually create one of the files mentioned above.   ### /figures includes several `.xslx` files used for creating tables and figures.   ## Scripts   ### autoScreenshot.py used to open the individual .java files for the code examples in vscode. Have vscode open on the right half of the screen to make a screenshot, including the line numbers, excluding the first commented out lines, and save with the appropriate file name in the specified folder.   Includes support for hashing the filenames, so that inspecting the images in the online experiment will not show the name of the atom of confusion.   ### runJava.sh Create a `temp.java` file, copy the input into this file. Then try to compile this java code, run the compiled code (which prints results) and remove the created files again. save results in atom-data/java-answers.csv   ### javaAnswers.py run the `runJava.sh` script over the code snippets and save the results in `java-answers.csv`   ### gen_skeleton.py Make use of the data from https://atomsofconfusion.com/2016-snippet-study/questions.csv to generate files, and save metadata of the atoms-of-confusion from that source.
创建时间:
2020-05-13
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作