Junyi Academy Math Practicing Log (to Jan. 2015)
收藏DataCite Commons2026-04-16 更新2026-05-03 收录
下载链接:
https://datashop.org/DatasetInfo?datasetId=1198
下载链接
链接失效反馈官方服务:
资源简介:
The size of complete data in tab-delimiter format is too large (around 9GB) to upload using the raw text format, so we compress the file and upload them as extra files.
Junyi Academy -- http://www.junyiacademy.org/ -- is a Chinese e-learning website which is established on the basis of the open-source code released by Khan Academy (https://www.khanacademy.org/) in 2012.
In its math curriculum, Junyi Academy provides hundreds of exercises with randomly generated numbers like Khan Academy.
The dataset contains
1. Practicing log from Oct. 2012 to Jan. 2015
2. Exercise-related information on the platform
3. Annotations of exercise relationships
Additional Notes:
There is an exercise hierarchy in the dataset: Area -> Topic (Unit) -> Exercise (Section) -> Problem. In Khan Academy exercise framework, exercise contains one or more problem templates. The numbers in problems are randomly generated, so that users would practice problems with different numbers even though they select the same exercise. That is, the same problem in the dataset is generated by the same template with different numbers.
In addition, the practice log in Junyi Academy does not record some detail behaviors when users answer a problem. For example:
1. We do not know the order of each answering attempt and each hint request in the same problem.
2. The tutor only records the accuracy of the first attempt, so we are unable to infer whether users provide correct answers eventually if there are multiple attempts.
3. We only records IP instead of session ID.
Therefore, we prepare our transaction data by making following assumptions:
1. In the same problem, users always make all attempts first, and request all hints next.
2. If there are multiple attempts, the last attempt is always false.
3. The same user with same IP would have the same session ID.
提供机构:
DataShop
创建时间:
2026-04-16



