Data for paper on the evolution of Chinese characters
Source: NIAID Data Ecosystem (indexed 2026-03-14)
Download link:
https://zenodo.org/record/7185330
Description:
This dataset contains all image, complexity and distinctiveness data that was used for:
Han, S. J., Kelly, P., Winters, J., & Kemp, C. (2022). Simplification is not dominant in the evolution of Chinese characters. Open Mind.
The code for this project can be found here. The file uploaded here is intended to replace the sample data folder that is available in the code repository.
Our dataset includes data scraped from hanziyuan.net, as well as data from the following sources:
Sun, C. C., Hendrix, P., Ma, J., & Baayen, R. H. (2018). Chinese lexical database (CLD): A large-scale lexical database for simplified Mandarin Chinese. Behavior Research Methods, 50(6), 2606–2629.
Wikimedia Commons. (2021). Chinese characters decomposition. https://commons.wikimedia.org/wiki/Commons:Chinese_characters_decomposition
Liu, C.-L., Yin, F., Wang, D.-H., & Wang, Q.-F. (2011). CASIA online and offline Chinese handwriting databases. In 2011 international conference on document analysis and recognition (pp. 37–41). https://doi.org/10.1109/ICDAR.2011.17
Chen, P.-C. (2020). Traditional Chinese handwriting dataset. GitHub. https://github.com/AI-FREE-Team/Traditional-Chinese-Handwriting-Dataset
Created:
2022-12-29



