Sensitivity to Meaningful Morphological Information Acquired through Reading Experience Data Collections, 2022-2025

Source: DataCite Commons. Updated 2025-09-15; indexed 2026-05-06.
Download link:
http://reshare.ukdataservice.ac.uk/id/eprint/858014
Description:
The data collection comprises three elements that link properties of the words occurring in books suitable for children and young people to the morpheme knowledge that readers display in reading tasks:

(a) A lexical database of the words that occur in 1200 books suitable for children and young people aged 7-16. The database comprises over 100,000 words and a range of psycholinguistic properties, such as word frequency and contextual diversity. The corpus from which these words were sourced contains over 70 million words.
(b) A computational algorithm that parses these words into morphemes and provides data about their frequency of occurrence. Notably, the parser works on morphemes defined orthographically (as opposed to etymologically) and so captures what a child might learn about morphology through reading experience.
(c) Response time and accuracy data from a large-scale study of human readers that links the corpus-based morpheme metrics to reading performance.

Each of these datasets, along with the relevant pre-processing and analysis code, is available on the Open Science Framework (OSF). Each OSF project also contains comprehensive documentation to facilitate reuse. Links are available as related resources.
Provider:
UK Data Service
Created:
2025-09-15