five

LEONIDE - Longitudinal Learner Corpus in Italiano, Deutsch and English

收藏
SSH Open MarketPlace2023-10-13 更新2024-08-03 收录
下载链接:
https://marketplace.sshopencloud.eu/dataset/6RTgc0
下载链接
链接失效反馈
官方服务:
资源简介:
This is a longitudinal corpus of student essays documenting the language competences and writing development of lower secondary school students in three different languages. The texts were collected over the span of 3 consecutive years (2015-2018) in public middle schools. The pupils were 11 years old at the beginning of the data collection and 13 years old at the end. In each grade, two written texts were collected that differ with respect to genre: the first text was elicited using a picture story re-telling task; the second text is an opinion text on different aspects related to the pupils’ life and public discourse. Manual annotation concerns the fact that the corpus is fully anonymised and annotated with target hypotheses correcting orthography errors in the text as well as annotations on structural elements (paragraphs, line breaks, bullet points, symbols or emoticons etc.), foreign word insertions and transcript surface features (e.g. deletions, corrections or insertions of the student, unreadable or ambiguous items). The corpus is available for download from the the Eurac Research CLARIN Centre Repository.
创建时间:
2023-10-13
二维码
社区交流群
二维码
科研交流群
商业服务