five

childPoeDE: A corpus of German Children's Poems for Computational and Experimental Studies - Metadata

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/7684911
下载链接
链接失效反馈
官方服务:
资源简介:
The childPoeDE corpus is a collection of 1082 German poems for children created within the CHYLSA project. The poems were taken from anthologies published between 1991 and 2019. This publication includes the poem-level metadata for each poem with information about the author, the poem's length, data on case, punctuation, layout, rhyme, type-token ratio (TTR and MATTR) and lexical density. It also includes token-level metadata, namely word length and position, POS tags in different levels of granularity as well as data on onomatopoeia and sonority. Furthermore, this publication provides a word frequency table and a Python script which was used to extract some of the metadata from the texts (poemtool.py). The childPoeDE corpus does not contain all poems from the anthologies. A list of the poems that have been omitted for different reasons (length, language, typography, ...) can be accessed as well. Read more about the childPoeDE corpus in our data paper published in the Journal of Open Humanities Data: The ChildPoeDE Corpus: 1082 German Children’s Poems for Computational and Experimental Studies on Poetry Reception. DFG Schwerpunktprogramm SPP 2207 “Computational Literary Studies“ Online: https://gepris.dfg.de/gepris/projekt/402743989 https://dfg-spp-cls.github.io/ Subproject: „CHYLSA (Children’s and Youth Literature Sentiment Analysis)“ Online: https://gepris.dfg.de/gepris/projekt/424250469 https://dfg-spp-cls.github.io/projects_en/2020/01/24/TP-CHYLSA/
创建时间:
2024-07-12
二维码
社区交流群
二维码
科研交流群
商业服务