five

TextComplexityDE Dataset

收藏
paperswithcode.com2025-01-15 收录
下载链接:
https://paperswithcode.com/dataset/textcomplexityde
下载链接
链接失效反馈
官方服务:
资源简介:
TextComplexityDE is a dataset consisting of 1000 sentences in German language taken from 23 Wikipedia articles in 3 different article-genres to be used for developing text-complexity predictor models and automatic text simplification in German language. The dataset includes subjective assessment of different text-complexity aspects provided by German learners in level A and B. In addition, it contains manual simplification of 250 of those sentences provided by native speakers and subjective assessment of the simplified sentences by participants from the target group. The subjective ratings were collected using both laboratory studies and crowdsourcing approach.

TextComplexityDE 是一个由 1000 个德语文本句子组成的语料库,这些句子源自 23 篇不同文章类型的维基百科文章。该语料库旨在用于开发德语文本复杂度预测模型以及自动文本简化技术。语料库中包含了来自 A 级和B级德语学习者的对文本复杂度不同方面的主观评估。此外,还包括了 250 个句子的手动简化版本,由母语者提供,以及目标群体参与者对简化文本的主观评估。这些主观评分是通过实验室研究和众包方法收集的。
提供机构:
Papers with Code
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作