CompLex

Name: CompLex
Creator: 曼彻斯特都会大学
Published: 2020-06-12 00:42:55
License: 暂无描述

arXiv2020-06-12 更新2024-06-21 收录

下载链接：

https://github.com/MMU-TDMLab/CompLex

下载链接

链接失效反馈

官方服务：

资源简介：

CompLex是一个用于词汇复杂性预测的新数据集，由曼彻斯特都会大学创建。该数据集包含从圣经、欧洲议会和生物医学文本中提取的9476个句子，每个句子由约7名注释者使用5点李克特量表进行注释。数据集旨在通过连续的词汇复杂性预测，解决自然语言处理中复杂词识别的问题。CompLex的应用领域包括文本简化和其他需要词汇复杂性评估的NLP应用。

CompLex is a novel dataset for lexical complexity prediction, developed by Manchester Metropolitan University. It includes 9476 sentences extracted from the Bible, European Parliament proceedings and biomedical texts. Each sentence was annotated by approximately 7 annotators using a 5-point Likert scale. This dataset aims to address the problem of complex word identification in natural language processing (NLP) through continuous lexical complexity prediction. Application areas of CompLex include text simplification and other NLP applications that require lexical complexity assessment.

提供机构：

曼彻斯特都会大学

创建时间：

2020-03-16

5,000+

优质数据集

54 个

任务类型

进入经典数据集