Persian Text Readability Dataset
收藏arXiv2020-04-22 更新2024-08-06 收录
下载链接:
http://arxiv.org/abs/1810.06639v4
下载链接
链接失效反馈官方服务:
资源简介:
本研究首次收集了用于波斯语文本可读性评估的数据集,名为Persian Text Readability Dataset。该数据集由K. N. Toosi大学的技术部门创建,包含12,780条来自不同来源和主题的文本,如儿童故事、新闻和哲学等。数据集的创建过程采用众包方法,通过Telegram聊天机器人收集了大量波斯语使用者的可读性评价。该数据集旨在解决波斯语文本可读性自动评估的问题,适用于教育、医疗文本评估等多个领域。
This study presents the first dataset specifically collected for Persian text readability assessment, named Persian Text Readability Dataset. Developed by the Technical Department of K. N. Toosi University, this dataset contains 12,780 texts from diverse sources and topics, including children's stories, news articles, philosophical works, and more. The dataset was constructed using a crowdsourcing approach, where readability ratings from a large number of Persian language users were collected via a Telegram chatbot. This dataset is designed to address the issue of automatic Persian text readability assessment, and is applicable to multiple fields such as educational and medical text evaluation.
提供机构:
K. N. Toosi University of Technology
创建时间:
2018-10-08



