five

Persian Text Readability Dataset

收藏
arXiv2020-04-22 更新2024-08-06 收录
下载链接:
http://arxiv.org/abs/1810.06639v4
下载链接
链接失效反馈
官方服务:
资源简介:
本研究首次收集了用于波斯语文本可读性评估的数据集,名为Persian Text Readability Dataset。该数据集由K. N. Toosi大学的技术部门创建,包含12,780条来自不同来源和主题的文本,如儿童故事、新闻和哲学等。数据集的创建过程采用众包方法,通过Telegram聊天机器人收集了大量波斯语使用者的可读性评价。该数据集旨在解决波斯语文本可读性自动评估的问题,适用于教育、医疗文本评估等多个领域。

This study presents the first dataset specifically collected for Persian text readability assessment, named Persian Text Readability Dataset. Developed by the Technical Department of K. N. Toosi University, this dataset contains 12,780 texts from diverse sources and topics, including children's stories, news articles, philosophical works, and more. The dataset was constructed using a crowdsourcing approach, where readability ratings from a large number of Persian language users were collected via a Telegram chatbot. This dataset is designed to address the issue of automatic Persian text readability assessment, and is applicable to multiple fields such as educational and medical text evaluation.
提供机构:
K. N. Toosi University of Technology
创建时间:
2018-10-08
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作