five

Welsh Summarisation Dataset

收藏
arXiv2022-05-05 更新2024-06-21 收录
下载链接:
https://github.com/UCREL/welsh-summarizationdataset
下载链接
链接失效反馈
官方服务:
资源简介:
Welsh Summarisation Dataset是由兰卡斯特大学计算与通信学院的UCREL NLP组创建的,旨在支持威尔士语文本摘要研究。该数据集包含513篇威尔士语维基百科文章,每篇文章均由威尔士语母语者手动摘要。数据集的创建过程涉及文本收集、参考摘要的创建、摘要系统的构建与评估。该数据集的应用领域包括文档准备、校对以及在特定情况下的翻译,旨在通过自动化工具促进威尔士语言技术的发展。

Welsh Summarisation Dataset was developed by the UCREL NLP Group within the School of Computing and Communications at Lancaster University, with the aim of supporting research on Welsh-language text summarization. This dataset contains 513 Welsh-language Wikipedia articles, each of which has been manually summarized by native Welsh speakers. The process of creating this dataset includes text collection, reference summary development, as well as the construction and evaluation of summarization systems. Its application areas cover document preparation, proofreading, and translation in specific contexts, and it seeks to advance the development of Welsh language technologies through automated tools.
提供机构:
兰卡斯特大学计算与通信学院
创建时间:
2022-05-05
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作