Maltese Simplification Corpus
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/10419573
下载链接
链接失效反馈官方服务:
资源简介:
A document-level parallel corpus of simple and complex Maltese texts.
A number of websites of governmental and non-governmental Maltese organisations publish documents in a writing style called 'easy to read', which is meant to be accessible for children and people with particular special needs. These are sometimes included as 'translations' of other official publications that are deemed important for said target audience.
This project is a collection of plain texts extracted from these documents with the aim of creating an opportunity to study the differences between easy-to-read text and non-simplified text. It is useful for tasks such as automatic text simplification for the Maltese language, although the size and domain variability of the corpus is extremely limited at the moment.
Repository: https://github.com/mtanti/maltese-simplification-corpus
创建时间:
2023-12-21



