MADAR Parallel Corpus Dataset
收藏SSH Open MarketPlace2025-04-02 更新2025-04-05 收录
下载链接:
https://marketplace.sshopencloud.eu/dataset/9uRkkC
下载链接
链接失效反馈官方服务:
资源简介:
The MADAR corpus is a collection of parallel sentences covering the dialects of 25 cities from the Arab World, in addition to English, French, and MSA. The corpus is created by translating selected sentences from the Basic Traveling Expression Corpus (BTEC) (Takezawa et al., 2007) to the different dialects. The exact details on the translation process and source and target languages are described in Bouamor et al. (2018).
创建时间:
2025-04-02



