Parallel Corpus: Indonesian-Minang

Mendeley Data2026-04-09 收录

下载链接：

https://data.mendeley.com/datasets/6zghymcv2d

下载链接

链接失效反馈

官方服务：

资源简介：

This is a parallel corpus dataset containing pairs of sentences in two corresponding languages. This dataset is specifically designed to support and facilitate the application of machine learning techniques in language translation. Each sentence pair in the dataset has been meticulously compiled to cover a wide range of contexts and topics, providing extensive coverage of everyday language usage. By offering variations in context and topics, users can gain a deeper understanding of the nuances of everyday language use, as well as recognize language variations and idiomatic expressions from both languages involved. This enables further research and development in translation applications and natural language processing, while also offering deeper insights into the structure and function of language across different contexts.

5,000+

优质数据集

54 个

任务类型

进入经典数据集