five

Corpus Minangkabau

收藏
NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://data.mendeley.com/datasets/kch8f4smtw
下载链接
链接失效反馈
官方服务:
资源简介:
In the development of language technologies such as machine translation, speech recognition, and others, the language corpus is very important as a source of training data. By having a corpus of Minangkabau and Indonesian languages, language technology developers can build models and systems that are more accurate and effective. Data for the corpus is collected from a variety of sources, including a number of websites and books that provide information in Minangkabau and Indonesian. Up to 520 Minangkabau and Indonesian sentences have been compiled in the data you want to publish, all of which are presented as rhymes and poetry.
创建时间:
2023-02-27
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作