Mboshi-French Parallel Corpus

Source: arXiv. Updated: 2018-02-15. Indexed: 2024-06-21.
Download link:
https://github.com/besacier/mboshi-french-parallel-cor
Description:
The Mboshi-French Parallel Corpus was created jointly by several institutions, including the French National Centre for Scientific Research. It contains 5000 speech recordings in the Mboshi language, each paired with a French translation. The corpus is designed to support computational linguistics research, in particular the automatic analysis and annotation of unwritten languages. The data was collected with the mobile application LigAikuma, with the aim of capturing authentic and varied speech. The corpus is suited to tasks such as speech recognition and lexical discovery, and it supports work on language endangerment, language preservation, and related research.
Provider: National Centre for Scientific Research
Created: 2017-10-10