sawadogosalif/MooreFRCollections
收藏Hugging Face2024-12-08 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/sawadogosalif/MooreFRCollections
下载链接
链接失效反馈官方服务:
资源简介:
MooreFRCollections是一个开放项目,致力于创建Mooré(布基纳法索的一种本地语言)与法语的双语语料库,用于翻译和其他机器学习应用的研究和开发。数据集包含来自圣经文本、双语词典和人权宣言等来源的文本数据,所有数据都经过清洗和格式化以适应现代机器学习工具。数据集可通过HuggingFace的`datasets`库加载,并支持多种应用,如自动翻译、语言研究、监督学习和教育应用。
MooreFRCollections is an open project dedicated to creating a Mooré-Français bilingual corpus for research and development of linguistic technologies adapted to the Burkinabé context. The goal of the project is to provide a key tool for testing, training, and refining translation models and other machine learning applications. The dataset is constructed from sources including biblical texts from JW.ORG, bilingual dictionaries, and versions of the Universal Declaration of Human Rights. The dataset currently contains only textual data but has been carefully cleaned and formatted to be compatible with modern machine learning tools. Applications include automatic translation, linguistic research, supervised learning, and educational applications.
提供机构:
sawadogosalif



