FrancophonIA/Fon_French_Daily_Dialogues_Parallel_Data
Source: Hugging Face. Updated: 2025-03-30. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/FrancophonIA/Fon_French_Daily_Dialogues_Parallel_Data
Description:
This dataset contains aligned sentence pairs in French and Fon (also known as Fongbe), an indigenous African language. Fon is spoken primarily in Benin, Togo, and Nigeria by approximately 2 million people. The dataset is intended for natural language processing research on the low-resource Fon language, including neural machine translation and named entity recognition. A total of 25,377 parallel Fon-French sentence pairs were collected through crowdsourcing and Google Forms surveys and then cleaned, for use in developing and designing translation and NLP models.
Provider:
FrancophonIA



