Yor`ub´a Flickr Audio Captions Corpus (YFACC)

Name: Yor`ub´a Flickr Audio Captions Corpus (YFACC)
Creator: 斯坦伦博斯大学, 南非 2布加勒斯特理工大学, 罗马尼亚
Published: 2022-10-12 15:55:39
License: 暂无描述

arXiv2022-10-12 更新2024-06-21 收录

下载链接：

https://www.kamperh.com/yfacc/

下载链接

链接失效反馈

官方服务：

资源简介：

YFACC是一个针对尼日利亚低资源语言Yor`ub´a的语音-图像数据集，由斯坦伦博斯大学和布加勒斯特理工大学合作创建。该数据集包含6000条从Flickr8k数据集中选取的图像及其对应的Yor`ub´a语音描述，旨在通过视觉基础模型实现跨语言关键词定位。数据集的创建过程涉及图像选择、语音描述的翻译和录制，以及关键词的时间对齐。YFACC的应用领域主要集中在语言学研究，特别是对于无文字语言的记录和分析，以及开发适用于低资源语言的语音处理技术。

YFACC is a speech-image dataset targeting the low-resource Yorùbá language of Nigeria, co-developed by Stellenbosch University and Politehnica University of Bucharest. This dataset includes 6000 images selected from the Flickr8k dataset and their corresponding Yorùbá speech descriptions, aiming to enable cross-lingual keyword localization through vision-language foundation models. The dataset creation workflow covers image selection, translation and recording of speech descriptions, as well as temporal alignment of keywords. The application fields of YFACC primarily focus on linguistic research, especially the documentation and analysis of unwritten languages, as well as the development of speech processing technologies suitable for low-resource languages.

提供机构：

斯坦伦博斯大学, 南非 2布加勒斯特理工大学, 罗马尼亚

创建时间：

2022-10-10

5,000+

优质数据集

54 个

任务类型

进入经典数据集