WueNLP/sib-fleurs
收藏Hugging Face2025-05-07 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/WueNLP/sib-fleurs
下载链接
链接失效反馈官方服务:
资源简介:
SIB-Fleurs 数据集是一个多语言数据集,支持多种语言,适用于音频分类、自动语音识别、音频文本到文本、文本到语音、问答和文档问答等任务。数据集包含句子、URL、ID、领域、主题、是否有图像、是否有超链接、fleurs_id、文件名、原始转录、转录、样本数量、说话者ID、性别、whisper_asr、seamlessm4t_asr等特征。每个配置都提供了关于特征、数据集划分(训练集、验证集、测试集)、下载大小和数据集大小的详细信息。
The SIB-Fleurs dataset is a multilingual dataset that supports a wide range of languages and is designed for tasks such as audio classification, automatic speech recognition, audio-text-to-text, text-to-speech, question answering, and document-question answering. The dataset includes features such as sentence, URL, id, domain, topic, has_image, has_hyperlink, fleurs_id, filename, raw_transcription, transcription, num_samples, speaker_id, gender, whisper_asr, seamlessm4t_asr, and more. Each configuration provides details about the features, splits (train, validation, test), download size, and dataset size.
提供机构:
WueNLP



