five

Heroes

收藏
arXiv2023-01-25 更新2024-07-30 收录
下载链接:
https://facebookresearch.github.io/speech_translation/cascade_expressive_s2st
下载链接
链接失效反馈
官方服务:
资源简介:
Heroes数据集是由名古屋大学和Meta AI共同创建的,专注于电视系列领域的表达性语音到语音翻译测试集。该数据集包含406个样本,每个样本平均时长为3.10分钟,主要用于评估语音翻译系统在维持翻译准确性的同时,如何有效地转移源语音的韵律属性,如语调、强调和情感。数据集的创建过程包括去噪、质量检查和人工转录校正,确保数据质量。该数据集的应用领域主要集中在提高语音到语音翻译系统的表达性,解决跨语言交流中的韵律信息传递问题。

The Heroes dataset was co-created by Nagoya University and Meta AI, and it is an expressive speech-to-speech translation test set focused on the television series domain. It contains 406 samples, with an average duration of 3.10 minutes per sample. It is primarily used to evaluate how effectively speech translation systems can transfer the prosodic attributes of source speech, such as intonation, emphasis and emotion, while maintaining translation accuracy. The dataset's creation process includes denoising, quality inspection and manual transcription correction to ensure data quality. Its main application areas focus on enhancing the expressiveness of speech-to-speech translation systems and addressing the issue of prosodic information transfer in cross-linguistic communication.
提供机构:
名古屋大学
创建时间:
2023-01-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作