Data underlying the research between ASR and MT quality of Automatic Subtitling Platforms

Name: Data underlying the research between ASR and MT quality of Automatic Subtitling Platforms
Creator: 4TU.ResearchData
Published: 2023-10-18 09:29:19
License: 暂无描述

DataCite Commons2023-10-18 更新2024-07-03 收录

下载链接：

https://data.4tu.nl/datasets/7cfa296a-72b7-4460-acd4-86193b43701e

下载链接

链接失效反馈

官方服务：

资源简介：

In the first experiment of ASR accuracy comparison, 1 set of speech-to-text data (hereafter Veed 0 and Iflyrec 0 ) is generated after submitting the “Qantas Safety video” on “Iflyrec” and “Veed”. The reference speech-to-text data is transcribed from Qantas’ official channel on YouTube.In the second experiment of automatic subtitling translation comparison, 3 sets of data are collected and analyzed. The author uses the original speech-to-text data of “Iflyrec” and “Veed” to generate one set of automatic subtitling translations (hereafter Veed 1 and Iflyrec 1), and then inputs the speech-to-text data on these two platforms to generate the final automatic subtitling translation version (hereafter Veed 2 and Iflyrec 2). For the human translation reference, this paper uses the translation from a tutor affiliated with the Civil Aviation University of China.

提供机构：

4TU.ResearchData

创建时间：

2023-10-18

5,000+

优质数据集

54 个

任务类型

进入经典数据集