five

How2

收藏
arXiv2018-12-07 更新2024-06-21 收录
下载链接:
https://github.com/srvk/how2-dataset
下载链接
链接失效反馈
官方服务:
资源简介:
How2数据集是由卡内基梅隆大学等机构创建的一个大规模多模态语言理解数据集,包含80,000个教学视频片段,总时长约2,000小时。该数据集不仅支持多模态分析,还提供了英语字幕及其众包葡萄牙语翻译,适用于多种语言处理任务。数据集的创建过程涉及从YouTube下载视频,提取视觉特征,并通过众包平台Figure Eight进行翻译。How2数据集的应用领域广泛,旨在解决机器在多模态环境下的语言理解和处理问题,促进语言、语音和视觉研究社区的协作。

The How2 dataset is a large-scale multimodal language understanding dataset created by Carnegie Mellon University and other institutions. It contains 80,000 instructional video clips, with a total duration of approximately 2,000 hours. This dataset not only supports multimodal analysis, but also provides English subtitles and their crowdsourced Portuguese translations, making it applicable to a variety of language processing tasks. The creation process of the dataset involves downloading videos from YouTube, extracting visual features, and carrying out translations via the crowdsourcing platform Figure Eight. The How2 dataset covers a wide range of application fields, aiming to solve the problems of machine language understanding and processing in multimodal environments, and promote collaboration among the language, speech and vision research communities.
提供机构:
卡内基梅隆大学
创建时间:
2018-11-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作