HAVIC Pilot Transcription

Name: HAVIC Pilot Transcription
Creator: Linguistic Data Consortium
Published: 2021-07-01 16:29:31
License: 暂无描述

DataCite Commons2021-07-01 更新2025-04-16 收录

下载链接：

https://catalog.ldc.upenn.edu/LDC2016V01

下载链接

链接失效反馈

官方服务：

资源简介：

<h3>Introduction</h3><br> <p>HAVIC Pilot Transcription was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 72 hours of user-generated videos with transcripts based on the English speech audio extracted from the videos. This data set was created in collaboration with <a href="http://www.nist.gov/">NIST</a> (the National Institute of Standards and Technology) as part of the <a href="https://www.ldc.upenn.edu/collaborations/past-projects/havic">HAVIC</a> (the Heterogeneous Audio Visual Internet Collection) project, the goal of which was to advance multimodal event detection and related technologies.</p><br> <p>LDC has developed a large, heterogeneous, annotated multimodal corpus for HAVIC that has been used in the NIST-sponsored <a href="http://nist.gov/itl/iad/mig/med.cfm">MED</a> (Multimedia Event Detection) task for several years. HAVIC Pilot Transcription supported an experiment to produce a verbatim transcript (quick and rich transcription) based on audio extracted from user-generated videos. It contains the pilot transcripts for selected MED 2011 video files as well as the associated videos.</p><br> <h3>Data</h3><br> <p>NIST designated the videos to be transcribed. Annotators generated the transcripts using <a href="https://www.ldc.upenn.edu/language-resources/tools/xtrans">XTrans</a>, which supports manual transcription across multiple channels, languages and platforms. HAVIC transcription guidelines are included in the documentation for this release.</p><br> <p>Each file was transcribed by a single annotator with no corpus-wide second pass. File samples from each annotator were checked for various errors, including missing transcription, improper mark-up, poor segmentation and missing/added words.</p><br> <p>All transcription files are in .tdf format, a plain-text, flat-table format with 13 tab-delimited fields. All video files are in .mp4 format (h264), with varying bit-rates and levels of audio fidelity and video resolution.</p><br> <h3>Samples</h3><br> <p>Please view these <a href="desc/addenda/LDC2016V01.mp4">video</a> and <a href="desc/addenda/LDC2016V01.txt">transcript</a> samples.</p><br> <h3>Updates</h3><br> <p>None at this time.</p></br> Portions © 2011-2016 YouTube, LLC, © 2011-2016 Trustees of the University of Pennsylvania

提供机构：

Linguistic Data Consortium

创建时间：

2020-11-30

5,000+

优质数据集

54 个

任务类型

进入经典数据集