five

HAVIC Pilot Transcription

收藏
DataCite Commons2021-07-01 更新2025-04-16 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC2016V01
下载链接
链接失效反馈
官方服务:
资源简介:
<h3>Introduction</h3><br> <p>HAVIC Pilot Transcription was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 72 hours of user-generated videos with transcripts based on the English speech audio extracted from the videos. This data set was created in collaboration with <a href="http://www.nist.gov/">NIST</a> (the National Institute of Standards and Technology) as part of the <a href="https://www.ldc.upenn.edu/collaborations/past-projects/havic">HAVIC</a> (the Heterogeneous Audio Visual Internet Collection) project, the goal of which was to advance multimodal event detection and related technologies.</p><br> <p>LDC has developed a large, heterogeneous, annotated multimodal corpus for HAVIC that has been used in the NIST-sponsored <a href="http://nist.gov/itl/iad/mig/med.cfm">MED</a> (Multimedia Event Detection) task for several years. HAVIC Pilot Transcription supported an experiment to produce a verbatim transcript (quick and rich transcription) based on audio extracted from user-generated videos. It contains the pilot transcripts for selected MED 2011 video files as well as the associated videos.</p><br> <h3>Data</h3><br> <p>NIST designated the videos to be transcribed. Annotators generated the transcripts using <a href="https://www.ldc.upenn.edu/language-resources/tools/xtrans">XTrans</a>, which supports manual transcription across multiple channels, languages and platforms. HAVIC transcription guidelines are included in the documentation for this release.</p><br> <p>Each file was transcribed by a single annotator with no corpus-wide second pass. File samples from each annotator were checked for various errors, including missing transcription, improper mark-up, poor segmentation and missing/added words.</p><br> <p>All transcription files are in .tdf format, a plain-text, flat-table format with 13 tab-delimited fields. All video files are in .mp4 format (h264), with varying bit-rates and levels of audio fidelity and video resolution.</p><br> <h3>Samples</h3><br> <p>Please view these <a href="desc/addenda/LDC2016V01.mp4">video</a> and <a href="desc/addenda/LDC2016V01.txt">transcript</a> samples.</p><br> <h3>Updates</h3><br> <p>None at this time.</p></br> Portions © 2011-2016 YouTube, LLC, © 2011-2016 Trustees of the University of Pennsylvania
提供机构:
Linguistic Data Consortium
创建时间:
2020-11-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作