
TV Series Corpus

Source: DataCite Commons. Updated: 2025-05-01. Indexed: 2024-07-25.
Download link:
https://figshare.com/articles/dataset/TV_Series_Corpus/3471839/3
Description:
<b>Corpus of three TV Series</b> with <b>manual</b> and <b>automatic</b> annotations:<br><br>- <i>Breaking Bad</i>: S01--S03 (file 'bb.json')<br>- <i>Game of Thrones</i>: S01--S05 (file 'got.json')<br>- <i>House of Cards</i>: S01--S02 (file 'hoc.json')<br><br>All three files are in .json format and contain TV Series <b>projects</b>.<br><br>A <b>project</b> is specified by a <b>name</b> and is related to a TV Series defined by its <b>name</b>.<br><br>A TV Series contains <b>seasons</b>, defined by their <b>number</b>.<br><br>Every season is made of <b>episodes</b>, defined by their <b>name</b> and <b>number</b>, along with the <b>name</b> of the associated video file, its <b>frame rate</b> and size (<b>width</b> x <b>height</b>).<br><br>Each episode contains both <b>scenes</b> and <b>subtitles</b>.<br><br>Scenes are made of <b>shots</b>.<br><br>A shot is defined by:<br><br>- <b>Cam</b>era position labels, both manual and automatic.<br>- Starting and <b>end</b>ing <b>pos</b>itions.<br>- <b>Music</b>ality rate, measured every second and ranging from 0 to 1.<br>- Coordinates of rectangular boxes surrounding possible <b>faces</b>. Face detection is performed on a sample of 5 frames uniformly distributed over the shot. The <b>pos</b>ition of each of these frames is given.<br><br>The subtitles are defined by their:<br><br>- <b>Text</b>ual content.<br>- <b>Sp</b>ea<b>k</b>ing character.<br>- Possible <b>inter</b>locutor(s): manually annotated; automatically annotated from the subtitle sequence; automatically annotated from the characters co-appearing in the current scene.<br><br>All timestamps are expressed in milliseconds and are valid for the video files extracted from the commercial DVDs, with recaps included at the beginning of the <i>House of Cards</i> episodes.<br><br>The .pdf file details the <b>source of annotation</b> (manual/automatic) for each of the tasks considered.
Provider:
figshare
Created:
2016-07-11
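
The nesting described above (project, TV Series, seasons, episodes, scenes, shots) can be walked with a few nested loops. The sketch below is illustrative only: the description does not give the exact JSON schema, so every key name used here ("series", "seasons", "episodes", "scenes", "shots", "number", "frame_rate", "start", "end") is an assumption, and the sample dictionary is made-up data, not an excerpt from the corpus. It also shows how a millisecond timestamp could be mapped to a frame index using an episode's frame rate.

```python
from typing import Any, Dict, Iterator, Tuple

# NOTE: all key names below are hypothetical; inspect the real files
# (bb.json, got.json, hoc.json) to find the actual field names.


def iter_shots(series: Dict[str, Any]) -> Iterator[Tuple[int, Dict[str, Any], Dict[str, Any]]]:
    """Yield (season_number, episode, shot) for every shot in a series."""
    for season in series.get("seasons", []):
        for episode in season.get("episodes", []):
            for scene in episode.get("scenes", []):
                for shot in scene.get("shots", []):
                    yield season.get("number"), episode, shot


def ms_to_frame(timestamp_ms: float, frame_rate: float) -> int:
    """Convert a millisecond timestamp to the nearest frame index."""
    return round(timestamp_ms * frame_rate / 1000.0)


# Tiny made-up project mirroring the described hierarchy (not corpus data).
sample_project = {
    "name": "demo project",
    "series": {
        "name": "Demo Series",
        "seasons": [
            {
                "number": 1,
                "episodes": [
                    {
                        "number": 1,
                        "name": "Pilot",
                        "frame_rate": 25.0,
                        "scenes": [
                            {"shots": [
                                {"start": 0, "end": 2000},
                                {"start": 2000, "end": 5200},
                            ]}
                        ],
                        "subtitles": [],
                    }
                ],
            }
        ],
    },
}

for season_number, episode, shot in iter_shots(sample_project["series"]):
    fps = episode["frame_rate"]
    first = ms_to_frame(shot["start"], fps)
    last = ms_to_frame(shot["end"], fps)
    print(f"S{season_number:02d}E{episode['number']:02d}: frames {first}-{last}")
```

To try this on the real data, one would load a file with Python's standard `json.load` and adapt the key names to whatever the file actually contains.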