TV Series Corpus
Source: Figshare. Updated: 2020-02-17. Indexed: 2026-04-08.
Download link:
https://figshare.com/articles/dataset/TV_Series_Corpus/3471839/4
Description:
<b>Corpus of three TV Series</b> with <b>manual</b> and <b>automatic</b> annotations:<br><br>- <i>Breaking Bad</i>: S01--S03 (file 'bb.json')<br>- <i>Game of Thrones</i>: S01--S05 (file 'got.json')<br>- <i>House of Cards</i>: S01--S02 (file 'hoc.json')<br><br>All three files are in .json format and contain TV Series <b>projects</b>.<br><br>A <b>project</b> is specified by a <b>name</b> and is related to a TV Series defined by its <b>name</b>.<br><br>A TV Series contains <b>seasons</b>, defined by their <b>number</b>.<br><br>Every season is made of <b>episodes</b>, defined by their <b>name</b> and <b>number</b>, along with the <b>name</b> of the associated video file, its <b>frame rate</b> and size (<b>width</b> x <b>height</b>).<br><br>Each episode contains both <b>scenes</b> and <b>subtitles</b>.<br><br>Scenes are made of <b>shots</b>.<br><br>A shot is defined by:<br><br>- <b>Cam</b>era position labels, both manual and automatic.<br>- Starting and <b>end</b>ing <b>pos</b>itions.<br>- <b>Music</b>ality rate measured every second and ranging from 0 to 1.<br>- Coordinates of rectangular boxes surrounding possible <b>faces</b>. Face detection is performed on a sample of 5 frames uniformly distributed over the shot. The <b>pos</b>ition of each of these frames is given.<br><br>The subtitles are defined by their:<br><br>- <b>Text</b>ual content (not reproduced here for copyright reasons).<br>- <b>Sp</b>ea<b>k</b>ing character.<br>- Possible <b>inter</b>locutor(s): manually annotated; automatically annotated from the subtitle sequence; automatically annotated from the characters co-appearing in the current scene.<br><br>All timestamps are expressed in milliseconds and are valid for the video files extracted from the commercial DVDs, with recaps included at the beginning of the <i>House of Cards</i> episodes.<br>
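The description above names the levels of the hierarchy (project, seasons, episodes, scenes, shots, subtitles) but does not spell out the exact JSON key names. The following is a minimal Python sketch for discovering them, not an official loader for the corpus: it counts every key path it finds in a file, so the real keys can be read off before writing any analysis code. The function names `key_paths`, `print_structure` and `ms_to_frame` are hypothetical helpers introduced here. The millisecond-to-frame conversion assumes that an episode's frame rate is a plain number of frames per second, which the description suggests but does not state.

```python
import json
from collections import Counter


def key_paths(node, prefix="", counts=None):
    """Count every key path in a nested JSON value.

    Items of a list share a single path, written 'parent[]'.
    """
    if counts is None:
        counts = Counter()
    if isinstance(node, dict):
        for key, value in node.items():
            path = f"{prefix}.{key}" if prefix else key
            counts[path] += 1
            key_paths(value, path, counts)
    elif isinstance(node, list):
        for item in node:
            key_paths(item, prefix + "[]", counts)
    return counts


def print_structure(json_path):
    """Print each key path found in the file with its number of occurrences."""
    with open(json_path, encoding="utf-8") as handle:
        data = json.load(handle)
    for path, count in sorted(key_paths(data).items()):
        print(f"{count:8d}  {path}")


def ms_to_frame(timestamp_ms, frame_rate):
    """Convert a millisecond timestamp to the nearest frame index.

    Assumption: frame_rate is expressed in frames per second.
    """
    return round(timestamp_ms * frame_rate / 1000)
```

Calling `print_structure("got.json")` on a downloaded file should list the actual key names and how often each occurs, which can then be checked against the description. Once the frame-rate key is known, `ms_to_frame` may help map shot boundaries or subtitle timestamps onto video frames.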
Created:
2019-01-10



