TV Series Corpus
收藏DataCite Commons2020-09-04 更新2024-07-25 收录
下载链接:
https://figshare.com/articles/dataset/TV_Series_Corpus/3471839/1
下载链接
链接失效反馈官方服务:
资源简介:
<b>Corpus of three TV Series</b> with <b>manual</b> and <b>automatic</b> annotations:<br><br>- <i>Breaking Bad</i>: S01--S03 (file 'bb.json')<br>- <i>Game of Thrones</i>: S01--05 (file 'got.json')<br>- <i>House of Cards</i>: S01--S02 (file 'hoc.json')<br><br>All three files are in .json format and contain TV Series <b>projects</b>.<br><br>A <b>project</b> is specified by a <b>name</b> and is related to a TV Series defined by its <b>name</b>,<br><br>A TV Series contains <b>seasons</b>, defined by their <b>number</b>.<br><br>Every season is made of <b>episodes</b>, defined by their <b>name</b> and <b>number</b>, along wih the <b>name</b> of the associated video file, its <b>frame rate</b> and size (<b>width</b> x <b>height</b>).<br><br>Each episode contains both <b>scenes</b> an <b>subtitles</b>.<br><br>Scenes are made of <b>shots.<br><br></b>A shot is defined by<b>:<br><br></b>-<b> Cam</b>era position labels, both manual and automatic.<br>- Starting and <b>end</b>ing <b>pos</b>itions.<br>- <b>Music</b>ality rate measured every seconds and ranging from 0 to 1.<br>- Coordinates of rectangular boxes surrounding possible <b>faces</b>. Face detection is performed on a sample of 5 frames uniformly distributed over the shot. The <b>pos</b>ition of each of these frames is mentioned.<br><br>The subtitles are defined by their:<br><br>- <b>Text</b>ual content.<br>- <b>Sp</b>ea<b>k</b>ing character.<br>- Possible<b> inter</b>locutor(s): manually annotated; automatically annotated from the subtitle sequence, automatically annotated from the characters co-appearing in the current scene.<br><br>All positions are in milliseconds.<br><br>The .pdf file details the <b>source of annotation</b> (manual/automatic) for each of the tasks considered. <b> <br> </b><br>
提供机构:
figshare
创建时间:
2016-07-07



