five

FORM1 Kinematic Gesture

收藏
DataCite Commons2021-07-01 更新2025-04-16 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC2004V01
下载链接
链接失效反馈
官方服务:
资源简介:
<h3>Introduction</h3> <p> FORM1 Kinematic Gesture was produced by Linguistic Data Consortium (LDC) catalog number LDC2004V01 and ISBN 1-58563-299-6. </p><p> FORM is a gesture annotation scheme designed to capture the kinematic information in gesture from videos of speakers. This publication is a detailed database of gesture-annotated videos stored in the Anvil and FORM file formats. FORM encodes the "phonetics" of gesture by giving geometric descriptions of location and movement of the right and left arms. Other kinematic information such as effort and shape are also recorded. </p><p> FORM gesture data has applications in statistical natural language processsing, gesture recognition and generation, information extraction from video, and human-computer interaction. </p><p> Please go to the <a href="http://www.ldc.upenn.edu/Projects/FORM" rel="nofollow"> FORM website</a> for more information. The <a href="http://catalog.ldc.upenn.edu/LDC2003V01" rel="nofollow">FORM2</a> publication was released in 2003 by the LDC and encoded much of the same data provided here using a more recent tag set. </p><h3>Data</h3> <p> This publication contains gesture annotations created using the FORM 1.0 tag set. The Anvil annotation files used in their creation are also included, as are 29.5 minutes of the original audio and video recordings excerpted from a lecture given by Brian MacWhinney on January 24, 2000 at Carnegie Mellon University. A second data set, with 5.5 minutes of Paul Howard telling a story in conversation while being motion captured, is also supplied. These video recordings were chosen because they are part of the NSF-funded <a href="http://talkbank.org/" rel="nofollow">TalkBank</a> project. </p><p> There are a total of 69 data files: 21 movie (.mov) files, 24 Anvil (.anvil) files, and 24 FORM (.form1) files. </p><p> The movie files are in <a href="http://www.apple.com/quicktime/download/" rel="nofollow">Quicktime</a> format with the following specs: </p><table> <tr> <td colspan="40%">Size</td> <td colspan="60%">360 x 240 pixels</td> </tr> <tr> <td colspan="40%">Compression</td> <td colspan="60%">H.261</td> </tr> <tr> <td colspan="40%">Video rate</td> <td colspan="60%">29.97 fps</td> </tr> <tr> <td colspan="40%">Audio rate</td> <td colspan="60%">48 kHz</td> </tr> <tr> <td colspan="40%">Audio format</td> <td colspan="60%">8-bit/16-bit stereo</td> </tr> </table><p> Anvil files can be opened using the <a href="http://www.dfki.de/~kipp/anvil" rel="nofollow">Anvil</a> video annotation tool, which is freely available from Michael Kipp. The .form file format is an intermediate data format that contains only the FORM2 values from each .anvil in a comma-delimited, frame-by-frame listing of the following form: frame,upper_arm_lift,forearm_orientation,handshape,wrist_up_down,wrist_side_side,effort,tension </p><h3>Sponsorship</h3> <p> This research was conducted using funding from the following grant sources: ISLE - 9910603 NSF: TalkBank (via subcontract from Carnegie Mellon University) - BCS-998009 and BCS-9978056 NSF: Discourse and Gesture w/ Joshi, Liberman, and Martell - EIA98-09209 </p><h3>Updates</h3> <p>There are no updates available at this time. </p> </br> Portions © 2004 Trustees of the University of Pennsylvania, © 2000 Brian MacWhinney
提供机构:
Linguistic Data Consortium
创建时间:
2020-11-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作