
Speech Sentiment Annotations

DataCite Commons | Updated: 2020-07-23 | Indexed: 2025-04-16
Download link:
https://catalog.ldc.upenn.edu/LDC2020T14
Description:
Introduction
Speech Sentiment Annotations was developed by Google Inc. It consists of sentiment labels (positive, negative, neutral) for approximately 49,500 utterances covering 140 hours of audio from Switchboard-1 Release 2 (LDC97S62).

Switchboard-1 Release 2 consists of approximately 260 hours of telephone speech from 543 speakers across the United States (302 male speakers, 241 female speakers). A computer-driven telephone collection platform paired two subjects for each conversation and provided a discussion topic. No two speakers conversed together more than once and no one speaker talked more than once on a given topic.

Data
Switchboard speech files were segmented based on the start and end time of transcript turns. Annotators listened to the audio corresponding to each segment (utterance) and classified each into positive, negative or neutral categories based on the emotion and attitude of the speaker. Annotators provided a justification for positive and negative classifications using a flow chart. Further information about the methodology and annotation process is contained in the documentation accompanying this release.

The data is stored as a single UTF-8 encoded tab-delimited file. The annotation column in each row includes judgments from at least three annotators.

Samples
Please view the following sample.

Updates
None at this time.
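
As an illustration of the file layout described under Data (one tab-delimited row per utterance, with an annotation column holding at least three judgments), here is a minimal Python sketch that reads such rows and takes a majority vote. Everything specific in it is an assumption, not taken from the release: the column order (utterance ID first, annotation last), the "#" separator between judgments, the sample rows, and the helper names majority_label and read_annotations are all hypothetical. The documentation accompanying the release defines the actual format.

```python
import csv
import io
from collections import Counter

# HYPOTHETICAL sample rows. The real column order and the separator between
# annotator judgments are assumptions; consult the release documentation.
SAMPLE_TSV = (
    "utt_0001\t0.00\t4.25\tpositive#positive#neutral\n"
    "utt_0002\t4.25\t9.10\tnegative#neutral#negative\n"
    "utt_0003\t9.10\t12.00\tneutral#neutral#neutral\n"
)


def majority_label(judgments):
    """Return the most frequent label, or None if empty or tied for first."""
    counts = Counter(judgments).most_common()
    if not counts:
        return None
    if len(counts) > 1 and counts[0][1] == counts[1][1]:
        return None
    return counts[0][0]


def read_annotations(handle, judgment_sep="#"):
    """Yield (utterance_id, judgments, majority) for each tab-delimited row.

    Assumes the first column is an utterance ID and the last column is the
    annotation column (both hypothetical).
    """
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if not row:
            continue  # skip blank lines
        utt_id, annotation = row[0], row[-1]
        judgments = [
            part.strip().lower()
            for part in annotation.split(judgment_sep)
            if part.strip()
        ]
        yield utt_id, judgments, majority_label(judgments)


rows = list(read_annotations(io.StringIO(SAMPLE_TSV)))
for utt_id, judgments, label in rows:
    print(utt_id, judgments, label)
```

For a file on disk, the same function could be given a handle opened with open(path, encoding="utf-8", newline=""). Returning None on a tie is only one possible choice; keeping all judgments per utterance may be preferable when annotator disagreement is itself of interest.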
Provider:
Linguistic Data Consortium
Created:
2020-07-07