Corpus CLIPS_MT_MANUAL
Source: SSH Open MarketPlace. Updated: 2023-10-13. Indexed: 2024-08-03.
Download link:
https://marketplace.sshopencloud.eu/dataset/lTshYO
Description:
This is a manually annotated sub-corpus of the original Italian CLIPS corpus (Corpora e Lessici dell'Italiano Parlato e Scritto). It covers only 15 maptask dialogues recorded in 15 locations by local speaker pairs. The corpus contains 3228 inspected and partially repaired WAV signal files, each containing one dialogue turn (*.wav); 3228 corrected original CLIPS annotation files (*.acs, *.phn, *.std, *.wrd); 3228 BAS Partitur files containing the annotation tiers ORT, KAN and SAP (*.par); and 3228 EMU database annotation files (*.vot, *.hlb). These files cover 30 maptask dialogues performed by 30 speakers (each speaker pair performing two different map tasks), recorded in 15 different locations in Italy in 2000-2004.
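Because the description lists file types by extension, a downloaded copy can be checked with a simple inventory. The following is a minimal sketch, assuming the corpus has been unpacked into a local directory; the directory layout is not documented here, so the code simply walks the tree recursively. The names `count_by_extension` and `report` are hypothetical helpers, not part of any corpus tooling.

```python
from collections import Counter
from pathlib import Path

# File extensions named in the corpus description.
EXPECTED_EXTENSIONS = (".wav", ".acs", ".phn", ".std", ".wrd", ".par", ".vot", ".hlb")


def count_by_extension(root):
    """Count all files under root, grouped by lowercased extension."""
    counts = Counter()
    for path in Path(root).rglob("*"):
        if path.is_file():
            counts[path.suffix.lower()] += 1
    return counts


def report(root):
    """Return the file count for each extension listed in the description."""
    counts = count_by_extension(root)
    return {ext: counts.get(ext, 0) for ext in EXPECTED_EXTENSIONS}
```

Calling `report("path/to/unpacked/corpus")` (a placeholder path) returns a dictionary of counts per extension, which can then be compared by hand against the figures in the description. The description does not state whether the 3228 figure applies to each extension separately or to each group of files, so the sketch reports raw counts rather than asserting expected values.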
Created:
2023-10-13



