MEDIA Dataset
收藏paperswithcode.com2025-03-25 收录
下载链接:
https://paperswithcode.com/dataset/media
下载链接
链接失效反馈官方服务:
资源简介:
The MEDIA French corpus is dedicated to semantic extraction from speech in a context of human/machine dialogues. The corpus has manual transcription and conceptual annotation of dialogues from 250 speakers. It is split into the following three parts : (1) the training set (720 dialogues, 12K sentences), (2) the development set (79 dialogues, 1.3K sentences, and (3) the test set (200 dialogues, 3K sentences).
《MEDIA 法语语料库》专注于人机对话语境下的语义提取。该语料库包含250名发言人的对话的手动转录和概念性标注。语料库分为以下三个部分:(1)训练集(720个对话,12,000个句子),(2)开发集(79个对话,1,300个句子),以及(3)测试集(200个对话,3,000个句子)。
提供机构:
Papers with Code



