MEDIA Dataset

Name: MEDIA Dataset
Creator: Papers with Code
License: 暂无描述

paperswithcode.com2025-03-25 收录

下载链接：

https://paperswithcode.com/dataset/media

下载链接

链接失效反馈

官方服务：

资源简介：

The MEDIA French corpus is dedicated to semantic extraction from speech in a context of human/machine dialogues. The corpus has manual transcription and conceptual annotation of dialogues from 250 speakers. It is split into the following three parts : (1) the training set (720 dialogues, 12K sentences), (2) the development set (79 dialogues, 1.3K sentences, and (3) the test set (200 dialogues, 3K sentences).

《MEDIA 法语语料库》专注于人机对话语境下的语义提取。该语料库包含250名发言人的对话的手动转录和概念性标注。语料库分为以下三个部分：（1）训练集（720个对话，12,000个句子），（2）开发集（79个对话，1,300个句子），以及（3）测试集（200个对话，3,000个句子）。

提供机构：

Papers with Code

5,000+

优质数据集

54 个

任务类型

进入经典数据集