Movie Triples Corpus (MTC)

DataCite Commons | Updated: 2025-01-02 | Indexed: 2025-04-16

Download link:

https://service.tib.eu/ldmservice/dataset/afde7ac6-5f5d-4299-a489-6619fa8925bc

Description:

The Movie Triples Corpus (MTC) is derived from the Movie-DiC dataset of Banchs (2012). Although it spans a wide range of topics and contains few spelling mistakes, its small size of only about 240,000 dialogue triples makes it difficult to train a dialogue model on, as pointed out by Serban et al. (2016).

Provider:

TIB

Created:

2025-01-02
