FrancophonIA/sharedtask2019
收藏Hugging Face2025-03-30 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/FrancophonIA/sharedtask2019
下载链接
链接失效反馈官方服务:
资源简介:
DISRPT 2019工作坊推出的跨格式话语单元切分数据集,包含德语、英语、法语、巴斯克语、葡萄牙语、荷兰语、俄语、西班牙语、中文和土耳其语等多种语言。该数据集旨在促进不同话语解析框架下方法的融合,并提供来自RST、SDRT和PDTB三种话语解析格式的训练、开发和测试数据集。数据集针对不同语料库和框架的切分指导原则,鼓励设计灵活的方法处理多样性,并推动话语单元标准的讨论。
The cross-formalism discourse unit segmentation dataset introduced by the DISRPT 2019 workshop, containing multiple languages such as German, English, French, Basque, Portuguese, Dutch, Russian, Spanish, Chinese, and Turkish. The dataset aims to promote the integration of methods under different discourse parsing frameworks and provides training, development, and test datasets from the RST, SDRT, and PDTB discourse parsing formats. It addresses the diverse segmentation guidelines across corpora and frameworks, encouraging the design of flexible methods to handle diversity and promoting the discussion of standards for discourse units.
提供机构:
FrancophonIA



