FrancophonIA/sharedtask2021
收藏Hugging Face2025-03-30 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/FrancophonIA/sharedtask2021
下载链接
链接失效反馈官方服务:
资源简介:
DISRPT 2021共享任务数据集是一个多语种的数据集,包含德语、英语、波斯语、法语、巴斯克语、葡萄牙语、荷兰语、俄语、西班牙语、中文和土耳其语等多种语言的数据。该数据集用于话语单元划分、连接词检测和话语关系分类任务,提供了RST、SDRT和PDTB三种格式的训练、开发和测试数据集。数据集包含有语法标注和无语法标注的版本,以及用于比较的自动解析数据。
The DISRPT 2021 shared task dataset is a multilingual dataset that includes data in German, English, Persian, French, Basque, Portuguese, Dutch, Russian, Spanish, Chinese, and Turkish. The dataset is used for discourse unit segmentation, connective detection, and discourse relation classification tasks, and provides training, development, and test datasets in the RST, SDRT, and PDTB formats. The dataset includes versions with and without syntactic annotations, as well as automatically parsed data for comparison.
提供机构:
FrancophonIA



