five

Source code and data for the PhD Thesis "Linguistically-Inspired Neural Coherence Modeling"

收藏
DataCite Commons2025-09-03 更新2026-05-07 收录
下载链接:
https://heidata.uni-heidelberg.de/citation?persistentId=doi:10.11588/DATA/ZBNUCG
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains source code and data used in the PhD thesis "Linguistically-Inspired Neural Coherence Modeling". The dataset is split into five repositories: <ul> <li> StruSim: Source code to run experiments for Chapter 4 "Document Structure Similarity-Enhanced Coherence Modeling". </li> <li> ConnRel: Source code to run experiments for Chapter 5 "Annotation-inspired Implicit Discourse Relation Classification". </li> <li> Exp2Imp: Source code to run experiments for Chapter 6 "Explicit to Implicit Discourse Relation Classification". </li> <li> RelCoh: Source code to run experiments for Chapter 7 "Discourse Relation-Enhanced Coherence Modeling". </li> <li> EntyRelCoh: Source code to run experiments for Chapter 8 "Coherence Modeling Using Entities and Discourse Relations". </li> </ul> The data used in the experiments can be downloaded from Linguistic Data Consortium (https://www.ldc.upenn.edu/): <ul> <li> PDTB 2.0: https://catalog.ldc.upenn.edu/LDC2008T05 </li> <li> PDTB 3.0: https://catalog.ldc.upenn.edu/LDC2019T05 </li> <li> TOEFL Dataset: https://catalog.ldc.upenn.edu/LDC2014T06 </li> <li> GCDC: https://github.com/aylai/GCDC-corpus </li> <li> CoheSentia: https://github.com/AviyaMn/CoheSentia </li> </ul>
提供机构:
heiDATA
创建时间:
2025-08-27
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作