Source code and data for the PhD Thesis "Linguistically-Inspired Neural Coherence Modeling"

Name: Source code and data for the PhD Thesis "Linguistically-Inspired Neural Coherence Modeling"
Creator: heiDATA
Published: 2025-09-03 09:44:47
License: No description available

DataCite Commons | Updated: 2025-09-03 | Indexed: 2026-05-07

Download link:

https://heidata.uni-heidelberg.de/citation?persistentId=doi:10.11588/DATA/ZBNUCG
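The link above carries the dataset's persistent identifier as a query parameter. As a minimal sketch, the snippet below extracts that identifier and derives two related URLs without making any network request. The `doi.org` resolver form is standard for DOIs. The `/api/datasets/:persistentId/` path is an assumption: it follows the Dataverse native API convention and presumes that heiDATA exposes it, which this listing does not confirm. All function names are hypothetical.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Citation link exactly as given in this listing.
CITATION_URL = (
    "https://heidata.uni-heidelberg.de/citation"
    "?persistentId=doi:10.11588/DATA/ZBNUCG"
)


def persistent_id(citation_url: str) -> str:
    """Return the value of the persistentId query parameter."""
    query = parse_qs(urlsplit(citation_url).query)
    return query["persistentId"][0]


def doi_resolver_url(pid: str) -> str:
    """Turn 'doi:10.x/y' into the generic resolver form 'https://doi.org/10.x/y'."""
    doi = pid[len("doi:"):] if pid.startswith("doi:") else pid
    return "https://doi.org/" + doi


def dataverse_metadata_url(citation_url: str) -> str:
    """Build a metadata URL on the same host.

    Assumption: the host is a Dataverse installation that serves the
    native API path /api/datasets/:persistentId/ (not stated in the listing).
    """
    parts = urlsplit(citation_url)
    query = urlencode({"persistentId": persistent_id(citation_url)}, safe=":/")
    return f"{parts.scheme}://{parts.netloc}/api/datasets/:persistentId/?{query}"


if __name__ == "__main__":
    pid = persistent_id(CITATION_URL)
    print(pid)
    print(doi_resolver_url(pid))
    print(dataverse_metadata_url(CITATION_URL))
```

The functions only build strings, so whether the metadata URL actually responds should be checked against the repository itself before relying on it.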


Description:

This dataset contains the source code and data used in the PhD thesis "Linguistically-Inspired Neural Coherence Modeling". The dataset is split into five repositories:

- StruSim: Source code to run experiments for Chapter 4, "Document Structure Similarity-Enhanced Coherence Modeling".
- ConnRel: Source code to run experiments for Chapter 5, "Annotation-inspired Implicit Discourse Relation Classification".
- Exp2Imp: Source code to run experiments for Chapter 6, "Explicit to Implicit Discourse Relation Classification".
- RelCoh: Source code to run experiments for Chapter 7, "Discourse Relation-Enhanced Coherence Modeling".
- EntyRelCoh: Source code to run experiments for Chapter 8, "Coherence Modeling Using Entities and Discourse Relations".

The data used in the experiments can be downloaded from the Linguistic Data Consortium (https://www.ldc.upenn.edu/) or, for GCDC and CoheSentia, from GitHub:

- PDTB 2.0: https://catalog.ldc.upenn.edu/LDC2008T05
- PDTB 3.0: https://catalog.ldc.upenn.edu/LDC2019T05
- TOEFL Dataset: https://catalog.ldc.upenn.edu/LDC2014T06
- GCDC: https://github.com/aylai/GCDC-corpus
- CoheSentia: https://github.com/AviyaMn/CoheSentia

Provider:

heiDATA

Created:

2025-08-27
