Source code and data for the PhD Thesis "Linguistically-Inspired Neural Coherence Modeling"
DataCite Commons. Updated 2025-09-03; indexed 2026-05-07.
Download link:
https://heidata.uni-heidelberg.de/citation?persistentId=doi:10.11588/DATA/ZBNUCG
Description:
This dataset contains source code and data used in the PhD thesis "Linguistically-Inspired Neural Coherence Modeling". The dataset is split into five repositories:
<ul>
<li>
StruSim: Source code to run experiments for Chapter 4 "Document Structure Similarity-Enhanced Coherence Modeling".
</li>
<li>
ConnRel: Source code to run experiments for Chapter 5 "Annotation-inspired Implicit Discourse Relation Classification".
</li>
<li>
Exp2Imp: Source code to run experiments for Chapter 6 "Explicit to Implicit Discourse Relation Classification".
</li>
<li>
RelCoh: Source code to run experiments for Chapter 7 "Discourse Relation-Enhanced Coherence Modeling".
</li>
<li>
EntyRelCoh: Source code to run experiments for Chapter 8 "Coherence Modeling Using Entities and Discourse Relations".
</li>
</ul>
The data used in the experiments can be obtained from the Linguistic Data Consortium (https://www.ldc.upenn.edu/) or, for GCDC and CoheSentia, from GitHub:
<ul>
<li>
PDTB 2.0: https://catalog.ldc.upenn.edu/LDC2008T05
</li>
<li>
PDTB 3.0: https://catalog.ldc.upenn.edu/LDC2019T05
</li>
<li>
TOEFL Dataset: https://catalog.ldc.upenn.edu/LDC2014T06
</li>
<li>
GCDC: https://github.com/aylai/GCDC-corpus
</li>
<li>
CoheSentia: https://github.com/AviyaMn/CoheSentia
</li>
</ul>
Provider:
heiDATA
Created:
2025-08-27



