DeSSE
Source: arXiv. Indexed 2025-09-30.
Download link:
https://github.com/serenayj/DeSSE
Resource description:
This dataset was collected from an undergraduate social science class in which students wrote essays on race relations. It was built to support analysis of student writing and feedback on sentence organization. During annotation, annotators identified split points in complex sentences and rewrote each sentence as a set of semantically complete simple sentences. The dataset contains 12,000 training examples and 790 test examples. The task is to decompose complex sentences into simple sentences.
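To make the task shape concrete, here is a minimal Python sketch of how one example might be represented: a complex sentence paired with its simple-sentence rewrites. The class name `SplitExample`, its field names, the helper `is_well_formed`, and the sample sentence are all hypothetical and chosen for illustration; they are not taken from the dataset or its repository, whose actual file format is not described here.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class SplitExample:
    """Hypothetical container for one example; field names are assumptions."""
    complex_sentence: str
    simple_sentences: List[str]


def is_well_formed(example: SplitExample) -> bool:
    """Basic sanity check: a decomposition should yield at least two
    non-empty simple sentences, each ending in terminal punctuation."""
    if len(example.simple_sentences) < 2:
        return False
    return all(
        s.strip() and s.strip()[-1] in ".!?"
        for s in example.simple_sentences
    )


# Illustrative sentence invented for this sketch, not drawn from the dataset.
example = SplitExample(
    complex_sentence="The course was demanding, but I learned a lot.",
    simple_sentences=["The course was demanding.", "I learned a lot."],
)
```

A check like this only validates surface form. It says nothing about whether the rewrites preserve the meaning of the original sentence, which is what the annotation is meant to guarantee.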



