DC电路领域特定语料库
收藏arXiv2012-04-28 更新2024-08-06 收录
下载链接:
http://arxiv.org/abs/1204.6364v1
下载链接
链接失效反馈官方服务:
资源简介:
本研究开发了一个特定于DC电路领域的语料库,该语料库由141个网络资源中的文本手动构建而成,包含1029个句子和18,834个单词。该语料库的创建旨在评估一个特定领域的文本到知识映射原型,通过分析语料库中的词汇和语法结构,以及手动开发的概念结构,来调整原型的词汇资源和知识模型。此外,该语料库还用于进行修辞分析,以证明其在传达语义方面的代表性。最终,通过主题和话语分析,评估了原型在话语覆盖方面的表现。
This study developed a corpus specialized in the domain of DC circuits. This corpus was manually constructed from texts collected from 141 web resources, containing 1,029 sentences and 18,834 words. The corpus was created to evaluate a domain-specific text-to-knowledge mapping prototype, with its lexical resources and knowledge model adjusted through analysis of the corpus’s lexical and grammatical structures as well as the manually developed conceptual framework. Furthermore, this corpus was employed for rhetorical analysis to verify its representativeness in semantic communication. Finally, the prototype’s performance in discourse coverage was evaluated via thematic and discourse analysis.
提供机构:
计算机科学与工程系,库尔纳工程技术大学(KUET),孟加拉国
创建时间:
2012-04-28



