five

CCGBank: CCG Combinatory Categorical Grammar for Penn Treebank 2 - LDC2005T13

收藏
academictorrents.com2025-03-22 收录
下载链接:
https://academictorrents.com/details/0c11a1615ffb5b632d2f886fcd344fa7c43f5968
下载链接
链接失效反馈
官方服务:
资源简介:
# CCGbank * Item Name:CCGbank * Author(s):Julia Hockenmaier, Mark Steedman * LDC Catalog No.:LDC2005T13 * ISBN:1-58563-340-2 * ISLRN:181-921-208-336-7 * DOI: * Release Date:May 15, 2005 * Member Year(s):2005 * DCMI Type(s):Text * Data Source(s):newswire * Project(s):GALE, TIDES * Application(s):automatic content extraction, cross-lingual information retrieval, information detection, natural language processing * Language(s):English * Language ID(s):eng * Citation:Hockenmaier, Julia, and Mark Steedman. CCGbank LDC2005T13. Web Download. Philadelphia: Linguistic Data Consortium, 2005. ## Introduction CCGbank was developed by the University of Edinburgh and contains approximately 49,000 sentences of English text formatted in Combinatory Categorial Grammar (CCG) derivations. The sentences used for this corpus are from [Treebank-2 (LDC95T7)]() and represent 99.44% of the entire treebank. For the remaining 2

CCGbank数据集由爱丁堡大学开发,该数据集包含约49,000个英语句子,这些句子按照组合范畴语法(Combinatory Categorial Grammar,简称CCG)的推导格式进行编排。该语料库中使用的句子来源于[Treebank-2 (LDC95T7)](),并代表了整个语料库的99.44%。剩余的2
提供机构:
academictorrents.com
搜集汇总
背景与挑战
背景概述
CCGBank是一个基于组合范畴语法(CCG)的英语数据集,包含约49,000句从Treebank-2提取的文本,覆盖原树库的99.44%。该资源发布于2005年,主要用于自然语言处理、信息检索和内容提取等研究。
以上内容由遇见数据集搜集并总结生成
二维码
社区交流群
二维码
科研交流群
商业服务