five

SenSem Lexicons

收藏
DataCite Commons2021-07-01 更新2025-04-16 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC2015L01
下载链接
链接失效反馈
官方服务:
资源简介:
<h3>Introduction</h3><br> <p>SenSem (Sentence Semantics) Lexicons was developed by <a href="http://grial.uab.es/index.php">GRIAL</a>, the Linguistic Applications Inter-University Research Group that includes the following Spanish institutions: the <a href="http://www.uab.es/web/universitat-autonoma-de-barcelona-1345467954774.html">Universitat Autonoma de Barcelona</a>, the <a href="http://www.ub.edu/web/ub/en/index.html">Universitat de Barcelona</a>, the <a href="http://www.udl.es/en.html">Universitat de Lleida</a> and <a href="http://www.uoc.edu/portal/en/index.html">the Universitat Oberta de Catalunya</a>. It contains feature descriptions for approximately 1,300 Spanish verbs and 1,300 Catalan verbs in the SenSem Databank (<a href="../../../LDC2015T02">LDC2015T02</a>). GRIAL's work focuses on resources for applied linguistics, including lexicography, translation and natural language processing.</p><br> <h3>Data</h3><br> <p>The verb features for each language consist of two groups: those codified manually, including definition, <a href="https://wordnet.princeton.edu/">WordNet</a> synset, <a href="http://www.glottopedia.org/index.php/Aktionsart">Aktionsart</a>, arguments and semantic functions; and those extracted automatically from the SenSem Databank. Among the latter are verb frequency, semantic construction, syntactic categories and constituent order. The verbs analyzed correspond to the 250 most frequent verbs in Spanish and 320 lemmas in Catalan. Further information about the SenSem project can be obtained from the GRIAL website at <a href="http://grial.uab.es/sensem/corpus">http://grial.uab.es/sensem/corpus</a>.</p><br> <p>Data is presented in a single XML file per language.</p><br> <h3>Samples</h3><br> <p>Please view this <a href="desc/addenda/LDC2015L01.jpg">sample</a>.</p><br> <h3>Updates</h3><br> <p>None at this time.</p></br> Portions © 2015 Dr. Ana Fernandez Montraveta, Dr. Gloria Vázquez-Garcia, Trustees of the University of Pennsylvania

<h3>引言</h3><br><p>SenSem(句子语义)词库由<a href="http://grial.uab.es/index.php">GRIAL</a>开发,该跨校语言应用研究团队包含以下西班牙高校:<a href="http://www.uab.es/web/universitat-autonoma-de-barcelona-1345467954774.html">巴塞罗那自治大学(Universitat Autonoma de Barcelona)</a>、<a href="http://www.ub.edu/web/ub/en/index.html">巴塞罗那大学(Universitat de Barcelona)</a>、<a href="http://www.udl.es/en.html">莱里达大学(Universitat de Lleida)</a>以及<a href="http://www.uoc.edu/portal/en/index.html">加泰罗尼亚开放大学(Universitat Oberta de Catalunya)</a>。SenSem数据库(<a href="../../../LDC2015T02">LDC2015T02</a>)中收录了约1300个西班牙语动词与1300个加泰罗尼亚语动词的特征描述。GRIAL的研究聚焦于应用语言学相关资源,涵盖词典编纂、翻译与自然语言处理等领域。</p><br><h3>数据</h3><br><p>每种语言的动词特征分为两类:一类为人工编码特征,包含释义、<a href="https://wordnet.princeton.edu/">WordNet</a>同义词集、<a href="http://www.glottopedia.org/index.php/Aktionsart">动作方式(Aktionsart)</a>、论元与语义功能;另一类为从SenSem数据库中自动提取的特征,后者包含动词词频、语义结构、句法范畴与成分语序。本次分析的动词覆盖西班牙语最常用的250个动词,以及加泰罗尼亚语的320个词元。如需了解SenSem项目的更多信息,可访问GRIAL官网:<a href="http://grial.uab.es/sensem/corpus">http://grial.uab.es/sensem/corpus</a>。</p><br><p>数据按语言分别存储为单个XML文件。</p><br><h3>示例</h3><br><p>请查看该<a href="desc/addenda/LDC2015L01.jpg">示例文件</a>。</p><br><h3>更新记录</h3><br><p>暂无更新记录。</p><br>部分内容 © 2015 安娜·费尔南德斯·蒙特拉韦塔博士、格洛丽亚·巴斯克斯-加西亚博士、宾夕法尼亚大学托管方
提供机构:
Linguistic Data Consortium
创建时间:
2020-11-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作