
EVALution

DataCite Commons | Updated 2021-07-01 | Indexed 2025-04-16
Download link:
https://catalog.ldc.upenn.edu/LDC2020T06
Description:
<h3>Introduction</h3><br> <p>EVALution was developed by <a href="https://www.polyu.edu.hk/web/en/home/index.html">The Hong Kong Polytechnic University</a>. It comprises an English data set, EVALution 1.0, and a Mandarin Chinese data set, EVALution-MAN, both of which contain semantic relations and metadata for training and evaluating distributional semantic models.</p><br> <h3>Data</h3><br> <p>EVALution 1.0 consists of approximately 7500 English tuples extracted from <a href="http://conceptnet.io/">ConceptNet 5.0</a> and <a href="https://wordnet.princeton.edu/">WordNet</a> and filtered through automatic methods and crowdsourcing. The tuples instantiate several semantic relations between word pairs, including hypernymy, synonymy, antonymy and meronymy. The corpus also includes additional information that can be used to filter the pairs or to analyze results, such as relation domain, word frequency, word part of speech and word semantic field.</p><br> <p>EVALution-MAN consists of Chinese word pairs from two sources: <a href="http://cwn.ling.sinica.edu.tw/">Chinese Wordnet</a> and human participants who completed an elicitation task by supplying missing words in sentences. The word pairs obtained from the elicitation task were then judged by human raters for reliability.</p><br> <p>All text data is presented as UTF-8 encoded, tab-separated plain text.</p><br> <h3>Samples</h3><br> <p>Please view this <a href="desc/addenda/LDC2020T06.eng.txt">EVALution 1.0 sample</a> and <a href="desc/addenda/LDC2020T06.cmn.txt">EVALution-MAN sample</a>.</p><br> <h3>Updates</h3><br> <p>None at this time.</p><br> Portions © 2020 The Hong Kong Polytechnic University, © 2020 Trustees of the University of Pennsylvania
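Because the description states that the data is UTF-8 encoded, tab-separated plain text, a reader along the following lines may be a reasonable starting point. This is only a sketch: the listing does not document the column layout, so the three-column sample (word, relation, word), the relation labels, and the helper names `read_tsv` and `count_by_column` are all illustrative assumptions, not the corpus's actual schema.

```python
import csv
import io
from collections import Counter


def read_tsv(stream):
    """Yield each non-empty row of a tab-separated text stream as a list of strings."""
    # QUOTE_NONE treats quote characters as ordinary text, which suits plain TSV.
    reader = csv.reader(stream, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if row:  # skip blank lines
            yield row


def count_by_column(rows, index):
    """Count the values found in column `index`, ignoring rows that are too short."""
    return Counter(row[index] for row in rows if len(row) > index)


# Made-up sample in an ASSUMED layout (word, relation, word); not taken from the corpus.
sample = "dog\tIsA\tanimal\nhot\tAntonym\tcold\n猫\tIsA\t动物\n"
rows = list(read_tsv(io.StringIO(sample)))
relation_counts = count_by_column(rows, 1)
```

For a real file, the same function could be given `open(path, encoding="utf-8", newline="")` in place of the in-memory stream. The column index passed to `count_by_column` should be checked against the sample files linked in the description.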
Provider:
Linguistic Data Consortium
Created:
2020-11-30