five

WN18

收藏
DataCite Commons2020-08-25 更新2024-07-28 收录
下载链接:
https://figshare.com/articles/WN18/11869548
下载链接
链接失效反馈
官方服务:
资源简介:
This WORDNET TENSOR DATA consists of a collection of triplets (synset, relation_type, triplet) extracted from WordNet 3.0 (http://wordnet.princeton.edu). This data set can be seen as a 3-mode tensor depicting ternary relationships between synsets.<br><br>The definitions file (wordnet-mlj12-definitions.txt) contains one synset per line with the following format: synset_id (a 8-digit unique identifier) intelligible name (word+POS_tag+sense_index), definition. The previous 3 pieces of information are separated by a tab ('\t').<br>All wordnet-mlj12-*.txt files contain one triplet per line, with 2 synset_ids and relation type identifier in a tab separated format. The first element is the synset_id of the left hand side of the relation triple, the third one is the synset_id of the right hand side and the second element is the name of the type of relations between them.<br>There are 40,943 synsets and 18 relation types among them. The training set contains 141,442 triplets, the validation set 5,000 and the test set 5,000.<br>All triplets are unique and we made sure that all synsets appearing in the validation or test sets were occurring in the training set.<br>The WN18.zip file contains the other files, with more compression than the default "download all".<br>
提供机构:
figshare
创建时间:
2020-02-19
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作