WN18
收藏Figshare2020-02-19 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/WN18/11869548
下载链接
链接失效反馈官方服务:
资源简介:
This WORDNET TENSOR DATA consists of a collection of triplets (synset, relation_type, triplet) extracted from WordNet 3.0 (http://wordnet.princeton.edu). This data set can be seen as a 3-mode tensor depicting ternary relationships between synsets.The definitions file (wordnet-mlj12-definitions.txt) contains one synset per line with the following format: synset_id (a 8-digit unique identifier) intelligible name (word+POS_tag+sense_index), definition. The previous 3 pieces of information are separated by a tab ('\t').All wordnet-mlj12-*.txt files contain one triplet per line, with 2 synset_ids and relation type identifier in a tab separated format. The first element is the synset_id of the left hand side of the relation triple, the third one is the synset_id of the right hand side and the second element is the name of the type of relations between them.There are 40,943 synsets and 18 relation types among them. The training set contains 141,442 triplets, the validation set 5,000 and the test set 5,000.All triplets are unique and we made sure that all synsets appearing in the validation or test sets were occurring in the training set.The WN18.zip file contains the other files, with more compression than the default "download all".
创建时间:
2020-02-19



