WN18
Source: DataCite Commons. Updated: 2025-05-01. Indexed: 2024-07-28.
Download link:
https://figshare.com/articles/WN18/11869548/1
Description:
This WordNet tensor data set consists of a collection of triplets (synset, relation_type, synset) extracted from WordNet 3.0 (http://wordnet.princeton.edu). It can be seen as a 3-mode tensor depicting ternary relationships between synsets.

The definitions file (wordnet-mlj12-definitions.txt) contains one synset per line in the following format: synset_id (an 8-digit unique identifier), intelligible name (word+POS_tag+sense_index), definition. These three fields are separated by tabs ('\t').

All other wordnet-mlj12-*.txt files contain one triplet per line, with two synset_ids and a relation type identifier in tab-separated format. The first element is the synset_id of the left-hand side of the relation triplet, the second is the name of the relation type, and the third is the synset_id of the right-hand side.

There are 40,943 synsets and 18 relation types. The training set contains 141,442 triplets, the validation set 5,000, and the test set 5,000.

All triplets are unique, and we made sure that every synset appearing in the validation or test sets also occurs in the training set.
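
The file formats described above can be read with a few lines of Python. The following is a minimal sketch, not code shipped with the data set: the names `Triplet`, `parse_triplets`, `parse_definitions`, and `unseen_synsets` are hypothetical, and the sample rows are made-up placeholders rather than real WN18 entries. It assumes that every non-empty line follows the tab-separated layout stated in the description.

```python
from typing import Dict, Iterable, List, NamedTuple, Set, Tuple


class Triplet(NamedTuple):
    head: str      # synset_id of the left-hand side
    relation: str  # name of the relation type
    tail: str      # synset_id of the right-hand side


def parse_triplets(lines: Iterable[str]) -> List[Triplet]:
    """Parse 'head<TAB>relation<TAB>tail' lines, skipping empty ones."""
    triplets = []
    for line in lines:
        line = line.rstrip("\r\n")
        if not line:
            continue
        head, relation, tail = line.split("\t")
        triplets.append(Triplet(head, relation, tail))
    return triplets


def parse_definitions(lines: Iterable[str]) -> Dict[str, Tuple[str, str]]:
    """Map synset_id -> (intelligible name, definition)."""
    definitions = {}
    for line in lines:
        line = line.rstrip("\r\n")
        if not line:
            continue
        # Split on the first two tabs only, in case a definition contains one.
        synset_id, name, definition = line.split("\t", 2)
        definitions[synset_id] = (name, definition)
    return definitions


def synsets_of(triplets: Iterable[Triplet]) -> Set[str]:
    """Collect every synset_id that appears on either side of a triplet."""
    ids = set()
    for t in triplets:
        ids.add(t.head)
        ids.add(t.tail)
    return ids


def unseen_synsets(train: Iterable[Triplet], other: Iterable[Triplet]) -> Set[str]:
    """Synsets in `other` (validation or test) that never occur in `train`."""
    return synsets_of(other) - synsets_of(train)


if __name__ == "__main__":
    # Placeholder rows for illustration only; not taken from the data set.
    train = parse_triplets([
        "00000001\t_relation_a\t00000002\n",
        "00000002\t_relation_b\t00000003\n",
    ])
    valid = parse_triplets(["00000003\t_relation_a\t00000001\n"])
    print(len(train), "training triplets")
    print("unseen synsets in validation:", unseen_synsets(train, valid))
```

Synset IDs are kept as strings so that leading zeros in the 8-digit identifiers are preserved. To use the sketch on the real files, pass an open file handle (for example `open(path, encoding="utf-8")`) in place of the placeholder lists; on the published splits, `unseen_synsets` should return an empty set if the property stated in the description holds.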
Provider:
figshare
Created:
2020-02-19



