Lists of Words Corpus (UHLCS)
Source: Mendeley Data. Updated: 2024-01-31. Indexed: 2024-06-27.
Download link:
https://etsin.fairdata.fi/dataset/c52669cb-f054-477d-b79a-1b9086cde205
Resource description:
The corpus is available in Kielipankki - the Language Bank of Finland (puhti.csc.fi, access rights instructions: http://www.kielipankki.fi/access).
Location: /appl/data/kielipankki/words (only Finnish available)

The lists of words located at the University of Helsinki Language Corpus Server were generated from the corpora of the following languages:

* Dutch: 178,430 words, 1,998,881 characters
* Finnish: proper names: 714 words, 4,488 characters; general list of words: 264,654 words, 3,171,148 characters
* French: 138,257 words, 1,524,757 characters
* German: 160,086 words, 2,060,734 characters
* Italian: 60,453 words, 561,982 characters
* Norwegian: 61,843 words, 589,234 characters
* Swedish: 13,328 words, 117,685 characters

Type of the documents: words in alphabetic order.
Character encoding: ASCII.

The lists of words were compiled at the University of Helsinki, Department of General Linguistics. The Lists of Words Corpus is a part of the UHLCS corpus collection. UHLCS has many different IPR holders. Should you have any questions regarding the collection, please contact Pirkko Suihkonen (suihkonen.pirkko@gmail.com).

License details: http://urn.fi/urn:nbn:fi:lb-2015041002

Detailed information:
http://www.ling.helsinki.fi/uhlcs/readme-all/README-lexical-data-bases.html
http://urn.fi/urn:nbn:fi:lb-201406041

The purpose of the resource use must be outlined in a research plan.
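
The word and character totals given in the description could be checked against a local copy of a list with a short Python sketch such as the one below. It rests on assumptions that the description does not confirm: that each list is a plain-text file with one word per line, and that the character totals exclude line breaks. The function name `word_list_stats` and any file name passed to it are hypothetical.

```python
from pathlib import Path


def word_list_stats(path):
    """Return (word_count, character_count) for a one-word-per-line list.

    Assumption: one word per line; line breaks and blank lines are not
    counted. The corpus description does not specify either point.
    """
    words = []
    # The description states ASCII encoding; errors="replace" keeps the
    # sketch from failing if a file happens to contain other bytes.
    with Path(path).open(encoding="ascii", errors="replace") as handle:
        for line in handle:
            word = line.strip()
            if word:  # skip blank lines
                words.append(word)
    return len(words), sum(len(word) for word in words)
```

If the totals returned for a file differ from the published figures, the likeliest cause is a different counting convention (for example, including line terminators) rather than a damaged file.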
Created:
2024-01-31



