five

Lexicostatistical data (raw and derived text files) on 200 basic words in each of 95 Indoeuropean languages as collected/collated by Professor Isidore Dyen circa 1960

收藏
Research Data Australia2024-12-14 收录
下载链接:
https://researchdata.edu.au/lexicostatistical-raw-derived-circa-1960/249207
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset formed the basis of the 1992 seminal work 'An Indoeuropean Classification: A Lexicostatistical Experiment' by Isidore Dyen, Joseph B Kruskal and Paul Black. The publication tested lexicostatistical methods against what was already known about Indoeuropean languages using more traditional methods. The dataset comprises three descriptive documents and six raw and derived data text files. The three descriptive documents provide background and publication information on the dataset. The six text files contain all the data as collected and collated by Isidore Dyen circa 1960. One file contains the data that was placed on punched cards circa 1970, and transferred to disc circa 1990. It gives cognation data among 95 Indoeuropean speech varieties. For each meaning in the list of 200 basic meanings the file contains the forms used in the 95 speech varieties collected by Isidore Dyen and the cognation decisions among these forms made by Dyen circa 1970. Other files contain the statistical matrices produced from the raw data to determine similarities/differences between languages in order to create a tree like structure of evolution of Indoeuropean languages. Virtual copy of the data available at wordgumbo http://www.wordgumbo.com/ie/cmp
提供机构:
Charles Darwin University
二维码
社区交流群
二维码
科研交流群
商业服务