five

linguistics_datasets

收藏
Figshare2022-11-01 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/linguistics_datasets/20791696
下载链接
链接失效反馈
官方服务:
资源简介:
This item includes four linguistic datasets in the form of .xlsx files, which all follow the same tab structure: glottodata contains the linguistic data. Rows are languages identified with a glottocode (www.glottolog.org), columns are linguistic features. structure contains a list of features taken into consideration for a particular analysis, as well as their weight and type, with optionally further attributes. description contains a text description or all variables and their possible values, as well as further info (e.g. what the design was based on, further remarks, etc.) references contains the references to the sources and page number for each datapoint. sample contains the sample of languages taken into consideration for a particular analysis. readme contains contact and citation information lookup defines the types of data The four datasets differ in their structure and sample tabs. ling_nc_all contains the full range of variables, but the samples tab lacks control languages ling_nc_phon contains a selection of variables pertaining to phonology, and the samples tab lacks control languages ling_nc_ms contains a selection of variables pertaining to morphosyntax, and the samples tab lacks control languages ling_c_all contains the full range of variables, and the samples tab includes control languages
创建时间:
2022-11-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作