A demonstration of unsupervised machine learning in species delimitation
收藏DataONE2019-09-23 更新2025-07-19 收录
下载链接:
https://search.dataone.org/view/sha256:901324afe6fa58d9a3c7a5cdce1deaa8ff608ec90465e1a5b2524a33d1530bbc
下载链接
链接失效反馈官方服务:
资源简介:
One major challenge to delimiting species with genetic data is successfully differentiating population structure from species-level divergence, an issue exacerbated in taxa inhabiting naturally fragmented habitats. Many fields of science are now using machine learning, and in evolutionary biology supervised machine learning has recently been used to infer species boundaries. These supervised methods require training data with associated labels. Conversely, unsupervised machine learning (UML) uses inherent data structure and does not require user-specified training labels, potentially providing more objectivity in species delimitation. Here we demonstrate the utility of three UML approaches (random forests, variational autoencoders, t-distributed stochastic neighbor embedding) for species delimitation in an arachnid taxon with high population genetic structure (Opiliones, Laniatores, Metanonychus). We find that UML approaches successfully cluster samples according to species-level diverg...
创建时间:
2025-06-28



