five

Recommendations for population and individual diagnostic SNP selection in non-model species

收藏
DataONE2024-12-21 更新2025-04-26 收录
下载链接:
https://search.dataone.org/view/sha256:8e8dea4bfd5e39f69730e437371d09a28f928a512cd42d836b45ce22f187f734
下载链接
链接失效反馈
官方服务:
资源简介:
Despite substantial reductions in the cost of sequencing over the last decade, genetic panels remain relevant due to their cost-effectiveness and flexibility across a variety of sample types. In particular, single nucleotide polymorphism (SNP) panels are increasingly favored for conservation applications. SNP panels are often used because of their adaptability, effectiveness with low-quality samples, and cost-efficiency for use in population monitoring and forensics. However, the selection of diagnostic SNPs for population assignment and individual identification can be challenging. The consequences of poor SNP selection are under-powered panels, inaccurate results, and monetary loss. Here, we develop a novel user-friendly SNP selection pipeline for population assignment and individual identification, mPCRselect. mPCRselect allows any researcher, who has sufficient SNP-level data, to design a successful and cost-effective SNP panel for species of conservation concern., 13 tigers from each of the Amur and Bengal subspecies, and 13 Generic (N = 39 individuals total from Armstrong et al., 2024) were selected for this dataset. Unrelated individuals were previously identified in Armstrong et al., 2024. Then, we created a subset of 10,000 markers randomly sampled from across the genome using the remaining markers in linkage equilibrium. This dataset is provided as the test data for the mPCRselect pipeline (https://github.com/ellieearmstrong/mPCRselect). References: Armstrong et al. 2024, DOI: 10.1073/pnas.2402924121 , , # Data from: Recommendations for Population and Individual Diagnostic SNP Selection in Non-Model Species [https://doi.org/10.5061/dryad.0k6djhb96](https://doi.org/10.5061/dryad.0k6djhb96) ## Description of the data and file structure Tiger data from Armstrong et al. 2024 and Armstrong et al. 2021. File contains tigers from five tiger subspecies and generic (admixed) tigers which are unrelated. Data has been filtered down to two chromosomes (B1 and F2) and is filtered as described in manuscript. Test dataset for mPCRselect pipeline. ### Files and variables #### File: tiger\_pop.csv **Description:** Information corresponding to individuals contained in the VCF and their population assignment. ##### Variables * Sample: Tiger samples correspond to individuals from Armstrong et al. 2024 (see Supplementary Table 1) * Population: One of five tiger subspecies (Amur, Bengal, Indochinese, Malayan, Sumatran) or admixed population (Generic) #### File: tiger\_chromosomes.txt **Description:...
创建时间:
2024-12-21
二维码
社区交流群
二维码
科研交流群
商业服务