five

or_stickleback.tar

收藏
DataONE2013-04-01 更新2024-06-27 收录
下载链接:
https://search.dataone.org/view/null
下载链接
链接失效反馈
官方服务:
资源简介:
The contents of or_sticleback.tar.gz relate to the study, "The population structure and recent colonization history of Oregon threespine stickleback determined using RAD-seq" by Catchen, Julian, Bassham, Susan, Wilson, Taylor, Currey, Mark, O'Brien, Conor, Yeates, Quick, and Cresko, William. The raw data used to create these files are available from the Sequence Read Archive (SRA), accession SRA070979. The Stickleback used for this study were collected from both marine and freshwater sites along the Oregon coast, and from freshwater habitats in the Willamette Basin and central Oregon. Extracted genomic DNA was processed into RAD libraries using the restriction enzyme SbfI-HF to digest the genome. After sequencing, reads were aligned to the stickleback reference genome using GSnap and processed with the Stacks pipeline. File Contents ------------- build_samples.sh - this file contains the commands run to clean and demultiplex the data. This transforms the raw data files into cleaned sample files, one per individual stickleback fish, named according to population. build_tags.sh - this file contains the commands run to align demultiplexed reads to the stickleback reference genome and to execute the Stacks pipeline. batch_2.fst_summary.tsv - a summary of the Fst values for each population. batch_2.structure_1000.tsv - 1000, randomly selected loci take from the batch_2.structure.tsv file for analysis in Structure. batch_2.structure.tsv - all variant sites in the Oregon dataset formatted for analysis in Structure. batch_2.sumstats_summary.tsv - summaries of the summary statistics, Pi, observed/expected heterozygosity, Fis, etc. batch_2.sumstats.tsv - summary statistics for each variable site in the set, Pi, observed/expected heterozygosity, Fis, etc. batch_2.vcf - variant sites in the Oregon dataset formatted in VCF (http://www.1000genomes.org/node/101). popmap - the population map file fed to the populations program. Describes which samples belong to which populations. Here is the population key: Crooked River 1 Cushman Slough 2 Pony Creek Reservoir 3 Paulina Lake 4 South Jetty 5 South Twin Lake 6 Winchester Creek 7 Riverbend 8 Millport Slough 9
创建时间:
2013-04-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作