five

Data from: Whole-genome analysis of giraffe supports four distinct species

收藏
Mendeley Data2024-06-25 更新2024-06-27 收录
下载链接:
https://zenodo.org/records/4781356
下载链接
链接失效反馈
官方服务:
资源简介:
The data deposited here was generated by and reported in Coimbra et al. (2021). Annotation of the Kordofan giraffe genome assembly: SGN_Gcam_PLA01_asm_pseudohap.final.gtf.gz: annotation of protein-coding genes generated with BRAKER using protein sequences from Bos taurus (GCA_002263795.2) as extrinsic evidence. SGN_Gcam_PLA01_asm_pseudohap.final.cds.all.fa.gz: coding sequences of the annotated genes (including alternative transcripts). SGN_Gcam_PLA01_asm_pseudohap.final.pep.all.fa.gz: peptide sequences of the annotated genes (including alternative transcripts). SGN_Gcam_PLA01_asm_pseudohap.final.repeatmasker.out.gz: annotation of repetitive elements generated with RepeatMasker and RepeatModeler. Nuclear phylogenomic inference: genomefragments.tar.gz: main dataset of 1,068 genome fragment (GF) alignments, each 450 kbp in length, with 43 giraffe (Giraffa spp.) and an okapi (Okapia johnstoni). The genome consensus sequences and subsequent processed genome fragment alignments were generated from BAM files following the method described in the article. estimated_gene_trees.tree: list containing the 1,068 maximum likelihood GF trees inferred with IQ-TREE. estimated_species_trees.tree: multispecies coalescent tree inferred by ASTRAL from the 1,068 maximum likelihood GF trees listed in 'estimated_gene_trees.tree'. annotation.txt: annotation file used in the analysis of quartet frequencies with DiscoVista. Phylogeny of mitochondrial genomes: mtdna_alignments.tar: dataset containing alignments of the 13 mitochondrial protein coding genes, with 50 giraffe and an okapi, a nexus file specifying partitions, and the resulting maximum likelihood tree iferred in IQ-TREE. Demographic reconstruction: psmc_files.tar: output files of the PSMC analysis of 21 giraffe. psmc_boot_files.tar.gz: output files of the PSMC analysis with 100 bootstrap replicates. Heterozygosity: foldedSFS.tar: dataset containing the per-sample bootstrapped folded site frequency spectrum (SFS) of 50 giraffe individuals estimated with ANGSD and its subprogram realSFS. ROH and Inbreeding: giraffeROH.tar: CSV files extracted from RZooROH results: roh_outputs_giraffe.csv: realized inbreeding coefficient (FROH) per homozygosity-by-descent (HBD) class per individual. giaffe_roh_sgmt.csv: number and accumulated length of ROH per individual.
创建时间:
2023-06-28
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作