Data from: Whole-genome analysis of giraffe supports four distinct species
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/4781355
下载链接
链接失效反馈官方服务:
资源简介:
The data deposited here was generated by and reported in Coimbra et al. (2021).
Annotation of the Kordofan giraffe genome assembly:
SGN_Gcam_PLA01_asm_pseudohap.final.gtf.gz: annotation of protein-coding genes generated with BRAKER using protein sequences from Bos taurus (GCA_002263795.2) as extrinsic evidence.
SGN_Gcam_PLA01_asm_pseudohap.final.cds.all.fa.gz: coding sequences of the annotated genes (including alternative transcripts).
SGN_Gcam_PLA01_asm_pseudohap.final.pep.all.fa.gz: peptide sequences of the annotated genes (including alternative transcripts).
SGN_Gcam_PLA01_asm_pseudohap.final.repeatmasker.out.gz: annotation of repetitive elements generated with RepeatMasker and RepeatModeler.
Nuclear phylogenomic inference:
genomefragments.tar.gz: main dataset of 1,068 genome fragment (GF) alignments, each 450 kbp in length, with 43 giraffe (Giraffa spp.) and an okapi (Okapia johnstoni). The genome consensus sequences and subsequent processed genome fragment alignments were generated from BAM files following the method described in the article.
estimated_gene_trees.tree: list containing the 1,068 maximum likelihood GF trees inferred with IQ-TREE.
estimated_species_trees.tree: multispecies coalescent tree inferred by ASTRAL from the 1,068 maximum likelihood GF trees listed in 'estimated_gene_trees.tree'.
annotation.txt: annotation file used in the analysis of quartet frequencies with DiscoVista.
Phylogeny of mitochondrial genomes:
mtdna_alignments.tar: dataset containing alignments of the 13 mitochondrial protein coding genes, with 50 giraffe and an okapi, a nexus file specifying partitions, and the resulting maximum likelihood tree iferred in IQ-TREE.
Demographic reconstruction:
psmc_files.tar: output files of the PSMC analysis of 21 giraffe.
psmc_boot_files.tar.gz: output files of the PSMC analysis with 100 bootstrap replicates.
Heterozygosity:
foldedSFS.tar: dataset containing the per-sample bootstrapped folded site frequency spectrum (SFS) of 50 giraffe individuals estimated with ANGSD and its subprogram realSFS.
ROH and Inbreeding:
giraffeROH.tar: CSV files extracted from RZooROH results:
roh_outputs_giraffe.csv: realized inbreeding coefficient (FROH) per homozygosity-by-descent (HBD) class per individual.
giaffe_roh_sgmt.csv: number and accumulated length of ROH per individual.
创建时间:
2023-09-28



