
Data repository for "The genomic basis of temporal niche evolution in a diurnal rodent"

Source: DataCite Commons. Updated: 2023-07-13. Indexed: 2025-04-17.
Download link:
https://melbourne.figshare.com/articles/dataset/Data_repository_for_The_genomic_basis_of_temporal_niche_evolution_in_a_diurnal_rodent_/20321655
Description:
This repository contains datasets associated with the publication "The genomic basis of temporal niche evolution in a diurnal rodent", a collaboration between the Mallarino Lab at Princeton University and the Lucas Lab at the University of Manchester. This study examined the evolution of diel temporal niche traits in the diurnal African striped mouse (Rhabdomys pumilio) from a comparative and functional perspective. Additionally, this study presents the first genome assembly for this species, deposited at NCBI under BioProject: PRJNA858857.

This repository contains the following:

1) Rhabdomys_pumilio_final.gff.gz: A raw GFF-formatted gene annotation set for the Rhabdomys pumilio genome, produced by Funannotate and used in transcriptomic analyses.

2) Rhabdomys_pumilio.mouse_gene_name_final.gff.gz: A copy of the above GFF-formatted gene annotation set for the Rhabdomys pumilio genome, in which gene symbols from the laboratory mouse (Mus musculus) have been assigned to their predicted R. pumilio orthologs.

3) CompGenAnno.tar.gz: A folder of GFF-formatted annotations used in comparative genomic analyses, produced by directly lifting over gene annotations from the Mus musculus genome (annotation: GCF_000001635.27_GRCm39_genomic.gff, assembly: GCF_000001635.27_GRCm39_genomic.fna) onto each of 23 other murid genome assemblies. A manifest of the reference genomes is provided in .tsv format (manifest_of_genome_assemblies_and_liftover_annotations.txt), along with a file of locus trees (locus_trees.txt) for each orthologous group of genes used in comparative genomic analyses.

4) Table_of_RER_data_for_examined_species.xlsx: A large table of relative evolutionary rate (RER) measurements for each orthologous group of gene sequences (referenced against a Mus musculus transcript), for each species examined. Each species may be listed in multiple columns, reflecting different species representation for each orthologous group (i.e. cases in which the branch to a given leaf node originates at a different ancestral node because sister species are not represented in that alignment).

5) Rhabdomys_pumilio_Princeton_asm1.0_preNCBI.fasta.gz: A copy of the genome assembly prior to any re-formatting that NCBI performs after upload.

6) start_codon_perc_check.pl: A script used to filter multi-FASTA files of orthologous transcripts by the percentage of sequences with recovered start codons (prior to MAFFT alignment).

7) filter_seqs_by_percent_ref_length_2.pl: A script used to filter orthologous transcripts based on their pre-alignment length relative to the reference Mus musculus ortholog used to annotate them via LiftOff (prior to MAFFT alignment).

8) remove_orthos_with_large_gaps_vs_ref_1.pl: A script used to filter orthologous transcripts based on the presence of gaps or insertions (i.e. gaps introduced into the reference Mus musculus ortholog) after initial MAFFT alignment.

9) calc_plot_murid_RERs.r: A script used to calculate and plot relative evolutionary rates based on RAxML trees for processed murid ortholog alignments.
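
The pre-alignment filters described in items 6 and 7 can be illustrated with a minimal Python sketch. This is not the repository's Perl code: the function names, the use of "ATG" as the only start codon, the lower-bound-only length test, and the 0.7 threshold are all assumptions made for illustration, and the actual scripts may apply different criteria.

```python
from typing import Dict


def parse_fasta(text: str) -> Dict[str, str]:
    """Parse FASTA-formatted text into {sequence_id: sequence}."""
    records: Dict[str, str] = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token as the ID.
            name = line[1:].split()[0]
            records[name] = ""
        elif name is not None:
            records[name] += line.upper()
    return records


def start_codon_fraction(records: Dict[str, str]) -> float:
    """Fraction of sequences that begin with ATG (assumed start codon)."""
    if not records:
        return 0.0
    hits = sum(1 for seq in records.values() if seq.startswith("ATG"))
    return hits / len(records)


def filter_by_ref_length(
    records: Dict[str, str], ref_id: str, min_fraction: float
) -> Dict[str, str]:
    """Keep sequences at least min_fraction as long as the reference.

    Assumption: only a lower bound is applied; the real script may
    also reject sequences much longer than the reference.
    """
    ref_len = len(records[ref_id])
    return {
        name: seq
        for name, seq in records.items()
        if len(seq) >= min_fraction * ref_len
    }


# Hypothetical toy ortholog group; "mus" stands in for the reference.
example = ">mus\nATGAAACCCGGG\n>sp1\nATGAAACCC\n>sp2\nCCCAAA\n"
orthologs = parse_fasta(example)
fraction_with_start = start_codon_fraction(orthologs)
kept = filter_by_ref_length(orthologs, "mus", 0.7)
```

In this toy input, two of the three sequences begin with ATG, and the shortest sequence falls below 70% of the reference length, so it is dropped by the length filter.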
Provided by:
University of Melbourne
Created:
2022-07-15