five

GBA3 Sequence alignment data for Phylogenetic and CodeML analysis

收藏
DataCite Commons2022-05-12 更新2025-04-09 收录
下载链接:
https://dmportal.biodata.pt/citation?persistentId=doi:10.34636/DMPortal/DBDXKB
下载链接
链接失效反馈
官方服务:
资源简介:
1- Description of data files Sequence alignment used for Phylogenetic and CodeML analysis described in https://doi.org/10.1038/s41598-020-68106-y. Deposited files: -Nucleotide sequence alignment_GBA3.fasta (fasta format); -Nucleotide sequence alignment_GBA3.phy (phylip format input for PhyML3.0); -nucleotide sequence alignment_GBA3_stats.txt (PhyML3.0 output with data and analysis description); -nucleotide sequence alignment_GBA3_stout.txt (PhyML3.0 output describing smart model and phylogenetic analysis input parameters and calculation statistics); -sms.csv (PhyML3.0 output smart model selection statistics); -nucleotide_alignment_GBA3_tree.txt (Phylogenetic tree file in txt format) 2- Sequences 2.1 Sequence information Nucleotide sequences obtained from NCBI nucleotide database or manually annotated using genomic information available in NCBI. All accession numbers of coding nucleotide sequences and genomic scaffolds used for manual gene annotation are available in supplementary table 5 of https://doi.org/10.1038/s41598-020-68106-y. 2.2 Naming rules Each sequence presents a header with the following information for coding sequences >3 letter code_ accession number or MA (manual annotation) For pseudogenes >3 letter code_PSEUDO Correspondence between 3 letter code and species name can be found in supplementary table 5 of https://doi.org/10.1038/s41598-020-68106-y. 3-Sequence alignment and Phylogenetic analysis All details are provided in the materials and methods of the publication. Coding nucleotide sequences were aligned in Geneious R7.1.9 using the translation align option (codon align) all remaning options were set to default. Poorly aligned codons were manually curated. Sequence alignment was inspected and columns contains 90% gaps were removed from the alignment. Premature stop codon in pseudogene and final stop codons were also removed. Final sequence alignment contained 97 sequences and a total of 1410 positions. Phylogenetic analysis was calculated in PhyML 3.0 (http://www.atgc-montpellier.fr/phyml/) using smart model selection option and branch support was calculated using aBayes algorithm. Phylogenetic tree for selection was calculated using RAxML as described in https://doi.org/10.1038/s41598-020-68106-y.
提供机构:
Home
创建时间:
2022-05-12
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作