five

Alignments

收藏
DataONE2015-11-02 更新2024-06-27 收录
下载链接:
https://search.dataone.org/view/null
下载链接
链接失效反馈
官方服务:
资源简介:
We downloaded gene sequences of 11 of the 16 species from Ensembl release 75 (Flicek et al. 2014). After identifying orthologous loci using the EnsemblCompara pipeline (Vilella et al. 2009), we extracted the longest transcript coding sequence (CDS) of each gene. Sequences for cichlid species other than the Nile Tilapia, Oreochromis niloticus, were obtained from the deposited genome assemblies (Brawand et al. 2014; Elmer et al. 2014). Orthologous loci in cichlid genomes were identified by BLASTN searches using Nile Tilapia sequences as queries and an e-value threshold of 10 -4.The integrity of obtained sequences was double-checked by BLASTN searches against available transcriptomes to ensure correct reading frames. Multiple sequence alignments of individual genes were constructed with translatorX (Abascal et al. 2010): After translation, sequences are aligned at the amino acid level using MUSCLE (Edgar 2004), unreliable amino acid positions are removed with Gblocks (Castresana 2000) under the least stringent parameters, and the corresponding nucleotide alignments are created guided by trimmed amino acid alignments. Maximum-likelihood phylogenetic reconstruction was performed with PhyML version 3.1 (Guindon et al. 2010) after best-fit models of evolution were chosen based on AICc (Akaike information criterion with correction) scores (Hurvich and Tsai 1989) calculated in jModeltest version 2.1.5 (Posada 2008).
创建时间:
2015-11-02
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作