The rate of amino acid divergence in Arabidopsis lyrata Plech population.
收藏Figshare2025-12-24 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/The_rate_of_amino_acid_divergence_in_Arabidopsis_lyrata_Plech_population_/29459012
下载链接
链接失效反馈官方服务:
资源简介:
We retrieved the Arabidopsis thaliana TAIR10 coding sequences (CDS) FASTA from the Joint Genome Institute (JGI) portal. For Arabidopsis lyrata NT1, we downloaded the reference genome FASTA and corresponding GTF annotation (Kolesnikova et al., 2013). We used a custom Python/GFFutils pipeline to extract and concatenate CDS exon features for each transcript directly from the GFF3 and genome FASTA, writing one CDS FASTA per transcript. We aligned each filtered CDS pair in codon space via a two-step MAFFT + PAL2NAL pipeline. First, protein translations were aligned with MAFFT v7.480 (–auto). Second, PAL2NAL v14 was used to back-translate to a two-sequence codon alignment in FASTA. Alignments were then filtered to retain only those with 100% coverage (no gaps in either sequence). Pairwise nonsynonymous (Ka) and synonymous (Ks) substitution rates were calculated on the codon alignments using KaKs_Calculator v2.0 with the Yang–Nielsen (YN00) method. We excluded any pairs for which Ka or Ks could not be estimated (e.g. no observed synonymous changes or saturated Ks) and any alignments yielding Ks
创建时间:
2025-12-24



