five

Adaptive protein evolution through length variation of short tandem repeats in Arabidopsis. Supplementary Materials.

收藏
Figshare2023-01-13 更新2026-04-08 收录
下载链接:
https://figshare.com/articles/dataset/Adaptive_protein_evolution_through_length_variation_of_short_tandem_repeats_in_Arabidopsis_Supplementary_Materials_/16708561/1
下载链接
链接失效反馈
官方服务:
资源简介:
Additional materials linked to submitted paper "Adaptive protein evolution through length variation of short tandem repeats in Arabidopsis" <br> In this repository, there are data necessary to run the scripts provided in the Jupyter Notebook 'AdaptiveProteinEvolutionNotebook.ipynb' <br> Additional File 1: HipSTR variant calling of Arabidopsis accessions formatted as a zipped zarr folder. <br> Additional File 2: A python dictionary containing a map of STRs IDs and the amino acid tract the sites encode (at least four repeated amino acids required). <br> Additional File 3: The version of the TAIR10 peptide sequences used in the manuscript (FASTA text file). <br> Additional File 4: Map of STR coordinates to coordinates in the protein sequence (TSV). <br> Additional File 5: STR IDs in different IDR regions (TSV). <br> Additional File 6: Disorder predictions from d2p2 (BED). <br> Additional File 7: Disorder predictions, protein binding from DisoRDPbind (BED). <br> Additional File 8: Disorder predictions, DNA binding from DisoRDPbind (BED). <br> Additional File 9: Disorder predictions, RNA binding from DisoRDPbind (BED). <br> Additional File 10: Disorder predictions, MoRF regions from fMoRFpred (BED). <br> Additional File 11: Disorder predictions, ANCHOR2 regions from IUpred2a (BED). <br> Additional File 12: Protein coding STR unit counts, 770 accessions (TSV). <br> Additional File 13: STR lengts vs. gene expression LIMIX model results (TSV) (from ref 9. in the manuscript). <br> Additional File 14: Metadata on accessions (SRA run table) (TSV). <br> Additional File 15: BCFtools variant calling of Arabidopsis genes after mapping with BWA-MEM. <br> Additional File 16. Lists of ecotypes and group/subpopulation-designations (TSV). <br> Additional File 17. Variance-covariance matrix of the 770 accessions (TSV). <br> Additional File 18. Scaled mean environmental values per group/subpopulation (TSV). <br> Additional File 19: Bayenv2.0 results of the second poly-Q tract in TCP14 (TSV). <br> Additional File 20: Zipped OLS results folder. <br> Additional File 21: Bayenv2.0 results, all protein coding STRs. <br> Additional File 22: FST results, all protein coding STRs. <br> Additional File 23: Environmental variables retrieved for the Arabidopsis accessions from data produced by ref 17 (CSV). <br> Additional File 24: Alelle counts, most common allele and less common alleles (combined if multiallelic) for STRs used in Bayenv2.0. <br>
提供机构:
Reinar, William; Tørresen, Ole K.; Reitan, Trond; Jentoft, Sissel; Stø, Ida; Jakobsen, Kjetill S; Butenko, Melinka A.; Greulich, Anne; Knutsen, Jonfinn B.
创建时间:
2023-01-13
二维码
社区交流群
二维码
科研交流群
商业服务