five

Scripts, data, and analyses for extending flavoprotein (FLX) domains C-terminally to form extended FLX domains (eFLX) & performing structural predictions & alignments using AlphaFold2, RoseTTAFold, FoldSeek, and DALI

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/10120639
下载链接
链接失效反馈
官方服务:
资源简介:
See associated manuscript for more details.  To capture unannotated regions of the FLX domains for structural modeling, an extended flavin oxygenase domain (eFLX) was annotated by extending the C-terminus of the G3DSA:3.50.50.60 InterProScan FLX annotation to the N-terminus of the downstream annotation, using bedtools closest [1] and awk. Next, the annotation was expanded by 50 residues both upstream and downstream to account for potential inaccuracy in the domain annotations. The peptide FASTA representation of these eFLX domains were extracted, and subjected to 3D modelling using RoseTTAFold via the Robetta server [2], and AlphaFold2 (ColabFold v1.5.2-patch: AlphaFold2 using MMseqs2) using the ColabFold Google Collaboratory notebooks [3], [4]. After modeling, the rank 1 model and model 1 model from AlphaFold2 and RoseTTAFold, respectively, were used for structural similarity searches with FoldSeek via the FoldSeek server [5]. The DALI webserver was used for the structural similarity search to establish structural homology [6].  [1]        A. R. Quinlan and I. M. Hall, "BEDTools: a flexible suite of utilities for comparing genomic features," Bioinformatics, vol. 26, no. 6, pp. 841–842, Mar. 2010, doi: 10.1093/bioinformatics/btq033. [2]        M. Baek et al., "Accurate prediction of protein structures and interactions using a three-track neural network," Science, Jul. 2021, doi: 10.1126/science.abj8754. [3]        M. Mirdita, K. Schütze, Y. Moriwaki, L. Heo, S. Ovchinnikov, and M. Steinegger, "ColabFold: making protein folding accessible to all," Nat. Methods, vol. 19, no. 6, Art. no. 6, Jun. 2022, doi: 10.1038/s41592-022-01488-1. [4]        J. Jumper et al., "Highly accurate protein structure prediction with AlphaFold," Nature, pp. 1–11, Jul. 2021, doi: 10.1038/s41586-021-03819-2. [5]        M. van Kempen et al., "Fast and accurate protein structure search with Foldseek," Nat. Biotechnol., pp. 1–4, May 2023, doi: 10.1038/s41587-023-01773-0. [6]        L. Holm, A. Laiho, P. Törönen, and M. Salgado, "DALI shines a light on remote homologs: One hundred discoveries," Protein Sci., vol. 32, no. 1, p. e4519, 2023, doi: 10.1002/pro.4519.
创建时间:
2024-01-19
二维码
社区交流群
二维码
科研交流群
商业服务