Supporting data for "Deepdefense: Annotation of immune systems in prokaryotes using DeepLearning"
收藏DataCite Commons2025-05-26 更新2025-04-15 收录
下载链接:
http://gigadb.org/dataset/102550
下载链接
链接失效反馈官方服务:
资源简介:
Due to a constant evolutionary arms race, archaea and bacteria have evolved an abundance and diversity of immune responses to protect themselves against phages. Since the discovery and application of CRISPR-Cas adaptive immune systems, numerous novel candidates for immune systems have been identified. Previous approaches to identifying these new immune systems rely on HMM-based homolog searches or use labor-intensive and costly wet-lab experiments. To aid in finding and classifying immune systems, we use machine learning to classify already known immune system proteins and discover potential candidates in the genome. Neural networks have shown promising results in classifying and predicting protein functionality in recent years. However, these approaches frequently rely on the assumption of a closed world, which is often different in practice. <br>In this work, we explore neural networks for immune protein classification and deal with different methods for rejecting unrelated proteins in a genome-wide search, and establish a benchmark. Then, we optimize our approach for accuracy. Based on this, we develop an algorithm called Deepdefense to predict immune cassette classes based on a genome. Finally, we test our approach for detecting immune systems in the genome against an HMM-based method. <br>Deepdefense can automatically detect genes and define cassette annotations and classifications using two model classifications. This is achieved by creating an optimized deep learning model to annotate immune systems, in combination with calibration methods, and a second model to enable the scanning of an entire genome
提供机构:
GigaScience Database
创建时间:
2024-07-22



